
ClassyCat in Presto: CV2 4789 #97

Merged: 48 commits merged into master on Jul 25, 2024
Conversation

ashkankzme (Contributor)

Description

This PR implements the three main ClassyCat endpoints in Presto: schema_create, schema_lookup, and classify.

Reference: CV2-4789
https://meedan.atlassian.net/jira/software/c/projects/CV2/issues/CV2-4789

How has this been tested?

Has it been tested locally? Yes
Are there automated tests? Yes, unit tests under test_classycat.py

Are there any external dependencies?

Are there changes required in sysops terraform for this feature or fix?

  • openai dependency added to requirements.txt
  • new env vars added to .env_file (see the config sketch below)
  • new SQS queues needed for serving ClassyCat requests
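
For reference, a minimal sketch of reading the new settings at startup (not the PR's actual code); the variable names come from this PR's discussion and .env_file, while the fallback defaults are assumptions for illustration:

import os

# Variable names are taken from this PR; the defaults below are illustrative
# assumptions, not the values configured in any environment.
CLASSYCAT_BATCH_SIZE_LIMIT = int(os.environ.get("CLASSYCAT_BATCH_SIZE_LIMIT", "25"))
CLASSYCAT_OUTPUT_BUCKET = os.environ.get("CLASSYCAT_OUTPUT_BUCKET", "litterbox-dev")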

Have you considered secure coding practices when writing this code?

Please list any security concerns that may be relevant.
N/A

"callback_url": "http://host.docker.internal:9888"

It looks like tests are failing at an earlier stage right now around getting CLASSYCAT_BATCH_SIZE_LIMIT, but I suspect the tests will also fail with a timeout from http://host.docker.internal:9888 on Linux. This has been my experience.

If you are using Docker-for-mac or Docker-for-Windows 18.03+, connect to your mysql service using the host host.docker.internal (instead of the 127.0.0.1 in your connection string).

If you are using Docker-for-Linux 20.10.0+, you can also use the host host.docker.internal if you started your Docker container with the --add-host host.docker.internal:host-gateway option, or added the following snippet in your docker-compose.yml file:

extra_hosts:
    - "host.docker.internal:host-gateway"

More at https://stackoverflow.com/a/24326540

Contributor Author (ashkankzme):

thanks for flagging, Scott.

I think the tests failing because of CLASSYCAT_BATCH_SIZE_LIMIT (or other env vars) is due to these new environment variables not being set correctly in the test cases. I will look into what the issue is and try to figure it out.

re: callbacks not working -- I think that makes sense and is expected. The address mentioned there is a local server I spun up to catch the callbacks from Presto. I did not commit the server with the PR code (happy to do so if helpful), so it doesn't exist to receive the results.
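
For context, a minimal version of such a catch-all callback server might look roughly like this (illustrative only; it is not the actual uncommitted script):

import json
from http.server import BaseHTTPRequestHandler, HTTPServer

class CallbackHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Print whatever Presto posts back so the callback payload can be inspected.
        length = int(self.headers.get('Content-Length', 0))
        body = self.rfile.read(length)
        print(json.dumps(json.loads(body or b'{}'), indent=2))
        self.send_response(200)
        self.end_headers()

if __name__ == '__main__':
    # 9888 is the port from the callback_url used in the test payload above.
    HTTPServer(('0.0.0.0', 9888), CallbackHandler).serve_forever()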

I'm now realizing that the other tests have been mocking the callback functionality, and I have not done so for ClassyCat. I will do that soon and update the PR; it should fix the timeout error.

Collaborator:

Yes, my personal preference as well is mocking callback functionality for intra-container tests
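
For illustration, a self-contained sketch of that mocking pattern; post_callback below is a hypothetical stand-in for whatever helper actually POSTs results to the callback_url:

from unittest import TestCase
from unittest.mock import patch
import requests

def post_callback(callback_url, message):
    # Hypothetical stand-in for the helper that posts results back to the caller.
    return requests.post(callback_url, json=message, timeout=30)

class TestCallbackIsMocked(TestCase):
    @patch(f'{__name__}.post_callback')
    def test_no_real_http_round_trip(self, mock_post):
        mock_post.return_value = None
        # Exercise the code path that would normally hit the callback_url.
        post_callback("http://host.docker.internal:9888", {"ok": True})
        mock_post.assert_called_once()

The test then asserts against the mock instead of waiting on a real HTTP round trip to host.docker.internal.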

skyemeedan (Contributor) left a comment:

Looks great, awesome how compact it is!

  • I'm still not clear how a consumer of the model will get the schemas uploaded and deployed (the tests assume the consumer has a local copy of the schema text to submit before doing classification)
  • does it need some docker-compose.yml entries to be able to spin up the model locally?
  • I think it needs more examples and validation of input and output data structures to document what needs to be submitted (see the illustrative sketch below)
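
As a purely illustrative example of the kind of input documentation being asked for, a classify message might look roughly like the sketch below; every field name and value here is an assumption for discussion, not the actual ClassyCat contract:

# Hypothetical classify request, for discussion only; field names are assumed.
example_classify_message = {
    "model_name": "classycat",                 # assumed routing value
    "body": {
        "id": "batch-001",
        "parameters": {
            "event_type": "classify",          # classify, schema_create, or schema_lookup
            "schema_id": "<existing schema id>",
            "items": [{"id": "item-1", "text": "some content to classify"}],
        },
        "callback_url": "http://host.docker.internal:9888",
    },
}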

lib/s3.py (outdated diff)
# Extract the file name from the local file path
try:
    s3_client.head_bucket(Bucket=bucket)
except ClientError as e:
    if e.response['Error']['Code'] == 'NoSuchBucket' or int(e.response['Error']['Code']) in [403, 404]:
        # Create the bucket since it does not exist
        s3_client.create_bucket(Bucket=bucket)
        logger.info(f'Created bucket {bucket} in MinIO.')  # reviewer: is this accurate? i.e. is this code only for MinIO?
Contributor:

I guess in S3 the assumption is the bucket already exists?

Contributor Author (ashkankzme):

This is code that I only refactored; it existed prior to ClassyCat.
I think what could change here is the logging message: depending on which environment we are operating in, we could be making buckets in either MinIO or S3, so the code is not limited to MinIO only.

You could assume that buckets already exist in S3, but if that assumption doesn't hold, it may cause confusion when this code runs in production.

not a big deal though

Collaborator:

I believe the default pattern we established was to create the bucket locally but not in deployed environments, and that the create_bucket would fail because of permissions errors. Tagging @sonoransun to confirm/deny

Contributor Author (ashkankzme):

I just updated the log message to be a bit clearer (feel free to push back) -- it now says we are creating the bucket in S3 (or MinIO, if running locally).
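
Roughly, the updated line reads along these lines (paraphrased from the comment above; the exact wording is in lib/s3.py):

logger.info(f'Created bucket {bucket} in S3 (or MinIO, if running locally).')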

CACHE_DEFAULT_TTL=86400

CLASSYCAT_OUTPUT_BUCKET="litterbox-dev"
Contributor:

lol, I've been thinking "why is the output bucket called 'litterbox'? oh", but it seems a bit cryptic when looking at a list of S3 buckets. Would classycat-qa / classycat-live work?

Contributor Author (ashkankzme):

haha, sure we can change the bucket naming

Collaborator:

+1 for keeping it litterbox, lol

Contributor:

classycatlitter-qa?

lib/model/classycat_schema_create.py (resolved review threads)
    elif event_type == 'schema_lookup' or event_type == 'schema_create':
        result_instance = ClassyCatSchemaResponse(**result_data)
    else:
        result_instance = ClassyCatResponse(**result_data)
elif 'video' in model_name:
    result_instance = VideoResponse(**result_data)
else:
Contributor:

I think this else should be an elif, and it should raise an error if the model name is unknown (otherwise it will silently fail on a typo?)

Contributor Author (ashkankzme):

I don't think it will fail silently as is, since we check the event type when processing the request later; please see classycat.py/process().

I did this so I can respond with the same error structure as other ClassyCat requests; otherwise you may not receive an error directly from Presto, or it may arrive in a different format.

Contributor:

oh, I was wrong. I understand now that this is just message validation; I thought it was parsing incoming messages. I lost a couple of hours debugging because I was sending messages with the model 'yake' instead of 'yake_keywords' and it was just silently accepted... but this isn't the place to enforce that.

Collaborator:

hm, for what it's worth @skyemeedan, we could raise an error if the model name is unknown to this function...
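
For discussion, a sketch of what that could look like in this dispatch helper; the function shape and the outer 'classycat' check are assumed from the quoted hunk, the response classes are the ones already in the diff, and the final raise is the suggested behaviour rather than what this PR implements:

def build_result_instance(model_name, event_type, result_data):
    # Sketch only; assumes ClassyCatSchemaResponse, ClassyCatResponse and
    # VideoResponse are importable as in the quoted code.
    if 'classycat' in model_name:
        if event_type in ('schema_lookup', 'schema_create'):
            return ClassyCatSchemaResponse(**result_data)
        return ClassyCatResponse(**result_data)
    elif 'video' in model_name:
        return VideoResponse(**result_data)
    # Suggested: fail loudly instead of silently accepting a typoed model name.
    raise ValueError(f"Unknown Presto model name: {model_name}")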

Contributor Author (ashkankzme):

@DGaffney we do respond with an error to the caller in this case, it's just handled in the process function instead of here. happy to discuss further!

Just to clarify, @ashkankzme , I believe Devin was referring to a Presto model (YAKE, Video, Audio, etc.) and I believe you're referring to an LLM model. I think that's a good idea to respond with an error if the Presto model is unknown. That, however, can be a separate ticket / PR

Contributor Author (ashkankzme):

oh I see what you mean, my bad. I do think this should be a separate ticket since we would want to test all/different presto models before making such changes. I will create a ticket for it.

lib/model/classycat_classify.py (resolved review threads)
    max_tokens=(max_tokens_per_item * items_count) + 15,
    temperature=0.5
)

Contributor:

Can we check here for some of the OpenRouter-specific errors (like running out of tokens) so the model can error out instead of passing the failure downstream?

Contributor Author (ashkankzme):

trying to better understand the scope, can you please explain a bit more which cases this would cover, and what the end goal would be?

Contributor:

Currently timpani sees errors like 'not enough credits', but doesn't know what that means, so it is going to keep sending additional requests. I think if ClassyCat is unable to do any processing, it should report that as an exception, and perhaps crash/go offline to signal for help?

Contributor Author (ashkankzme):

I see. I agree that it is helpful for the consumer of this service to distinguish between errors where it is OK to retry and service downtime where it is not. I'll try to incorporate this into the code!
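
A minimal sketch of one way to make that distinction around the completion call, assuming the openai 1.x client; NonRetryableClassyCatError is a hypothetical name, and mapping 'not enough credits' to HTTP 402 is an assumption about OpenRouter's behaviour:

import openai

class NonRetryableClassyCatError(Exception):
    """Signals callers (e.g. timpani) that retrying will not help."""

def create_completion_or_fail(client, **completion_kwargs):
    try:
        return client.chat.completions.create(**completion_kwargs)
    except openai.RateLimitError:
        # 429: transient; safe for the consumer to retry after a backoff.
        raise
    except openai.APIStatusError as e:
        if e.status_code == 402:
            # Assumed OpenRouter behaviour: 402 when the account is out of credits.
            raise NonRetryableClassyCatError("insufficient OpenRouter credits") from e
        raise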

Let's defer this until the PR for consistent error/status output and error handling is in place (this is in separate work)

computermacgyver left a comment:

Looks great. Let's get this merged and on QA. I want to make sure we do a full test of video similarity on QA, given this touches S3. There are also a few follow-up tickets we should put in the backlog (e.g., Presto should respond with an error if the requested Presto model is unknown, logging metrics for ClassyCat, etc.). Can you please file those?

adding logging to mark where future tickets will be implemented

Co-authored-by: Skye Bender-deMoll <[email protected]>
ashkankzme merged commit 589d144 into master on Jul 25, 2024
2 checks passed