
Model Caching #5

Draft · wants to merge 9 commits into base: frontend_demo
Conversation

CalebCourier (Contributor)

This PR demonstrates how we can properly cache local models during post-install for quick recall in a containerized environment (e.g., Docker).

Currently it saves to a relative directory at the root of wherever the validator is installed, which isn't ideal. We will need to figure out which directories we can use, both locally and in a Docker environment, that will not require additional permissions (which is the reason we're not writing to something like .cache).
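One way to pick a writable location in both environments is to honor an explicit override first and fall back to the system temp directory, which is usually writable inside containers without extra permissions. A minimal sketch, not part of this PR (the MODEL_CACHE_DIR variable and resolve_cache_dir helper are hypothetical names):

```python
import os
import tempfile
from pathlib import Path

def resolve_cache_dir(app_name: str = "validator-models") -> Path:
    """Pick a writable cache directory for model weights.

    Order: a MODEL_CACHE_DIR env var override, then XDG_CACHE_HOME
    if set, then a per-app directory under the system temp dir,
    which is writable in most containers without extra permissions.
    """
    for candidate in (
        os.environ.get("MODEL_CACHE_DIR"),
        os.environ.get("XDG_CACHE_HOME"),
    ):
        if candidate:
            path = Path(candidate) / app_name
            try:
                path.mkdir(parents=True, exist_ok=True)
                return path
            except OSError:
                continue  # not writable here; try the next option
    path = Path(tempfile.gettempdir()) / app_name
    path.mkdir(parents=True, exist_ok=True)
    return path
```

The env-var override also gives a Dockerfile a single knob (`ENV MODEL_CACHE_DIR=...`) to point the cache at a mounted volume.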

Besides caching the model weights for quick retrieval, we also introduce a singleton pattern for initializing the model instance.
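The singleton could look something like the following sketch. The class name matches the diff snippet further down, but the loader body here is a stand-in so the example runs on its own; the real implementation would load the cached sentence-transformer weights:

```python
class DefaultEncodingModel:
    """Singleton: the (expensive) model is constructed once per process."""

    _instance = None

    def __new__(cls):
        if cls._instance is None:
            cls._instance = super().__new__(cls)
        return cls._instance

    def __init__(self):
        # __init__ runs on every instantiation, so guard the load.
        if not hasattr(self, "_model"):
            self._model = self._load_model()

    def _load_model(self):
        # Stand-in for loading cached sentence-transformer weights;
        # returns a toy callable so this sketch is self-contained.
        return lambda sources: [[float(len(s))] for s in sources]

    def encode(self, sources):
        return self._model(sources)
```

Every call to `DefaultEncodingModel()` then returns the same object, so repeated `DefaultEncodingModel().encode(...)` calls never reload the weights.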

To test and demonstrate the above, we switch the default encoding function back to the sentence-transformer model.

Comment on lines +324 to +328
def st_embed_function(sources: list[str]):
print("Running st_embed_function...")
return DefaultEncodingModel().encode(sources)

embed_function = st_embed_function
CalebCourier (Contributor, Author)

We probably don't need st_embed_function as a wrapper anymore; we should be able to just pass DefaultEncodingModel().encode directly.
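A sketch of that suggestion, with a stand-in encode method so the example is self-contained:

```python
class DefaultEncodingModel:
    # Stand-in; the real class wraps the cached sentence-transformer model.
    def encode(self, sources: list[str]) -> list[str]:
        return [s.upper() for s in sources]

# Pass the bound method directly instead of going through a wrapper:
embed_function = DefaultEncodingModel().encode
```

A bound method keeps a reference to its instance, so this works even though no wrapper function holds the model.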
