Create Recipes for common use case #410

dennyabrain · 2024-10-15T05:12:56Z

Recipes are supposed to be end to end examples of using Feluda for a particular use case. While feluda itself should be easy to use and configured by an experienced python/ML engineer, these recipes would provide easy to copy-paste examples of using feluda for specific use cases. As part of this issue, we should

Identify commonly useful recipes
Write example recipes
discover and fix bugs that we discover along the way
publish them in the wiki/website

dennyabrain · 2024-10-17T15:46:43Z

Examples of commonly useful recipes would be

Index a collection of images and search similar images in them
Index a collection of videos and search for similar videos through them
Index a collection of audios and search for similar audios through them
Process a collection of videos and cluster them into predefined categories
Process a collection of newspaper clippings and search through the text in them

plon-Susk7 · 2024-10-18T07:10:30Z

Hi @dennyabrain , should we start with the examples provided by you first?

dennyabrain · 2024-10-18T07:54:42Z

Yes @plon-Susk7 let's do one of them. Video is interesting and sufficiently complex. Should we do the recipes related to videos?
So 1. Demonstrating the use of feluda to index and search through videos and 2. Demonstrating the use of feluda to cluster a collection of videos into groups.

@aatmanvaidya might be able to direct you better to which operaters to look at and any relevant documentation.

plon-Susk7 · 2024-10-18T08:23:12Z

Yeah let's go with video first. I'll fetch additional details from @aatmanvaidya .

plon-Susk7 · 2024-10-25T05:43:03Z

Steps to run notebook:

Install jupyter lab inside virtual environment first (venv).

pip install jupyter lab

After installation of jupyter lab, deactivate venv and run the following command.

jupyter lab --ip 0.0.0.0 --port 8888 --no-browser --allow-root --NotebookApp.token=''

You can run notebook by navigating to http://localhost:8888 on your browser.

dennyabrain · 2024-10-26T04:27:10Z

@aatmanvaidya @plon-Susk7 something to think about, should we considering creating collab notebooks as well? I have two reasons to support this :

It will provide journalists and non tech folks a cloud environment to use feluda in without worrying about installing python and feluda on their machine
since collab integrates well with google drive, they could mount data from their own drives.

2 is useful because a lot of people are familiar with using google drive and create their little personal "datasets" on it all the time. So being able to process their own data using our notebooks would enhance feluda's usability for them.

aatmanvaidya · 2024-10-26T05:02:16Z

@dennyabrain even I was thinking about this

but currently, there could be some limitations

we will only be able to use operators in google colab.
- this is not bad, because clustering, t-SNE, extraction of text from image and lot of useful things can still be done. Only store and search won't work
elastic search won't work there -- we could think of using other vector databases from langchain etc, but that's a different discussion.
since feluda is not a python package yet, we will have to clone the entire repo on the google coalb (this is not a big deal), but this means that whatever operator a journalist/not tech person would want to use, they would have to manually install there, and other dependencies that could come with it like ffmpeg, tessarct-ocr etc

I think we should definitely have examples on google colab, as its just becomes one-click for someone to replicate and use feluda -- they don't have to worry about setting up docker etc.

But I feel, through the work Priyash has done so far on writing example notebooks, we should first finalise the public API and then move towards examples on colab.
What do you think Denny?

dennyabrain · 2024-10-26T16:20:06Z

@aatmanvaidya point taken about the need to publish the library first and to finalize the API. Was getting ahead of myself. Lets do collab later then.

This was referenced Oct 15, 2024

Release Feluda 1.0 #401

Open

Finalize Public API #409

Open

dennyabrain added the level:feature label Oct 17, 2024

dennyabrain mentioned this issue Oct 17, 2024

Video Operator should process video of any length and size #323

Open

dennyabrain assigned aatmanvaidya Oct 17, 2024

This was referenced Oct 20, 2024

Recipe for searching similar videos #412

Merged

Recipe for searching similar audios #417

Merged

Recipe for embedding reduction [Videos] #418

Merged

This was referenced Oct 25, 2024

Recipe for creating clusters [Videos] #419

Merged

Recipe for classifying videos into predefined category #420

Merged

plon-Susk7 mentioned this issue Oct 26, 2024

Recipe for processing newspaper and searching text through them. #424

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create Recipes for common use case #410

Create Recipes for common use case #410

dennyabrain commented Oct 15, 2024 •

edited

Loading

dennyabrain commented Oct 17, 2024

plon-Susk7 commented Oct 18, 2024

dennyabrain commented Oct 18, 2024

plon-Susk7 commented Oct 18, 2024

plon-Susk7 commented Oct 25, 2024 •

edited

Loading

dennyabrain commented Oct 26, 2024

aatmanvaidya commented Oct 26, 2024

dennyabrain commented Oct 26, 2024

Create Recipes for common use case #410

Create Recipes for common use case #410

Comments

dennyabrain commented Oct 15, 2024 • edited Loading

dennyabrain commented Oct 17, 2024

plon-Susk7 commented Oct 18, 2024

dennyabrain commented Oct 18, 2024

plon-Susk7 commented Oct 18, 2024

plon-Susk7 commented Oct 25, 2024 • edited Loading

dennyabrain commented Oct 26, 2024

aatmanvaidya commented Oct 26, 2024

dennyabrain commented Oct 26, 2024

dennyabrain commented Oct 15, 2024 •

edited

Loading

plon-Susk7 commented Oct 25, 2024 •

edited

Loading