-
Notifications
You must be signed in to change notification settings - Fork 31
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Handle batch processing when few files fails in the whole batch #50
Comments
Here's the final version that I ended up incorporating into my latest release, to avoid the issue, but would still be very interested in knowing if there's a way to address a single file to cause the entire batch processing of multiple files to fail... https://github.com/BBC-Esq/WhisperS2T-transcriber/releases/tag/v1.1.0 |
Hey @BBC-Esq ! I think there can be a simple fix for this. I will add the fix in next release. PS: I'm slightly stuffed with my office work. Expect some delay in the next release (end of march probably). PPS: Next release will also include end-to-end deployment ready server for WhisperS2T !! |
Do you have time to continue to work on this repository? Ctranslate2 just implemented flash attention BTW. |
When my script batch processes a bunch of audio files using the approach you gave me to use a list of files and their settings when processing, if a single file fails for any reason, it prevents the transcriptions of all files' transcriptions from being done? I created a workaround to process each file to the
transcribe_with_vad
method (each using its own tqdm) and added error handling, which works. I was wondering if there's a way to make it so I can use your most efficient approach and still have error handling for a specific audio file? Here is the original script and a comparison with the single audio file processing with error handling:The text was updated successfully, but these errors were encountered: