You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am attempting to run whisperx with word-level timestamps, but despite passing the relevant option, the output is of the form {'segments': [ ... ], 'language': 'en'} with no word_segments.
I dug around a bit but could not find out why this is happening. I have confirmed that model.options.word_timestampts is True, so I believe it is an internal issue with model.transcribe, and perhaps the options are not properly being wrapped to faster-whisper.
I am attempting to run whisperx with word-level timestamps, but despite passing the relevant option, the output is of the form
{'segments': [ ... ], 'language': 'en'}
with noword_segments
.I dug around a bit but could not find out why this is happening. I have confirmed that
model.options.word_timestampts
is True, so I believe it is an internal issue withmodel.transcribe
, and perhaps the options are not properly being wrapped to faster-whisper.My code:
It should be noted that running via command line works fine:
whisperx \ --model large-v2 \ --compute_type int8 \ --output_format json \ --suppress_numerals \ --task transcribe \ --language en \ $input_file
This properly includes
word_segments
in the json output.The text was updated successfully, but these errors were encountered: