Update feature_extractor.py #1038

BBC-Esq · 2024-10-05T00:44:43Z

Added mel filter bank caching to FeatureExtractor class to optimize memory usage and reduce computational overhead when processing multiple audio files with identical parameters, particularly beneficial for batch processing scenarios.

abodacs

@BBC-Esq
Thanks for your work

abodacs · 2024-10-26T21:44:49Z

faster_whisper/feature_extractor.py

        if padding:
-            waveform = torch.nn.functional.pad(waveform, (0, self.n_samples))
-
-        window = torch.hann_window(self.n_fft).to(waveform.device)


Why was the hann_window deleted?

I'll take a look...

This is actually an optimization. In the old version, a new Hann window was being created and moved to the device every time call was executed. The new version creates it once during initialization and caches it as an instance variable (self.window).

abodacs · 2024-10-26T21:47:42Z

faster_whisper/feature_extractor.py

-            else waveform
-        )
+        # Move waveform to the target device if necessary
+        if self.device == "cuda" and not waveform.is_cuda:


Improved readability, thank you.

MahmoudAshraf97 · 2024-11-03T11:30:01Z

Hello, the way you implemented caching is not going to work because the only parameter that might change when creating the mel filters is n_mels, to change this parameter you need to initialize a new FeatureExtractor instance which will clear cache thus invalidating its purpose. Anyways, creating window takes around 6us which represents less than 0.05% of the execution time and the mel filters is created only once thus there is no need for caching.

In the future, it's preferable to use a caching decorator such as functools.lru_cache as an easy caching solution instead of implementing it as a dictionary

BBC-Esq added 2 commits October 4, 2024 20:44

Update feature_extractor.py

9c5975c

Merge branch 'SYSTRAN:master' into feature_extractor.py

a31b95e

abodacs suggested changes Oct 26, 2024

View reviewed changes

BBC-Esq added 6 commits October 26, 2024 19:21

Update feature_extractor.py

c5d6c61

Update feature_extractor.py

3144fe9

Update feature_extractor.py

16ffd5d

Update feature_extractor.py

6f463a0

Update feature_extractor.py

a53259a

Update feature_extractor.py

aae510c

BBC-Esq closed this Nov 3, 2024

BBC-Esq deleted the feature_extractor.py branch November 3, 2024 18:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update feature_extractor.py #1038

Update feature_extractor.py #1038

BBC-Esq commented Oct 5, 2024 •

edited

Loading

abodacs left a comment

abodacs Oct 26, 2024

BBC-Esq Oct 26, 2024

BBC-Esq Oct 26, 2024

abodacs Oct 26, 2024

MahmoudAshraf97 commented Nov 3, 2024 •

edited

Loading

Update feature_extractor.py #1038

Update feature_extractor.py #1038

Conversation

BBC-Esq commented Oct 5, 2024 • edited Loading

abodacs left a comment

Choose a reason for hiding this comment

abodacs Oct 26, 2024

Choose a reason for hiding this comment

BBC-Esq Oct 26, 2024

Choose a reason for hiding this comment

BBC-Esq Oct 26, 2024

Choose a reason for hiding this comment

abodacs Oct 26, 2024

Choose a reason for hiding this comment

MahmoudAshraf97 commented Nov 3, 2024 • edited Loading

BBC-Esq commented Oct 5, 2024 •

edited

Loading

MahmoudAshraf97 commented Nov 3, 2024 •

edited

Loading