Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Slow Faster-Whisper #45

Open
Deep-unlearning opened this issue Nov 26, 2024 · 4 comments
Open

Slow Faster-Whisper #45

Deep-unlearning opened this issue Nov 26, 2024 · 4 comments

Comments

@Deep-unlearning
Copy link

Hello,

I attempted to run evaluations for Faster-Whisper from https://github.com/huggingface/open_asr_leaderboard/tree/main/ctranslate2

However, I observed that it was significantly slower than the original whispers.

This is what I got for tiny.en: hf-audio-esb-datasets-test-only-sorted_ami_test: WER: 23.5 % RTFx: 52.97

vs WER: 24.24 % RTFx: 214.27

Are there known issues causing slower runtime?

Also, I noticed that the evals for Faster-Whisper is with a batch_size=1, is this intentional ?

Note: I noticed that the transcribe function uses a default beam_size=5. Even after changing it to beam_size=1, it remained slower than the original Whisper.

Thanks !

@yuekaizhang
Copy link
Contributor

@Deep-unlearning
Copy link
Author

@Deep-unlearning FYI, I had a batched faster whisper here https://github.com/yuekaizhang/open_asr_leaderboard/blob/sherpa/tensorrtllm/run_faster_whisper.sh.

I will look into that, thanks !

@yuekaizhang
Copy link
Contributor

However, the faster whisper used a vad internally, you may implement a chunked long form algorithm to do an apple-to-apple comparsion.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants