-
Notifications
You must be signed in to change notification settings - Fork 176
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Whisper transcription app big performance regression #712
Comments
A one hour podcast used to take ~1 minute, so big drop in performance. |
I think first thing to do is to replace the use of NFS |
would be great if the official example uses WhisperX, it can transcribe one hour podcast in 1 minute using only 1 container(or more specific, 1 graphi card with 16G vram, using |
made a poc repo here https://github.com/ahxxm/serverless-audio-transcriber |
I've been using my own dogfood for a while, this is how it looks like with A10G The audio files I sent range from 30 minutes to 70 minutes, and I scheduled a 5 minute interval This translates to around $0.01~$0.02 per hour of transcription, wonder how does the current official approch look like? Before and after regression, will it be cheaper and faster? |
Thanks @ahxxm this is awesome, especially the bit where you've listed benchmarks. It looks like Runpod is the cheapest? I'd bet that the |
Yeah A4500 is a new, performant(for Whisper) and cheap(Modal T4 price) one, would be great if Modal also supports that. I think it's comparable but not a completely fair comparison, Runpod has FlashBoot that loads weights within 1 second instead of 20+ seconds, so I added a bit more CPU and memory to Modal codes |
@thundergolfer @ahxxm hello, thank your for this very informative thread. I would like to ask how could I use my own model using modal, how can i send my file to the container image? |
@gongouveia I doubt this is relevant to the issue, but are you asking about how to send custom model/weights into Docker image, or send the file you want to transcribe(see all examples in this repo or mine then)? |
https://modal-com.slack.com/archives/C069RAH7X4M/p1713624663717089
The text was updated successfully, but these errors were encountered: