Fix for small segments #57

Pranjalya · 2024-04-05T11:36:20Z

Patch

Fix for small segments, when the audio duration is less than max_seg_len
Fallback for generate_segment_batched in case the seq_len and seq_metadata is not provided

Add tensorrt backend

Create updates

BBC-Esq · 2024-05-25T02:38:31Z

I like it!

Sembiance · 2024-06-12T16:03:32Z

Great fix, without it WhisperS2T is useless for small duration audio.

HIGHLY recommend merging this pull request :)

shashikg · 2024-07-06T05:38:00Z

Hi @Pranjalya @Sembiance !
Can you describe here or link an issue related to small duration audio?

Pranjalya · 2024-09-03T01:15:20Z

Hey @shashikg, the issue was in the loop where we segment audio into parts and the case where the original audio's duration is < 1s. Using the range function and setting the end timestamp as int(audio_duration) will lead it to it being 0, which when used on range returns an empty list. Using a math.ceil function ensures that it is rounded up to the next ceiling integer and the audio segment timestamp is logged.
This bug is potentially dangerous as well if someone is using indexing to map the audio segments, as it leads to missing of the parts.

Pranjalya added 4 commits December 28, 2023 06:35

🐰 fix breaking code

514b1cd

Merge pull request #2 from shashikg/main

7fbc846

Add tensorrt backend

Merge pull request #3 from shashikg/main

22d5cbd

Create updates

🐶 patch for small segment file

05c26eb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for small segments #57

Fix for small segments #57

Pranjalya commented Apr 5, 2024

BBC-Esq commented May 25, 2024

Sembiance commented Jun 12, 2024

shashikg commented Jul 6, 2024

Pranjalya commented Sep 3, 2024

Fix for small segments #57

Are you sure you want to change the base?

Fix for small segments #57

Conversation

Pranjalya commented Apr 5, 2024

BBC-Esq commented May 25, 2024

Sembiance commented Jun 12, 2024

shashikg commented Jul 6, 2024

Pranjalya commented Sep 3, 2024