aldrinjenson committed Feb 28, 2024
2 parents 27e05df + 6cd58c6 commit 366ffe7
Showing 1 changed file with 19 additions and 2 deletions.
21 changes: 19 additions & 2 deletions README.md
@@ -215,9 +215,17 @@ Smaller chunks get very little context and because of this our model is sometime

![image](https://github.com/kurianbenoy/Indic-Subtitler/assets/24592806/d84082d5-6d1c-4ce5-8394-d749fbf6c8f7)

- Include more model families like faster-whisper, whisperX etc.
- Include more model families like faster-whisper, whisperX, vegam-Malayalam-whisper etc.

![image](https://github.com/kurianbenoy/Indic-Subtitler/assets/24592806/57152204-c7df-4a0f-9cf7-105c8a60b666)

![image](https://github.com/kurianbenoy/Indic-Subtitler/assets/24592806/77515f76-047a-4808-9c9f-67e838c29875)


- Evaluate the performance of models in Indic subtitler on custom videos.

Progress so far: ground truth has been added for the English audio samples.
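
Once ground truth exists for a clip, one common way to score a model's output against it is word error rate (WER). The README does not say which metric the project uses, so the sketch below is only an illustration under that assumption; the function name `word_error_rate` and the sample sentences are hypothetical and not taken from the repository.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return the word error rate of `hypothesis` against `reference`.

    WER = (substitutions + deletions + insertions) / number of reference words,
    computed here with a word-level Levenshtein edit distance.
    """
    ref_words = reference.lower().split()
    hyp_words = hypothesis.lower().split()
    if not ref_words:
        raise ValueError("reference transcript must not be empty")

    # prev[j] holds the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]  # distance from i reference words to an empty hypothesis
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,                      # deletion
                    curr[j - 1] + 1,                  # insertion
                    prev[j - 1] + substitution_cost,  # match or substitution
                )
            )
        prev = curr
    return prev[-1] / len(ref_words)


if __name__ == "__main__":
    # Hypothetical ground truth and model output for a single clip.
    ground_truth = "the cat sat on the mat"
    model_output = "the cat sat on mat"
    print(f"WER: {word_error_rate(ground_truth, model_output):.3f}")
```

This version only lowercases and splits on whitespace. A real evaluation would probably also normalize punctuation and, for Indic languages, Unicode forms before comparing.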

##### A few extra approaches to consider:

- Improving the results of SeamlessM4T with GPT models.
@@ -230,9 +238,18 @@ Smaller chunks get very little context and because of this our model is sometime

<br>

**Week 4 onwards 🌕**
**Week 4 🌕**

- Evaluate the performance of Indic subtitler on various languages
- Audio quality enhancement with Demux

https://github.com/kurianbenoy/Indic-Subtitler/issues/4

- Information page about the best set of models and when to use each.


**Week 5 onwards 🌕**

- Fine-tune ASR models for each language based on their measured performance, and also integrate Whisper-based audio models.
- Build a desktop app, similar to the web app, that exposes all the functionality.
