Skip to content

Commit

Permalink
Add animated diarization pipeline diagram
Browse files Browse the repository at this point in the history
  • Loading branch information
juanmc2005 committed Nov 15, 2023
1 parent c88228f commit 84dc122
Show file tree
Hide file tree
Showing 2 changed files with 13 additions and 4 deletions.
17 changes: 13 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -55,12 +55,21 @@

## ⚡ Quick introduction

Diart is a python framework to build AI-powered real-time audio applications. With diart you can
create your own AI pipeline, benchmark it, tune its hyper-parameters, and even serve it on the web using websockets.
Diart is a python framework to build AI-powered real-time audio applications.
Its key feature is the ability to recognize different speakers in real time with state-of-the-art performance,
a task commonly known as "speaker diarization".
The pipeline `diart.SpeakerDiarization` combines a speaker segmentation and a speaker embedding model
to power an incremental clustering algorithm that gets better as the conversation progresses:

**We provide pre-trained AI pipelines for:**
<p align="center">
<img width="100%" src="/pipeline.gif" title="Real-time speaker diarization pipeline" />
</p>

With diart you can also create your own custom AI pipeline, benchmark it,
tune its hyper-parameters, and even serve it on the web using websockets.

**We also provide pre-trained pipelines for:**

- Speaker Diarization
- Voice Activity Detection
- Transcription ([coming soon](https://github.com/juanmc2005/diart/pull/144))
- [Speaker-Aware Transcription](https://betterprogramming.pub/color-your-captions-streamlining-live-transcriptions-with-diart-and-openais-whisper-6203350234ef) ([coming soon](https://github.com/juanmc2005/diart/pull/147))
Expand Down
Binary file added pipeline.gif
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit 84dc122

Please sign in to comment.