Commit

Updated for recent release
comodoro committed Jul 21, 2021
1 parent 9d8063d commit e77b2bc
Showing 1 changed file with 5 additions and 1 deletion.
6 changes: 5 additions & 1 deletion README.md
@@ -1,6 +1,6 @@
# Czech Deepspeech Model

This is an experimental deepspeech model for the Czech language. The model is under the CC-BY-NC license, mainly because it has been trained on some CC-BY-NC datasets. All the datasets are:
This is an experimental deepspeech model for the Czech language. The model is under the CC-BY-NC license. Datasets used are:

- [Vystadial 2016 – Czech data](https://lindat.cz/repository/xmlui/handle/11234/1-1740) by Plátek, Ondřej ; Dušek, Ondřej ; Jurčíček, Filip (CC-BY-SA 4.0)
- [OVM – Otázky Václava Moravce](https://lindat.mff.cuni.cz/repository/xmlui/handle/11858/00-097C-0000-000D-EC98-3) by Šmídl, Luboš ; Pražák, Aleš (CC-BY-NC 3.0)
@@ -9,3 +9,7 @@ This is an experimental deepspeech model for the Czech language. The model is un
- [Common Voice Czech](https://commonvoice.mozilla.org/en/datasets) by Mozilla (CC0)
- Some private recordings and parts of audiobooks

The model was originally transfer-learned from the [English Deepspeech/Coqui model](https://github.com/coqui-ai/STT/releases/tag/v0.9.3) version 0.9.3.

The released scorers were created using the [CWC 2011 Corpus](https://lindat.mff.cuni.cz/repository/xmlui/handle/11858/00-097C-0000-0006-B847-6) by Spoustová, Johanka and Spousta, Miroslav (CC-BY 3.0).
