Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

你好,请问怎么训练双语流式语音识别模型,看demo好像都是仅中文或者仅英文 #1201

Closed
cxt520ppgsh opened this issue Aug 7, 2023 · 4 comments

Comments

@cxt520ppgsh
Copy link

可以介绍一下https://huggingface.co/csukuangfj/sherpa-ncnn-streaming-zipformer-bilingual-zh-en-2023-02-13这里的model是怎么训练的吗

@marcoyang1998
Copy link
Collaborator

我们有一个中英文数据集的recipe,可以参考这里的数据准备和训练的脚本:https://github.com/k2-fsa/icefall/tree/master/egs/tal_csasr/ASR

@cxt520ppgsh
Copy link
Author

我们有一个中英文数据集的recipe,可以参考这里的数据准备和训练的脚本:https://github.com/k2-fsa/icefall/tree/master/egs/tal_csasr/ASR

感谢回复,我看了issue1054的方案大致是
Use https://github.com/k2-fsa/icefall/tree/master/egs/librispeech/ASR/pruned_transducer_stateless7_streaming
to replace pruned_transducer_stateless5

如果仅用tal_csasr 这个recipe是不支持流式的吧。
所以应该参考issue1054用pruned_transducer_stateless7_streaming去训练tal_csasr是吗?

@marcoyang1998
Copy link
Collaborator

建议参考 https://github.com/k2-fsa/icefall/tree/master/egs/librispeech/ASR/zipformer 训练,这是 icefall 最好的模型,如果需要流式模型,把causal设成True即可

@JinZr
Copy link
Collaborator

JinZr commented Nov 25, 2023

您好,您也可以参考这个 PR #1265 的内容,使用 byte-level BPE 训练流式 Zipformer 模型。

@JinZr JinZr closed this as completed Nov 25, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants