Missing file 'get_all_titles_from_spoken_wikipedia.py' #9
Comments
it's in |
That's working, thanks. I am wondering though where the nemo_compatible/scripts/nlp/en_spellmapper/dataset_preparation/build_training_data.sh Line 23 in 45fdcea
oh, it's the spoken wikipedia folder
Looks like the dataset is no longer available:
Ok, I added spoken_wiki_titles.txt to the repo; that should be sufficient for training.
I am currently trying to build the training dataset, but the referenced file is nowhere to be found:
nemo_compatible/scripts/nlp/en_spellmapper/dataset_preparation/build_training_data.sh
Line 23 in 27bce6d
./build_training_data.sh: 25: /nemo_compatible/scripts/nlp/en_spellmapper/dataset_preparation/NeMo/examples/nlp/spellchecking_asr_customization/evaluation/get_all_titles_from_spoken_wikipedia.py: not found
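For anyone hitting the same error, a small pre-flight check can report missing helper scripts before the build runs, instead of failing partway through. The following is only a minimal sketch and not part of the repository: the function name `find_missing` and the `root` parameter are hypothetical, and the path in `required` is copied from the error message above on the assumption that it is resolved relative to the dataset_preparation directory.

```python
from pathlib import Path
from typing import Iterable, List


def find_missing(paths: Iterable[str], root: str = ".") -> List[str]:
    """Return the entries of `paths` that are not existing files under `root`."""
    base = Path(root)
    return [p for p in paths if not (base / p).is_file()]


# Assumption: this relative path mirrors the one in the error message above.
required = [
    "NeMo/examples/nlp/spellchecking_asr_customization/evaluation/"
    "get_all_titles_from_spoken_wikipedia.py",
]

# Print one line per file that the build would fail to find.
for path in find_missing(required):
    print(f"missing: {path}")
```

Running this from the directory that contains build_training_data.sh prints a line for each script that is absent, and nothing when all of them are present.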