Missing files #32

Open
Jalilnkh opened this issue Oct 12, 2023 · 1 comment

Comments

@Jalilnkh

You mentioned that, to make the model Android-compatible, "We use an entirely different tokenization procedure."
Could you let us know where these files are?

  1. Run final_tokenizer_train.py
  2. Run spm_extractor.py

I couldn't find them in the GitHub repository.

@harshitadd
Contributor

Please switch to the transformer-dev branch and run (a) make_concatenated_vocab.py, then (b) spm_model_generator.py, and finally (c) spm_extractor.py to train the tokenizer and generate the Android-compatible vocab files.
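
For anyone who wants to script that order, here is a minimal sketch. It assumes the three scripts sit in the current working directory on the transformer-dev branch and take no command-line arguments. Neither assumption is confirmed in this thread, so check each script before relying on it. The helper names `build_commands` and `run_pipeline` are hypothetical and are not part of the repository.

```python
import subprocess
import sys

# Script order as given in the reply above. Their location and the absence
# of command-line arguments are assumptions, not confirmed in this thread.
PIPELINE = [
    "make_concatenated_vocab.py",
    "spm_model_generator.py",
    "spm_extractor.py",
]


def build_commands(python=sys.executable, scripts=PIPELINE):
    """Return one argv list per script, in pipeline order."""
    return [[python, script] for script in scripts]


def run_pipeline(commands, runner=subprocess.run):
    """Run each command in order; check=True raises on the first failure."""
    for cmd in commands:
        runner(cmd, check=True)


# After `git checkout transformer-dev`, from the directory holding the scripts:
# run_pipeline(build_commands())
```

Because `check=True` raises on a non-zero exit code, a failure in an earlier step stops the later steps from running on incomplete output.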

If you notice any other files missing (they may have been renamed without the README being updated), the directory structure explanation gives the correct description of each file for the transformer.
