Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

There seems to be not a single sample in your epoch_iterator. At training dam model #5

Open
martinbombin opened this issue Jun 14, 2023 · 1 comment

Comments

@martinbombin
Copy link

Hi, after following the steps on the README I am facing this warning at training dam model for English. After it, I have division by 0 errors.

I am using Reta-LLM and what cause me the errors is the index_pipeline.py script.

[WARNING|trainer.py:2013] ... >> There seems to be not a single sample in your epoch_iterator, stopping training at step 0! This is expected if you're using an IterableDataset and set num_steps (X) higher than the number of available samples.

imagen
imagen

I have tried to change num_steps to 1, 10 and, 100000. All gave me same errors

@rucliujn
Copy link
Collaborator

Thank you for your attention to our projects and sorry for this error.
We find this error will occur when the number of examples (9 in your run) is smaller than the batch size (64).
We have fixed this error by minimizing the batch size with the num examples.
image
I hope this can fix your bugs. If you have any other problems, please let us know!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants