Suggestion: return the best model from preTrainDArch #30

Open
alex7tula opened this issue Jan 26, 2018 · 3 comments

@alex7tula

The function preTrainDArch returns the model from the last epoch.
Might it be better to return the best model and add a stopAfterEpochs option?
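
A minimal sketch of the idea, written in Python for illustration and not based on darch's actual code. It assumes stopAfterEpochs would mean "stop after this many epochs without improvement" (called `patience` here); `pretrain_keep_best` and `train_epoch` are hypothetical names, and "error" stands for whatever error the pre-training reports per epoch.

```python
import copy
from typing import Any, Callable, Tuple


def pretrain_keep_best(
    model: Any,
    train_epoch: Callable[[Any], Tuple[Any, float]],
    max_epochs: int,
    patience: int,
) -> Tuple[Any, float]:
    """Run up to max_epochs and return the model with the lowest error seen.

    train_epoch is a hypothetical callback that trains for one epoch and
    returns (updated_model, error). Training stops early once `patience`
    consecutive epochs have failed to improve on the best error.
    """
    best_model = copy.deepcopy(model)
    best_error = float("inf")
    epochs_without_improvement = 0

    for _ in range(max_epochs):
        model, error = train_epoch(model)
        if error < best_error:
            # Snapshot the model so later epochs cannot overwrite it.
            best_model = copy.deepcopy(model)
            best_error = error
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                break

    return best_model, best_error
```

Returning the last model instead corresponds to dropping the snapshot and returning `model` after the loop.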

@saviola777
Collaborator

I may add options for that in the future, but I don't think it would have much importance or impact. Pre-training is unsupervised and is mainly used to initialize the network weights in a way that allows fine-tuning to converge quickly, so the actual error during pre-training is not that important.

@saviola777 saviola777 self-assigned this Jan 26, 2018
@alex7tula
Author

If we start from the best starting position, the final stage can train faster and reach a better result.

@alex7tula
Author

Otherwise it makes no sense to pre-train for more than one epoch.
The 10th epoch is as arbitrary as the first.
