Hello, how long does it take to train this model, and on what machine? #13
Comments
On a Titan Xp, one night is enough.
OK, thanks. One more question: why does the classification output use a convolutional layer instead of Linear?
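Linear is a fully connected layer, so it predicts from the features of the whole image, while Conv predicts from local features. For text recognition, local features work better; Linear tends to mix up the features of different characters. With Linear, DenseNet ends up several percentage points worse.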
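To make the difference concrete, here is a minimal PyTorch sketch, not this repository's actual head. The tensor sizes, the `(height, 1)` kernel, and the names `conv_head` and `linear_head` are assumptions chosen for illustration. It perturbs the features under one horizontal position and measures which per-position predictions change.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical sizes for illustration; not taken from the repository.
batch, channels, height, width = 2, 64, 4, 16
num_classes = 100

# Stand-in for the backbone's output feature map.
features = torch.randn(batch, channels, height, width)

# Convolutional head (assumed shape): the kernel spans the full height
# but only one column, so each horizontal position is classified from
# the features directly under it.
conv_head = nn.Conv2d(channels, num_classes, kernel_size=(height, 1))

# Linear head: flattens the whole feature map, so every output position
# is computed from every input location.
linear_head = nn.Linear(channels * height * width, num_classes * width)


def conv_predict(x):
    # (batch, num_classes, 1, width) -> (batch, num_classes, width)
    return conv_head(x).squeeze(2)


def linear_predict(x):
    return linear_head(x.flatten(1)).view(-1, num_classes, width)


with torch.no_grad():
    conv_logits = conv_predict(features)
    linear_logits = linear_predict(features)

    # Change only the features under horizontal position 0.
    perturbed = features.clone()
    perturbed[:, :, :, 0] += 1.0

    # Largest logit change at each horizontal position.
    conv_delta = (conv_predict(perturbed) - conv_logits).abs().amax(dim=(0, 1))
    linear_delta = (linear_predict(perturbed) - linear_logits).abs().amax(dim=(0, 1))

print("conv   positions affected:", int((conv_delta > 1e-5).sum()))
print("linear positions affected:", int((linear_delta > 1e-5).sum()))
```

With the convolutional head, only the prediction at the perturbed position moves. With the linear head, the change leaks into the predictions for every position, which is the mixing of character features described above.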
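Hello, I am using the official pretrained DenseNet model with training data I generated myself. With a character set of 100 or 1000 (class_num=100 or 1000), it converges well. After expanding the character set to 6863, it still has not converged after 7 hours of training on 2 × 1080 Ti, with batch_size=32 and lr=0.001; the loss stays between 0.1 and 0.2. What could be the cause? Looking forward to your reply, thank you very much.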