-
Notifications
You must be signed in to change notification settings - Fork 70
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The value of loss stays high from beginning to end within training time #6
Comments
@snailchan , I also have the same problem, have you solved it ? |
@killerjian007 , I stopped here for a long time. If you had a solution, I would appreciate your sharing! |
Hi @snailchan @killerjian007 , I got the following error: I used @snailchan matchnet_siamese & solver.prototxt above. For your help, thank you very much. |
Hi @pribadihcr |
@killerjian007 |
Hi, @killerjian007 , have you solved the problem? |
i have the same problem also. i used my own data to train. maybe the format of training data is wrong??(i am trying to find the error).do you preprocess the training data as the paper?? |
@snailchan ,i've been on this experiment for a long long time,and my loss was still about 0.69,how can i know the way the author train this network |
@LeonSCZ i had used matconvnet to do the network ,and found that the learning rate should be less than 0.00005, you can try it in caffe |
@zhengxiawu can you please share your solver parameters or loss function? Is the LR the only thing you changed? He used SoftmaxWithLoss here, but actually the paper talkes about Softmax+CrossEntropy, which isn't available in Caffe, since there is only SigmoidCrossEntropy. No idea if that is the problem though. I have the same 0.69 loss. UPDATE: It turns out it had to do more with the weight initialization than the LR. If u try out some different fillers, it'll work. In my case, it worked with using guassian fillers with 0.1 std on the first conv layers. |
@mkaraki48 Do you have solve the problem? |
@pribadihcr How did you solve the error? |
hi, |
@mkaraki48 Could you please share your solver.prototxt that you have used to get reasonable response from training the network? I still get the 0.69 loss the entire training time even with the parameters that you have shared in your update. Thanks a lot! |
Can you please tell me how to train this model on my dataset |
The value of loss stays high from beginning to end within training time
I’m sorry to disturb you again. For academic study, I planned to recurrent your experiment result. Thus I written the network file “matchnet_siamese.prototxt” and the solver file “matchnet_siamese_solver.prototxt”, according to the file you shared. However, limited by my ability, I just trained the network without pipelines you introduced.
But, when I supervised the output window, the value of loss stayed high, vibrating around 0.69, much higher than which I obtained when training other classifier network.
So I want to please you help me check where the error comes from.
Thanks for your reading.
matchnet_siamese.txt
matchnet_siamese_solver.txt
The text was updated successfully, but these errors were encountered: