Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sudden drop in accuracy #12

Open
wing212 opened this issue Dec 1, 2021 · 18 comments
Open

Sudden drop in accuracy #12

wing212 opened this issue Dec 1, 2021 · 18 comments
Assignees
Labels
question Further information is requested

Comments

@wing212
Copy link

wing212 commented Dec 1, 2021

Hello, I want to ask why the accuracy has suddenly dropped, and the accuracy of my reproduced article is much lower than that of the original text. I use a single 3090ti graphics card for training.
image

@voldemortX
Copy link
Owner

voldemortX commented Dec 1, 2021

@wing212 Hi! Did you run code from the latest master branch? And could I ask what is your exact setup? e.g. PyTorch & TorchVision version, CUDA version, whether mixed precision is used, etc.

Btw, I don't think there is 3090ti in the market yet, did you mean 3090?

@wing212 wing212 closed this as completed Dec 1, 2021
@voldemortX voldemortX added the question Further information is requested label Dec 1, 2021
@voldemortX voldemortX self-assigned this Dec 1, 2021
@voldemortX
Copy link
Owner

Did you accidentally closed the issue?

@voldemortX voldemortX reopened this Dec 1, 2021
@wing212
Copy link
Author

wing212 commented Dec 1, 2021

@wing212 Hi! Did you run code from the latest master branch? And could I ask what is your exact setup? e.g. PyTorch & TorchVision version, CUDA version, whether mixed precision is used, etc.

Btw, I don't think there is 3090ti in the market yet, did you mean 3090?
It is 3090. I have used ape
RIXX7V9B3)0M 5D0EXAE_RB
E$ }_00Q(2_K~4 E@FX8Y7F
x.

@wing212
Copy link
Author

wing212 commented Dec 1, 2021

@wing212 Hi! Did you run code from the latest master branch? And could I ask what is your exact setup? e.g. PyTorch & TorchVision version, CUDA version, whether mixed precision is used, etc.

Btw, I don't think there is 3090ti in the market yet, did you mean 3090?

should I adjust my learning rate?

@voldemortX
Copy link
Owner

voldemortX commented Dec 1, 2021

The performance should be reproduceable from my provided shells. Although if you use different PyTorch versions and NVIDIA's apex, it could lead to problems like gradient explosions. Could you give me more details about your shell commands and code version? Instead of using a different learning rate, I recommend full precision training first to isolate the problem.

EDIT: And your baseline (p0) performance also seems off.

@wing212
Copy link
Author

wing212 commented Dec 1, 2021

Yes, there are problems with the baseline, I don't know why. I am using the code of the single card training version
}52OQL(QD9II{MAXW(2AU
X5NW_)O~S1QR1JK _{RRE
.

@voldemortX
Copy link
Owner

voldemortX commented Dec 1, 2021

@wing212 Firstly, if you are using PyTorch >= 1.6, You best use the master branch. Also a clear environment without the standalone apex package. I suspect the problem is from there. Maybe refer to #2 and check your loss in tensorboard.

Secondly, for 1/30 cityscapes baselines, the number of epochs should be longer. See Appendix B in the paper. Although yours seems good enough there, we still should align the settings.

[important] Your lr for cityscapes seem wrong? It should be 0.004 from Table 1.

@wing212
Copy link
Author

wing212 commented Dec 1, 2021

@wing212首先,如果您使用的是 PyTorch >= 1.6,则最好使用主分支。也是一个没有独立顶点包的清晰环境。我怀疑问题出在哪里。也许参考#2并检查你在张量板中的损失。

其次,对于1/30的城市景观基线,纪元的数量应该更长。见文件附录B。虽然你的看起来足够好,但我们仍然应该对齐设置。

[重要]你对城市景观的了解似乎不对吗?它应为表 1 中的 0.004。
I originally wanted to try training with other learning rates to see what the results are. I forgot to change the learning rate to 0.001. I will change the epoch to 329 without using apex.

@voldemortX
Copy link
Owner

@wing212 Great! Let me know if you still can't reproduce results. FYI, the std for Cityscapes 1/30 experiments is 2.56 in my records.

@wing212
Copy link
Author

wing212 commented Dec 1, 2021

@wing212伟大!如果您仍然无法重现结果,请告诉我。仅供参考,在我的记录中,Cityscapes 1/30实验的标准是2.56。

Thank you, I will keep trying. I would like to ask what is 2.56?

@voldemortX
Copy link
Owner

@wing212伟大!如果您仍然无法重现结果,请告诉我。仅供参考,在我的记录中,Cityscapes 1/30实验的标准是2.56。

Thank you, I will keep trying. I would like to ask what is 2.56?

STD (标准差)

@wing212
Copy link
Author

wing212 commented Dec 1, 2021

@wing212 great! If you still can't reproduce the results, please let me know. For informational purposes only, the standard for the Cityscapes 1/30 experiment is 2.56 in my records.

Thank you, I will keep trying. I would like to ask what is 2.56?

STD (STANDARD DIFFERENCE)

ok

@wing212
Copy link
Author

wing212 commented Dec 2, 2021

Sorry to interrupt you again, I did not use mixed-precision this time, but the result is still wrong. The 1/50 challenge of VOC data set is very low accuracy, and the 1/30 challenge of cityscapes data set is abnormally high
S%GABQ4600I8 5J _K5E8)A
U@{5K0D$THLP XXRM92)}1](https://user-images.githubusercontent.com/68488554/144337970-8aa51984-42b4-46c3-b303-6dc18baa015e.png) ![S3HOL9L{36K1TDJ1R15CZ2

6@AW}AP5JHR 42H8YP@VX

.

@wing212
Copy link
Author

wing212 commented Dec 2, 2021

U@{5K0D$THLP XXR`M92)}1

@voldemortX
Copy link
Owner

@wing212 Actually, your results look fine. I did get a 57.24 for city-30-1, that is why we need to take averages in these experiments (the STD is quite large). My voc-50-1 was 63.50 (it was weirdly low but I kept it to be fair), try 50-0 or 50-2 the results should be higher and give you an average around 67.

@wing212
Copy link
Author

wing212 commented Dec 2, 2021

@wing212 Actually, your results look fine. I did get a 57.24 for city-30-1, that is why we need to take averages in these experiments (the STD is quite large). My voc-50-1 was 63.50 (it was weirdly low but I kept it to be fair), try 50-0 or 50-2 the results should be higher and give you an average around 67.

I will continue, thank you.

@voldemortX
Copy link
Owner

But it looks like you have sid=2 for the voc experiment, do check the dataset and data splits if you can't get similar avg from 3 runs.

@wing212
Copy link
Author

wing212 commented Dec 2, 2021

但看起来 voc 实验的 sid=2,如果无法从 3 次运行中获得类似的平均值,请检查数据集和数据拆分。

Ok, I will do it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants