Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

efficacy of usfan and shot #1

Open
zwyang6 opened this issue Dec 4, 2022 · 6 comments
Open

efficacy of usfan and shot #1

zwyang6 opened this issue Dec 4, 2022 · 6 comments

Comments

@zwyang6
Copy link

zwyang6 commented Dec 4, 2022

Thanks for your contributions to the community

When I reproduce your code I found that 'source train' achieved the best results of Task: STL9, Accuracy: 73.79%. But when i launched the 'shot_lap' or 'shot' train, the results are worse than Accuracy: 73.79%. I only got 68.50% for 'shot_lap' and 66.0% for 'shot', respectively. Does the usfan help to alleviate the domain-shift problem from cifar9 -> stl9 in your experiments?

Look forward to your reply !

@roysubhankar
Copy link
Owner

Hi,

In our paper we actually report the cifar9->stl9 setting. If you run cifar9 as source and stl9 as target then you will see improvement over shot with our usfan method.

@zwyang6
Copy link
Author

zwyang6 commented Dec 4, 2022

Thanks for your reply !!!

Following your work, I did get the improvement over shot with usfan method from 66.0% to 68,.5%.
But I found the best results is not acquired by neither usfan nor shot. when i reproduce your code I found that 'source train' without usfan or shot, achieved the best results of Task: STL9, Accuracy: 73.79%. It confused me

Look forward to your further explaination.

@zwyang6
Copy link
Author

zwyang6 commented Dec 4, 2022

I see the henssian file generated at the source model training is not updated during target adaptaion training step. Should the hessian file be updated when train on target domain ?

@roysubhankar
Copy link
Owner

Thanks for your reply !!!

Following your work, I did get the improvement over shot with usfan method from 66.0% to 68,.5%.
But I found the best results is not acquired by neither usfan nor shot. when i reproduce your code I found that 'source train' without usfan or shot, achieved the best results of Task: STL9, Accuracy: 73.79%. It confused me

Look forward to your further explaination.

For the cifar->stl there is improvement with both shot and usfan over the source trained baseline. However shot's improvement is marginal wrt the baselines. Usfan i think gains around 2%.

Yes, you are right, stl->cifar indeed doesn't bring gain with both the source free methods over source baseline.

@roysubhankar
Copy link
Owner

I see the henssian file generated at the source model training is not updated during target adaptaion training step. Should the hessian file be updated when train on target domain ?

Ideally you can update the Hessian computation during target training. Since we train on one target we keep it fixed, otherwise support from the source model will be lost. However, in a continual learning scenario with several sequential target domains, you may want to update the Hessian after each target domain. We didn't explore this scenario.

@zwyang6
Copy link
Author

zwyang6 commented Dec 5, 2022

For the cifar->stl there is improvement with both shot and usfan over the source trained baseline. However shot's
Yes, you are right, stl->cifar indeed doesn't bring gain with both the source free methods over source baseline.

So it embarassinhgly means that both source free methods don't work in this scenario ? Shouldn't the proposed method brings improvement to the source baseline ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants