-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
LPD biasses do not follow the original paper #139
Comments
Totally agree! I think this is caused by LION still being a bit WIP and sometimes stuff gets changed because I test this or that paper etc. LION/LION/models/iterative_unrolled/LPD.py Lines 191 to 215 in a28be48
|
Add conv_bias parameter in LPD (Issue #139)
Thanks for your quick reply! Having a separate original paper settings, and LION default settings sounds like a good idea to me. In my (somewhat limited) experiments, LPD also performed better without convolution biasses. In the pull request I didn't include the two versions of the settings yet. |
Yup, will likely do that. I hope to tidy all this soon and get some
examples and demos by the end of the year, so it gets easier to use. Let me
know if you have any questions meanwhile, as its all a bit in the middle of
construction now
…On Thu, 17 Oct 2024, 09:12 Dirk Schut, ***@***.***> wrote:
Thanks for your quick reply!
Having a separate original paper settings, and LION default settings
sounds like a good idea to me. In my (somewhat limited) experiments, LPD
also performed better without convolution biasses. In the pull request I
didn't include the two versions of the settings yet.
—
Reply to this email directly, view it on GitHub
<#139 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AC2OENHRYQ2WINNELNM5XSTZ35WNPAVCNFSM6AAAAABQBNZE6SVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDIMJYHA3DCOBXGE>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
In the LPD implementation the bias terms of the convolution layers are turned off (bias=False in nn.Conv2d). However, in the original LPD paper they say that the convolution layers do have biasses. (Section IV.A In the equation after the line "where the k:th component is given by"). I also checked the original code, and it does not specify the bias, but the Tensorflow default is True.
Was it a deliberate choice to not follow the paper? Otherwise I think it's better to change the code so that bias=True for all convolution layers.
The text was updated successfully, but these errors were encountered: