Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Lambda_m trainable #3

Open
Serega6678 opened this issue Jul 21, 2023 · 1 comment
Open

Lambda_m trainable #3

Serega6678 opened this issue Jul 21, 2023 · 1 comment

Comments

@Serega6678
Copy link

Hello,
In the paper, you say that lambda_m and lamda'_m are trainable parameters. Could you please clarify where are they located in the code? To me, it seems that weight=0.1 (which is not trainable) in the forward method of the model and this is the exact weight used to compute L_aux.
What am I missing?
Many thanks!

@Mckysse
Copy link
Owner

Mckysse commented Sep 5, 2023

Dear,
Good question! As you can see at the Section 4.1 of the paper, we had a footnote that illustrated "We did study semantic-wise weights by projecting the [CLS] token embeddings to a set of trainable parameters, but no further improvement could be achieved". So, we finally used the weight=0.1 as the fixed parameter.
Thanks for your attention!
Best,
Chen

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants