Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question] Cosine Similarity #69

Open
dhkim0225 opened this issue Feb 28, 2024 · 0 comments
Open

[Question] Cosine Similarity #69

dhkim0225 opened this issue Feb 28, 2024 · 0 comments

Comments

@dhkim0225
Copy link

The total number of data points of cosine similarity figure presented in the paper was confirmed to be 36,
and based on this, I'm making several attempts to guess which model among small, base, and large was used.

When I performed visualization using the feature just before the residual connection, I found a significant difference from the feature trend presented in the paper.

My questions are as follows:

  1. What model was used for visualization? (Which one: small, base, large?)
  2. What specifically was the output feature of the feature mentioned in the paper and what layer was it? (The final output feature of the block? The feature just before further operations?)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant