Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add configurability to dropout in MultiHeadSelfAttention Module #11

Closed
theissenhelen opened this issue Jul 31, 2024 · 0 comments · Fixed by #12
Closed

Add configurability to dropout in MultiHeadSelfAttention Module #11

theissenhelen opened this issue Jul 31, 2024 · 0 comments · Fixed by #12
Assignees
Labels
enhancement New feature or request

Comments

@theissenhelen
Copy link
Collaborator

Is your feature request related to a problem? Please describe.

Currently, the MultiHeadSelfAttention module has a fixed dropout rate of 0.00, which limits the ability to tune this hyperparameter for different use cases. This lack of configurability can hinder model optimization and performance, especially in scenarios where overfitting may occur due to smaller datasets.

Describe the solution you'd like

I would like to see the addition of a configurable dropout parameter to the MultiHeadSelfAttention module. This parameter should allow users to specify the dropout rate when initialising the module, enabling better customisation and optimization of the model.

Describe alternatives you've considered

No response

Additional context

No response

Organisation

No response

@theissenhelen theissenhelen added the enhancement New feature or request label Jul 31, 2024
@theissenhelen theissenhelen self-assigned this Jul 31, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant