-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fine-tune the LLM for mnemonics #123
Labels
Difficulty: Hard 😭
This issue is hard to solve, a lot of code needs to be changed
Improvement 📈
Improvement
In Progress 🚧
This issue is currently actively being worked on
Priority: Medium 🥈
Assign this label if this issue is used around once a day
Comments
StephanAkkerman
added
Difficulty: Medium 😐
This issue can be solved, but a decent amount of lines need to be changed
Improvement 📈
Improvement
Priority: Medium 🥈
Assign this label if this issue is used around once a day
labels
Dec 11, 2024
Or use LORA |
github-actions
bot
added
the
In Progress 🚧
This issue is currently actively being worked on
label
Dec 17, 2024
StephanAkkerman
removed
the
In Progress 🚧
This issue is currently actively being worked on
label
Dec 18, 2024
github-actions
bot
added
the
In Progress 🚧
This issue is currently actively being worked on
label
Dec 18, 2024
StephanAkkerman
removed
the
In Progress 🚧
This issue is currently actively being worked on
label
Dec 18, 2024
github-actions
bot
added
the
In Progress 🚧
This issue is currently actively being worked on
label
Dec 18, 2024
StephanAkkerman
removed
the
In Progress 🚧
This issue is currently actively being worked on
label
Dec 18, 2024
Closed
Like at https://huggingface.co/spaces/k-mktr/gpu-poor-llm-arena for some more models |
Simple guide on how to implement LORAs with huggingface: https://medium.com/@kram254/train-large-language-models-with-lora-and-huggingface-d04be693dd7a |
Could make our own dataset scraping sources like: |
Plan for generating the dataset:
|
StephanAkkerman
added
Difficulty: Hard 😭
This issue is hard to solve, a lot of code needs to be changed
In Progress 🚧
This issue is currently actively being worked on
and removed
Difficulty: Medium 😐
This issue can be solved, but a decent amount of lines need to be changed
labels
Jan 4, 2025
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
Difficulty: Hard 😭
This issue is hard to solve, a lot of code needs to be changed
Improvement 📈
Improvement
In Progress 🚧
This issue is currently actively being worked on
Priority: Medium 🥈
Assign this label if this issue is used around once a day
Description:
Problem:
Currently we use a very general LLM for generating the mnemonics. However, we are using 1 template for input sentences and want 1 sentence as output. Maybe we can make it more efficient by finetuning the LLM.
Solution:
Look into the process of finetuning LLMs and see if it's possible for our use case.
Prerequisites:
[List any requirements or dependencies needed before starting.]
Tasks:
Additional context
Add any other context or screenshots about the feature request here.
The text was updated successfully, but these errors were encountered: