Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fine-tune the LLM for mnemonics #123

Open
StephanAkkerman opened this issue Dec 11, 2024 · 7 comments
Open

Fine-tune the LLM for mnemonics #123

StephanAkkerman opened this issue Dec 11, 2024 · 7 comments
Assignees
Labels
Difficulty: Hard 😭 This issue is hard to solve, a lot of code needs to be changed Improvement 📈 Improvement In Progress 🚧 This issue is currently actively being worked on Priority: Medium 🥈 Assign this label if this issue is used around once a day

Comments

@StephanAkkerman
Copy link
Owner

  1. Description:

    • Problem:
      Currently we use a very general LLM for generating the mnemonics. However, we are using 1 template for input sentences and want 1 sentence as output. Maybe we can make it more efficient by finetuning the LLM.

    • Solution:
      Look into the process of finetuning LLMs and see if it's possible for our use case.

    • Prerequisites:
      [List any requirements or dependencies needed before starting.]

  2. Tasks:

    • Research finetuning of LLM
    • Find out if this works for us
    • Train the model (or add a new issue for this)
  3. Additional context
    Add any other context or screenshots about the feature request here.

@StephanAkkerman StephanAkkerman added Difficulty: Medium 😐 This issue can be solved, but a decent amount of lines need to be changed Improvement 📈 Improvement Priority: Medium 🥈 Assign this label if this issue is used around once a day labels Dec 11, 2024
@TimKoornstra
Copy link
Collaborator

Or use LORA

@github-actions github-actions bot added the In Progress 🚧 This issue is currently actively being worked on label Dec 17, 2024
@StephanAkkerman StephanAkkerman removed the In Progress 🚧 This issue is currently actively being worked on label Dec 18, 2024
@github-actions github-actions bot added the In Progress 🚧 This issue is currently actively being worked on label Dec 18, 2024
@StephanAkkerman StephanAkkerman removed the In Progress 🚧 This issue is currently actively being worked on label Dec 18, 2024
@github-actions github-actions bot added the In Progress 🚧 This issue is currently actively being worked on label Dec 18, 2024
@StephanAkkerman StephanAkkerman removed the In Progress 🚧 This issue is currently actively being worked on label Dec 18, 2024
@StephanAkkerman StephanAkkerman removed their assignment Dec 18, 2024
@StephanAkkerman
Copy link
Owner Author

@StephanAkkerman
Copy link
Owner Author

Like at https://huggingface.co/spaces/k-mktr/gpu-poor-llm-arena for some more models

@StephanAkkerman
Copy link
Owner Author

Simple guide on how to implement LORAs with huggingface: https://medium.com/@kram254/train-large-language-models-with-lora-and-huggingface-d04be693dd7a

@StephanAkkerman StephanAkkerman self-assigned this Dec 30, 2024
@StephanAkkerman
Copy link
Owner Author

@StephanAkkerman
Copy link
Owner Author

Plan for generating the dataset:

  • Scrape sources for mnemonics
  • Ask SOTA model to create the sentence
  • Save this dataset

@StephanAkkerman StephanAkkerman added Difficulty: Hard 😭 This issue is hard to solve, a lot of code needs to be changed In Progress 🚧 This issue is currently actively being worked on and removed Difficulty: Medium 😐 This issue can be solved, but a decent amount of lines need to be changed labels Jan 4, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Difficulty: Hard 😭 This issue is hard to solve, a lot of code needs to be changed Improvement 📈 Improvement In Progress 🚧 This issue is currently actively being worked on Priority: Medium 🥈 Assign this label if this issue is used around once a day
Projects
None yet
Development

No branches or pull requests

2 participants