Fine-tune the LLM for mnemonics #123

StephanAkkerman · 2024-12-11T09:36:35Z

Description:
- Problem:
  Currently we use a very general LLM for generating the mnemonics. However, we are using 1 template for input sentences and want 1 sentence as output. Maybe we can make it more efficient by finetuning the LLM.
- Solution:
  Look into the process of finetuning LLMs and see if it's possible for our use case.
- Prerequisites:
  [List any requirements or dependencies needed before starting.]
Tasks:
- Research finetuning of LLM
- Find out if this works for us
- Train the model (or add a new issue for this)
Additional context
Add any other context or screenshots about the feature request here.

StephanAkkerman · 2024-12-11T09:42:01Z

https://www.reddit.com/r/LocalLLaMA/comments/1fm59kg/how_do_you_actually_finetune_a_llm_on_your_own/?rdt=61769

TimKoornstra · 2024-12-11T11:47:30Z

Or use LORA

StephanAkkerman · 2024-12-23T20:32:56Z

https://www.reddit.com/r/MachineLearning/s/Vj23oXeGuu

StephanAkkerman · 2024-12-26T13:56:27Z

Like at https://huggingface.co/spaces/k-mktr/gpu-poor-llm-arena for some more models

StephanAkkerman · 2024-12-29T13:34:59Z

Simple guide on how to implement LORAs with huggingface: https://medium.com/@kram254/train-large-language-models-with-lora-and-huggingface-d04be693dd7a

StephanAkkerman · 2024-12-30T16:29:17Z

Could make our own dataset scraping sources like:

StephanAkkerman · 2025-01-04T17:48:41Z

Plan for generating the dataset:

Scrape sources for mnemonics
Ask SOTA model to create the sentence
Save this dataset

StephanAkkerman added Difficulty: Medium 😐 This issue can be solved, but a decent amount of lines need to be changed Improvement 📈 Improvement Priority: Medium 🥈 Assign this label if this issue is used around once a day labels Dec 11, 2024

github-actions bot added the In Progress 🚧 This issue is currently actively being worked on label Dec 17, 2024

StephanAkkerman removed the In Progress 🚧 This issue is currently actively being worked on label Dec 18, 2024

github-actions bot added the In Progress 🚧 This issue is currently actively being worked on label Dec 18, 2024

StephanAkkerman removed the In Progress 🚧 This issue is currently actively being worked on label Dec 18, 2024

github-actions bot added the In Progress 🚧 This issue is currently actively being worked on label Dec 18, 2024

github-actions bot assigned StephanAkkerman Dec 18, 2024

StephanAkkerman removed the In Progress 🚧 This issue is currently actively being worked on label Dec 18, 2024

StephanAkkerman removed their assignment Dec 18, 2024

StephanAkkerman mentioned this issue Dec 18, 2024

Fix labeller.yaml #146

Closed

StephanAkkerman self-assigned this Dec 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fine-tune the LLM for mnemonics #123

Fine-tune the LLM for mnemonics #123

StephanAkkerman commented Dec 11, 2024

StephanAkkerman commented Dec 11, 2024

TimKoornstra commented Dec 11, 2024

StephanAkkerman commented Dec 23, 2024

StephanAkkerman commented Dec 26, 2024

StephanAkkerman commented Dec 29, 2024

StephanAkkerman commented Dec 30, 2024

StephanAkkerman commented Jan 4, 2025

Fine-tune the LLM for mnemonics #123

Fine-tune the LLM for mnemonics #123

Comments

StephanAkkerman commented Dec 11, 2024

StephanAkkerman commented Dec 11, 2024

TimKoornstra commented Dec 11, 2024

StephanAkkerman commented Dec 23, 2024

StephanAkkerman commented Dec 26, 2024

StephanAkkerman commented Dec 29, 2024

StephanAkkerman commented Dec 30, 2024

StephanAkkerman commented Jan 4, 2025