Add qlora to our current codebase #22

samsja · 2023-05-30T07:06:59Z

Context

we want to add qlora (lora + 4 bits int quant) to our codebase.

The goal is to reduce memory usage and the cost of finetuning without degrading quality.

activate 4bits in Peft. Should be as easy as turning on a flag
run the modal on one GPU for one epoch and look at memory consumption compared to 8 bits training
run a full training (3 epochs) with the same parameters as 8bits training and compare result to see if we don't degrade quality

alaeddine-13 · 2023-05-30T09:07:22Z

you can get inspired by this PR: tloen/alpaca-lora#487

samsja assigned sebastian-weisshaar May 30, 2023

sebastian-weisshaar closed this as completed Jun 8, 2023