LLMs LORA Finetuning

Research a finetuning of LLMs with LORA.

LLaMA 7B and 13B, 8 and 4 bit finetuned with LORA and pytorch lit.
GPT-J 7B 8 bit finetuning.
Saving and loading finutened models.
Evaluating few-shot method.
Using finetuned models in new task.
Comparing models perfomance with ChatGPT and other OpenAI models

The purpose of this repo to make little research of GPT like models and approaches to finetune quantized LLMs. Сlassification task was chosen as a test task. I compared accuracy of different setups. Also, I compared final metrics with metrics of OpenAI GPT models. This code can be reused to finutene GPT like models.

WanDB Report

System requirements

At least 11 GB of VRAM
Linux (required for bitsandbytes package)

This code was tested on WSL Ubuntu 22.04, Geforce GTX 1080 TI, Cuda toolkit 11.7

Usage

To reproduce results locally:

Prepare environment

sudo apt update && sudo apt install git build-essential -y
conda install cuda=11.7.1 -c nvidia -y
conda install pytorch torchvision torchaudio pytorch-cuda=11.7 -c pytorch -c nvidia -y
conda install -c conda-forge cudatoolkit=11.7 ninja accelerate sentencepiece -y

Clone repo

git clone https://github.com/vetka925/llms-lora-8bit-4bit-finetuning-lit

Install requirements**

cd gpt-j-8bit-lightning-finetune
pip install -r requirements.txt

Install CUDA extension for 4bit operations

python setup_cuda.py install

Run Jupyter notebook finetune.ipynb

jupyter notebook

**For possible issues with bitsandbytes on WSL use this

Description

Full research description on Medium, Habr

Finetuning: finetune.ipynb
Finetuning OpenAI model: compare_openai.ipynb
Load finetuned models and validate new data: inference_finetuned.ipynb
Fewshot example: fewshot.ipynb

Test task is Hate Speech and Offensive Language Detection.
Data: 1000 train and 200 validation samples with balanced classes from Hate Speech and Offensive Language Dataset

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
peft		peft
.gitignore		.gitignore
README.md		README.md
autograd_4bit.py		autograd_4bit.py
compare_openai.ipynb		compare_openai.ipynb
custom_datasets.py		custom_datasets.py
fewshot.ipynb		fewshot.ipynb
finetune.ipynb		finetune.ipynb
finetuners.py		finetuners.py
gradient_checkpointing.py		gradient_checkpointing.py
inference_finetuned.ipynb		inference_finetuned.ipynb
modelutils.py		modelutils.py
quant.py		quant.py
quant_cuda.cpp		quant_cuda.cpp
quant_cuda_kernel.cu		quant_cuda_kernel.cu
requirements.txt		requirements.txt
setup_cuda.py		setup_cuda.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLMs LORA Finetuning

System requirements

Usage

Description

About

Releases

Packages

Languages

vetka925/llms-lora-8bit-4bit-finetuning-lit

Folders and files

Latest commit

History

Repository files navigation

LLMs LORA Finetuning

System requirements

Usage

Description

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages