LLM Fine-Tuning

Overview

This project fine-tunes language models with the PyTorch Lightning framework, using Weights & Biases hyperparameter sweeps, Ray for distributed execution, and LoRA for parameter-efficient training. It is tailored to English and Korean summarization tasks.

Key Components

  • l_sweep.py: The primary script for setting up and executing the fine-tuning process, including hyperparameter sweeps with Weights & Biases (Wandb).

Dependencies

Dependencies for this project are listed in the requirements.txt file.

Script Overview

The l_sweep.py script performs the following tasks:

  1. Imports Required Libraries: PyTorch Lightning, Ray, Weights & Biases, and Hugging Face Transformers.
  2. Initial Setup: Configures environment variables and initializes Wandb.
  3. Training Function: Defines the l2ray_trainer function to set up the model, tokenizer, dataset, and Trainer for fine-tuning (a rough sketch follows this list). This includes:
    • Loading the model and tokenizer via get_model().
    • Preparing the dataset using get_dataset().
    • Configuring the Trainer with callbacks for early stopping, learning rate monitoring, model checkpointing, and logging.
    • Running model training with Trainer.fit().
  4. Ray Wrapping: Defines the ray_wrapped_trainer function for executing the training on Ray clusters.
  5. Hyperparameter Sweeping: Uses Wandb to perform hyperparameter optimization, with a sweep configuration to explore various hyperparameter combinations.
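
For orientation, here is a minimal sketch of what such a training function can look like with PyTorch Lightning. The SummarizationModule wrapper, the config keys, and the exact signatures of get_model() and get_dataset() are assumptions for illustration; the actual implementation lives in l_sweep.py.

```python
# Illustrative sketch only; the real implementation is in l_sweep.py.
import pytorch_lightning as pl
from pytorch_lightning.callbacks import EarlyStopping, LearningRateMonitor, ModelCheckpoint
from pytorch_lightning.loggers import WandbLogger


def l2ray_trainer(config):
    # get_model() and get_dataset() are the repository helpers named above;
    # their signatures and the config keys used here are assumed.
    model, tokenizer = get_model(config["model_name"])
    train_loader, val_loader = get_dataset(tokenizer, config)

    # Assumed LightningModule wrapper around the Hugging Face model.
    module = SummarizationModule(model, lr=config["lr"])

    trainer = pl.Trainer(
        max_epochs=config["epochs"],
        accumulate_grad_batches=config["grad_accum"],    # gradient accumulation steps
        gradient_clip_val=config["grad_clip"],           # gradient clipping value
        logger=WandbLogger(project="llm-trainer-enko"),  # project name is a placeholder
        callbacks=[
            EarlyStopping(monitor="val_loss", patience=3),
            LearningRateMonitor(logging_interval="step"),
            ModelCheckpoint(monitor="val_loss", save_top_k=1),
        ],
    )
    trainer.fit(module, train_loader, val_loader)
```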

Dataset

The script fine-tunes models on English and Korean summarization datasets. Ensure that the dataset is properly prepared and accessible at jonathankang/ENKO-MEDIQA.
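
Assuming jonathankang/ENKO-MEDIQA is a Hugging Face Hub dataset identifier, a quick way to verify that the data resolves before launching a sweep:

```python
# Sanity check that the dataset is reachable; assumes it is hosted on the
# Hugging Face Hub under the identifier jonathankang/ENKO-MEDIQA.
from datasets import load_dataset

dataset = load_dataset("jonathankang/ENKO-MEDIQA")
print(dataset)  # shows the available splits and column names
```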

Configuration

  • Model: Specify the model names from hf_model_list.
  • Epochs: Define the number of epochs for training.
  • Learning Rate: Set the learning rate range.
  • Gradient Accumulation: Adjust the number of gradient accumulation steps.
  • Gradient Clipping: Define the gradient clipping value.
  • LoRA Parameters: Configure the LoRA parameters such as rank, alpha, dropout, and initialization weights. An illustrative sweep configuration follows this list.
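
The hyperparameters above are explored through a Weights & Biases sweep. The snippet below is an illustrative sweep configuration, not the repository's actual one: the search method, parameter names, and value ranges are assumptions, and hf_model_list stands in for the model list defined in l_sweep.py.

```python
# Illustrative Wandb sweep configuration; names and ranges are assumptions.
import wandb

hf_model_list = ["google/mt5-small"]  # placeholder; the real list is in l_sweep.py

sweep_config = {
    "method": "random",  # search strategy is assumed
    "metric": {"name": "val_loss", "goal": "minimize"},
    "parameters": {
        "model_name":   {"values": hf_model_list},
        "epochs":       {"values": [1, 3, 5]},
        "lr":           {"min": 1e-5, "max": 5e-4},   # learning rate range
        "grad_accum":   {"values": [4, 8, 16]},       # gradient accumulation steps
        "grad_clip":    {"value": 1.0},               # gradient clipping value
        "lora_r":       {"values": [8, 16, 32]},      # LoRA rank
        "lora_alpha":   {"values": [16, 32]},
        "lora_dropout": {"values": [0.0, 0.1]},
        "init_lora_weights": {"values": [True, "gaussian"]},
    },
}

sweep_id = wandb.sweep(sweep_config, project="llm-trainer-enko")
wandb.agent(sweep_id, function=ray_wrapped_trainer)  # ray_wrapped_trainer from l_sweep.py
```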

Usage

To execute the script, run:

bash sweep

To stop a run, find the sweep process and kill it:

ps -ef | grep sweep
kill -9 [sweepnumber]
