Skip to content

junnoslab/lucky-vicky-DPO

 
 

Repository files navigation

LoRA

Repository for PEFT Fine-Tuning with LoRA / QLoRA.

Resources

Models

Datasets

How to start

I used pixi for dependency & project manager. You can install pixi simply with command below

curl -fsSL https://pixi.sh/install.sh | bash

Once pixi is installed, use pixi shell to enable venv from project's directory.

pixi shell

Or, you can simply run training or inference by using pixi's task commands

## Training
# t is short-name for train.
# wandb setup is required.
pixi r t
pixi r train

# td is short-name for train-debug.
# Use this command to disable wandb.
pixi r td
pixi r train-debug

## Inference
# i is short-name for infer.
pixi r i
pixi r infer

Result

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%