SemiEvol

This repository contains the official implementation of the paper: SemiEvol: Semi-supervised Fine-tuning for LLM Adaptation.

TLDR

We have developed SemiEvol, which leverages labeled data to enhance inference on unlabeled data, thereby improving the reasoning capabilities of Large Language Models (LLMs). This process is characterized as a bi-level knowledge propagation and selection mechanism.

BibTex

If our work has been helpful to you, please consider citing it. Your citation serves as encouragement for our research.

@misc{luo2024semievol,
    title={SemiEvol: Semi-supervised Fine-tuning for LLM Adaptation}, 
    author={Junyu Luo and Xiao Luo and Xiusi Chen and Zhiping Xiao and Wei Ju and Ming Zhang},
    year={2024},
    eprint={2410.14745},
    archivePrefix={arXiv},
    primaryClass={cs.CL},
    url={https://arxiv.org/abs/2410.14745}, 
}

You can also find the paper at HuggingFace.

Framework

SemiEvol maximizes the utility of labeled data through a bi-level knowledge propagation-and-selection framework, while leveraging collaborative learning among multiple LLMs to exploit unlabeled data, unleashing the full data potential.

Installation

Install dependencies using the requirements.txt file:

pip install -r requirements.txt

Dataset and Released Model Weights

Dataset: https://huggingface.co/datasets/luojunyu/SemiEvol

We have released model weights for two tasks to facilitate testing:

SemiEvol on MMLU based on Llama-3.1-8B: https://huggingface.co/luojunyu/Llama-3.1-8B-SemiEvol-MMLU
SemiEvol on MMLU-Pro based on Llama-3.1-8B: https://huggingface.co/luojunyu/Llama-3.1-8B-SemiEvol-MMLUPro

Usage

For evaluation, execute:

python eval.py

You can modify relevant parameters, including task and model, in eval.py.

To run the SemiEvol process, use:

python semievol.py

You can adjust parameters such as task, model, and other hyperparameters in semievol.py.

For evaluation of RAG methods, run:

python rag-eval.py

To convert data for fine-tuning, use:

python utils/convert_data.py

Please modify the parameters in the code before execution.

For Supervised Fine-Tuning (SFT) on Llama, we recommend referring to LlamaFactory. In our paper's experiments, we utilized LoRA for LLMs' SFT.
For SFT on GPT*, please refer to https://platform.openai.com/finetune and https://platform.openai.com/docs/guides/fine-tuning.
After SFT, consider using inference frameworks like vLLM to infer your model. Note that you need to register your model in config.py.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SemiEvol

TLDR

BibTex

Framework

Installation

Dataset and Released Model Weights

Usage

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
figs		figs
utils		utils
.gitignore		.gitignore
README.md		README.md
common.py		common.py
config.py		config.py
eval.py		eval.py
rag-eval.py		rag-eval.py
requirements.txt		requirements.txt
semievol.py		semievol.py

luo-junyu/SemiEvol

Folders and files

Latest commit

History

Repository files navigation

SemiEvol

TLDR

BibTex

Framework

Installation

Dataset and Released Model Weights

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages