SFT for D2L + Pre-Training (rename of the previous SFT) #102
base: main
Conversation
execute with "python -m example.rlhf.supervised_finetuning_d2l"
Temporarily use all entries in the dataset as the training set (i.e., no eval)
…ers into a csv file
…ction, whether to disable evaluation configurable
… Use trl DataCollatorForCompletionOnlyLM instead of the customized one. Debug: cannot use ConstantLengthDataset or packing when using DataCollatorForCompletionOnlyLM
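
For context, a minimal sketch (not the PR's actual code) of wiring trl's DataCollatorForCompletionOnlyLM into SFTTrainer with packing disabled, which is the constraint the last commit describes. The model name, response template, and toy dataset are illustrative assumptions, and this assumes a trl version where SFTTrainer accepts these arguments directly:

from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DataCollatorForCompletionOnlyLM, SFTTrainer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained("gpt2")

train_dataset = Dataset.from_dict(
    {"text": ["### Question: What is 2+2?\n### Answer: 4"]}
)

# The collator masks everything before the response template, so the loss
# is computed on the answer tokens only.
collator = DataCollatorForCompletionOnlyLM(
    response_template="### Answer:", tokenizer=tokenizer
)

trainer = SFTTrainer(
    model=model,
    train_dataset=train_dataset,
    dataset_text_field="text",
    data_collator=collator,
    packing=False,  # packing/ConstantLengthDataset conflict with this collator
    tokenizer=tokenizer,
)
trainer.train()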
example/autorate/auto-rater.ipynb
Outdated
qq: what is this .ipynb file for?
This is used to generate synthetic immigration data by rephrasing.
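
A hedged sketch of what rephrase-based augmentation like this might look like; the CSV path, column name, and the trivial rephrase stand-in are all assumptions, since the notebook's actual prompt and model are not shown in this thread:

import pandas as pd

def rephrase(text: str) -> str:
    # Placeholder: the real notebook would call an LLM here with a prompt
    # such as "Rephrase the following question: ...".
    return f"Could you tell me: {text}"

df = pd.read_csv("immigration_qa.csv")  # assumed file and schema
augmented = df.assign(question=df["question"].map(rephrase))
pd.concat([df, augmented]).to_csv("immigration_qa_augmented.csv", index=False)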
pykoi/rlhf/config.py
Outdated
@@ -5,6 +5,7 @@
from accelerate import Accelerator
from peft import LoraConfig, TaskType
# TODO: DH: num_train_epochs=20,
nit: what is this commented-out code for?
@@ -0,0 +1,40 @@
from typing import Any, Dict, List, Union
qq: do we still need this customized collator, per our discussion?
pykoi/rlhf/pre_traning.py
Outdated
        seq_length=args.max_seq_length,
        # chars_per_token=chars_per_token,
    )
    return {"train": train_dataset, "eval": eval_dataset}
nit: a newline is needed at the end of the file. Make sure you set up your linter properly, as we discussed.
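
For readers of the snippet above, a sketch of how a packed dataset is typically built with trl's ConstantLengthDataset; everything except the seq_length/chars_per_token arguments is an assumption about the surrounding code:

from datasets import Dataset
from transformers import AutoTokenizer
from trl.trainer import ConstantLengthDataset

tokenizer = AutoTokenizer.from_pretrained("gpt2")
raw = Dataset.from_dict({"text": ["example document text " * 50]})

train_dataset = ConstantLengthDataset(
    tokenizer,
    raw,
    dataset_text_field="text",
    seq_length=512,         # plays the role of args.max_seq_length
    # chars_per_token=3.6,  # optional estimate, commented out in the PR
)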
pykoi/rlhf/supervised_finetuning.py
Outdated
f" Answer: {example[self._rlhf_config.answer_title]}") | ||
return text | ||
|
||
def prepare_d2l_text(self, example): |
nit: let's rename this method, because it can be used for other things.
Also, please add what you have tested for this PR.
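
As a hedged illustration of the suggested rename, a generically named formatter; the column names and defaults here are hypothetical, not the PR's actual config values:

def prepare_text(example, question_title="question", answer_title="answer"):
    """Format one dataset row into a single prompt/answer training string."""
    return (f"Question: {example[question_title]}"
            f" Answer: {example[answer_title]}")

# Example usage:
row = {"question": "What is D2L?", "answer": "Dive into Deep Learning."}
print(prepare_text(row))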
pykoi/rlhf/supervised_finetuning.py
Outdated
from pykoi.chat.db.constants import (QA_CSV_HEADER_ANSWER, QA_CSV_HEADER_ID,
                                     QA_CSV_HEADER_QUESTION,
                                     QA_CSV_HEADER_VOTE_STATUS)
from pykoi.chat.db.qa_database import QuestionAnswerDatabase
from pykoi.rlhf.config import RLHFConfig
from pykoi.telemetry.events import SFTStartEvent, SFTStopEvent
from pykoi.telemetry.telemetry import Telemetry
from trl import DataCollatorForCompletionOnlyLM
# from pykoi.rlhf.customize_data_collator import DataCollatorForCompletionOnlyLM
nit: let's remove unused code.
pykoi/rlhf/supervised_finetuning.py
Outdated
# resize the token embeddings to include the added special tokens
self.model.resize_token_embeddings(len(self.tokenizer))
data_collator = None
if self._rlhf_config.data_collator == "DataCollatorForCompletionOnlyLM":
You should consider setting data_collator to the DataCollatorForCompletionOnlyLM class instead of a string for the SFT training argument. Then, here you should check for None. Also, it looks like a bug to me if people use SFT without passing in a data_collator; therefore, you should set a proper default value in config.py.
I agree that a class is better than a string in the argument file.
I believe that if it is set to None, the default data collator will be used when None is passed to trl.SFTTrainer. Since the default data collator also depends on other parameters such as packing, setting it to None by default makes more sense than a fixed class.
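
A sketch of how the agreed-upon default could look in config.py; the field name and dataclass layout are assumptions rather than the repo's actual code:

from dataclasses import dataclass
from typing import Callable, Optional

@dataclass
class RLHFConfigSketch:
    # None means "let trl.SFTTrainer pick its default collator", which also
    # depends on other settings such as packing; a completion-only run would
    # store a DataCollatorForCompletionOnlyLM instance here instead.
    data_collator: Optional[Callable] = None

# In supervised_finetuning.py the check then becomes a None test rather
# than a string comparison:
# if config.data_collator is not None:
#     trainer_kwargs["data_collator"] = config.data_collator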
…r, we have to initialize the SFTTrainer in another way
Implement SFT and use D2L as a demo case. Rename the previous SFT to Pre-training and modify the corresponding scripts/notebooks.