Add fine tuning experiment #88

steventkrawczyk · 2023-09-12T01:27:36Z

No description provided.

NivekT

Looks good to me! Thank you for adding this. Just small comments:

We might need to ask users to install datasets from Hugging Face at the top.
Is "file-name" in openai.FineTuningJob.create(training_file="file-name", model="gpt-3.5-turbo") correct? I guess that may be a default name?
How long does the fine-tuning job take? We can mention that as well so users know what to expect.

I think adding a SQL syntax validator (as an eval function) in the future could be cool.

NivekT · 2023-09-12T18:16:55Z

prompttools/experiment/experiments/_utils.py

+ dfs_to_concat = [df[columns_with_multiple_unique_values], df[unhashable_columns]]
+ if 'prompt' in df:
+     dfs_to_concat.append(df['prompt'])
+ if 'messages' in df:
+     dfs_to_concat.append(df['messages'])


We might need to check whether the column already exists in dfs_to_concat. Otherwise, you might get a duplicate column, unless pd.concat handles duplication?

Good catch! I updated the logic to avoid duplicates
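For readers following the thread: pd.concat along axis=1 does not drop repeated column names, so appending df['prompt'] when 'prompt' is already among the selected columns produces two 'prompt' columns. The snippet below is a minimal sketch of one way to guard against that. It is not the change made in this PR, and the helper name concat_without_duplicate_columns and the sample data are hypothetical.

```python
import pandas as pd


def concat_without_duplicate_columns(frames):
    """Concatenate column-wise, keeping only the first occurrence of each column name.

    Hypothetical helper for illustration; not the logic merged in this PR.
    """
    combined = pd.concat(frames, axis=1)
    # columns.duplicated() marks every repeat of a name after its first occurrence.
    return combined.loc[:, ~combined.columns.duplicated()]


# Made-up data: 'prompt' is already among the selected columns.
df = pd.DataFrame(
    {"prompt": ["a", "b"], "temperature": [0.0, 1.0], "model": ["m", "m"]}
)
frames = [df[["prompt", "temperature"]], df["prompt"]]

naive = pd.concat(frames, axis=1)  # 'prompt' appears twice
deduped = concat_without_duplicate_columns(frames)  # 'prompt' appears once

print(list(naive.columns))
print(list(deduped.columns))
```

An alternative is to check membership before appending, which avoids building the duplicate column in the first place.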

steventkrawczyk · 2023-09-12T20:23:36Z

Added the relevant installations to the top section.
Updated the name to make it clearer that it's a placeholder.
Added some notes; tuning could take a few hours.

Totally agree that validating SQL is a good one, added an issue here: #89
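As a rough illustration of what such a validator could look like, here is a minimal sketch that uses Python's built-in sqlite3 module to check whether a query parses. It assumes the SQLite dialect only, the function name is_valid_sql is hypothetical, and wiring it into a prompttools eval function is left out because that interface is not shown in this thread.

```python
import sqlite3


def is_valid_sql(query: str) -> bool:
    """Return True if `query` parses as a single SQLite statement.

    Hypothetical sketch: the query is prepared with EXPLAIN against an empty
    in-memory database, so nothing is executed. Errors about missing tables,
    columns, or functions ("no such ...") are treated as syntactically valid,
    because they come from name resolution rather than parsing.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("EXPLAIN " + query)
        return True
    except (sqlite3.Error, sqlite3.Warning) as exc:
        # Anything other than a missing-name error is treated as invalid.
        return str(exc).startswith("no such")
    finally:
        conn.close()


print(is_valid_sql("SELECT name FROM users WHERE age > 30"))
print(is_valid_sql("SELEC name FROM users"))
```

Because this relies on SQLite's parser, queries that are valid in other dialects (for example, vendor-specific functions or syntax) may be rejected.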

steventkrawczyk requested a review from NivekT September 12, 2023 01:27

NivekT approved these changes Sep 12, 2023

steventkrawczyk added 3 commits September 12, 2023 13:13

Add fine tuning experiment

ba3ffa4

Finish example

e89845e

Address comments

9497015

steventkrawczyk force-pushed the ft-experiment branch from b05d7a8 to 9497015 Compare September 12, 2023 20:19

steventkrawczyk merged commit 44d997c into main Sep 12, 2023
2 checks passed
