Train openAI curie model using the data sample of at least a 100 elements #9

Open
3 tasks
kislerdm opened this issue Feb 4, 2023 · 1 comment
Labels
ml Machine learning/data science tasks

Comments

@kislerdm
Owner

kislerdm commented Feb 4, 2023

Problem

OpenAI provides LLMs, but they do not perform well on the text2diagram task.

Proposed Solution

Fine-tune the curie model on a data sample of 100+ elements:

  • A training sample of 100+ data points will be collected as a JSON file with the structure:
[{
     "prompt": "three connected boxes",
     "completion": {"nodes":[{"id":"0"},{"id":"1"},{"id":"2"}],"links":[{"from":"0","to":"1"},{"from":"1","to":"2"},{"from":"2","to":"0"}]}
},
...
]
  • The training will be submitted following the instructions. See the reference for details.
  • The resulting model's suffix will be the commit SHA of the *.py file containing the training data sample.
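
As a rough sketch of the data preparation step: OpenAI's legacy fine-tuning for base models such as curie takes a JSONL file with string-valued "prompt" and "completion" fields, so the nested "completion" object in the structure above would need to be serialized to a string first. The helper names, the prompt separator and the stop marker below are assumptions for illustration only; none of them are defined in this issue.

```python
import json

# Hypothetical markers (assumptions, not specified in this issue):
# a fixed separator ends every prompt, a fixed stop marker ends every completion.
PROMPT_SEPARATOR = "\n\n###\n\n"
COMPLETION_STOP = "\n"

# One record in the structure proposed above.
SAMPLES = [
    {
        "prompt": "three connected boxes",
        "completion": {
            "nodes": [{"id": "0"}, {"id": "1"}, {"id": "2"}],
            "links": [
                {"from": "0", "to": "1"},
                {"from": "1", "to": "2"},
                {"from": "2", "to": "0"},
            ],
        },
    }
]


def validate_graph(graph: dict) -> None:
    """Raise ValueError if a link refers to a node id that is not declared."""
    node_ids = {node["id"] for node in graph["nodes"]}
    for link in graph["links"]:
        for end in ("from", "to"):
            if link[end] not in node_ids:
                raise ValueError(f"link refers to unknown node id: {link[end]!r}")


def to_jsonl(samples: list[dict]) -> str:
    """Convert the JSON sample into JSONL text, one training record per line."""
    lines = []
    for sample in samples:
        validate_graph(sample["completion"])
        # Serialize the graph compactly so the model learns a single canonical form.
        completion = json.dumps(sample["completion"], separators=(",", ":"))
        record = {
            "prompt": sample["prompt"] + PROMPT_SEPARATOR,
            "completion": " " + completion + COMPLETION_STOP,
        }
        lines.append(json.dumps(record))
    return "".join(line + "\n" for line in lines)
```

Calling `to_jsonl(SAMPLES)` returns the text to write to the training file. Validating the links before serialization is a design choice meant to catch contributed examples that reference undeclared nodes before they reach the training run.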

DoD

  • The model is successfully trained

Acceptance Criteria

  • The model is available for inference
  • The model performance, i.e. the training metrics, is noted
@kislerdm kislerdm added the ml Machine learning/data science tasks label Feb 4, 2023
@kislerdm kislerdm added this to the MVP milestone Feb 4, 2023
@kislerdm kislerdm moved this to Todo in diagramastext Feb 5, 2023
@kislerdm kislerdm removed the status in diagramastext Feb 6, 2023
@kislerdm kislerdm modified the milestones: MVP, v0.0.3 Feb 14, 2023
@kislerdm kislerdm changed the title Train openAI curie model using the data sample of 10 elements Train openAI curie model using the data sample of at least a 100 elements Feb 20, 2023
@kislerdm
Owner Author

@deim @diegohordi @ColeDrain @UtopicUnicorn Hey folks! Your help is crucial here! Please submit your examples to the directory; feel free to use the sample as a template for the structure.

@kislerdm kislerdm moved this to In Progress in diagramastext Feb 20, 2023
@kislerdm kislerdm removed this from the v0.0.3 milestone Mar 22, 2023
@kislerdm kislerdm moved this from In Progress to Todo in diagramastext Mar 22, 2023
@kislerdm kislerdm removed the status in diagramastext Mar 22, 2023

5 participants