Skip to content

A framework for generating instructions data. Develop your own domain knowledge instructions set with human seeds data.

License

Notifications You must be signed in to change notification settings

OdiaGenAI/Olive_Farm

Repository files navigation

Olive Farm

Olive Farm is a cutting-edge web application crafted by the innovative minds at OdiaGenAI. It's designed to effortlessly generate LLM (Language Model) instruction sets in Indic languages. Presently, it offers support for Hindi and Odia, with seamless scalability to incorporate additional languages on the horizon.

This versatile tool accommodates inputs from a variety of sources, including:

  • URLs,
  • PDF documents,
  • Plain text.

Additionally, OliveFarm features a collection of pre-existing templates, powered by ChatGPT, to streamline the process of generating instruction sets. Experience the future of Indic language instruction with OliveFarm!

Contributors:

  • AR Kamaldeen (KIIT University, India)
  • SK Shahid (Silicon Institute of Technology, India)
  • Sambit Sekhar (Odia Generative AI, India)
  • Parul Agarwal (Institute of Mathematics, India)
  • Dr. Shantipriya Parida (Silo AI, Finland)

Steps to generate an instruction set.

  1. Select the language.
  2. Select the input content type.
  3. Select the number of questions to generate.
  4. Select the format of instruction.
  5. Provide your OpenAI key and submit.
  6. Based on the input content type, input the Text/URL/PDF and submit.
  7. Click on “Generate Instructions”, which will generate the number of questions selected in Step 3.
  8. Select the questions by clicking on the checkboxes and click on “Generate Answers” which will generate the answers.
  9. Save the instruction set (questions and answers) by clicking on the “Save as jsonl” button which will save the instruction set. and make sure to rename the file to .jsonl while saving
  10. The instruction set will be saved in the system.
  11. After Each generation set click on clear to format the data

Citation:

If you find this repository useful, please consider giving ⭐ and citing:

@misc{Olivefarm,
  author = {AR Kamaldeen and SK Shahid and Sambit Sekhar and Parul Agarwal Shantipriya Parida},
  title = {Olive Farm: WebAPP for Indic Instruction Set Generation},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/shantipriyap/OdiaGenAI}},
}

License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

CC BY-NC-SA 4.0

About

A framework for generating instructions data. Develop your own domain knowledge instructions set with human seeds data.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published