Include a dummy_dataset.parquet file to test your code against. This dummy_dataset should include the columns for the data and metadata associated with the dataset, which will then be converted into the final format for language model consumption, along with an example row or rows that you can verify your code correctly collects. In addition to this file, include the unit test that evaluates your code against this dummy_dataset.
Give an example of the columns and data:
| col1 | col2 | .... |
| --- | --- | --- |
| row1 | row1 | .... |
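For illustration, here is a minimal sketch of what the dummy dataset and its unit test might look like. The column names (`text`, `url`, `date`), the sample rows, and the helpers `write_dummy_parquet` and `to_lm_format` are all hypothetical and not part of this template; the parquet step assumes pandas and a parquet engine such as pyarrow are installed.

```python
import unittest

# Hypothetical example rows; replace the columns with the real dataset's columns.
DUMMY_ROWS = [
    {"text": "How do I pick a power supply?",
     "url": "https://example.com/topic/1", "date": "2013-01-01"},
    {"text": "Second example post.",
     "url": "https://example.com/topic/2", "date": "2013-01-02"},
]


def write_dummy_parquet(path="dummy_dataset.parquet"):
    """Write DUMMY_ROWS to a parquet file (assumes pandas and pyarrow are installed)."""
    import pandas as pd  # imported lazily so the rest of the sketch runs without it

    pd.DataFrame(DUMMY_ROWS).to_parquet(path, index=False)


def to_lm_format(row):
    """Hypothetical conversion: keep 'text', move every other column under 'metadata'."""
    metadata = {key: value for key, value in row.items() if key != "text"}
    return {"text": row["text"], "metadata": metadata}


class TestDummyDataset(unittest.TestCase):
    def test_first_row_is_collected(self):
        out = to_lm_format(DUMMY_ROWS[0])
        self.assertEqual(out["text"], "How do I pick a power supply?")
        self.assertEqual(out["metadata"]["url"], "https://example.com/topic/1")
        self.assertNotIn("text", out["metadata"])
```

Saved as a test module, this can be run with `python -m unittest`; calling `write_dummy_parquet()` once produces the `dummy_dataset.parquet` file to commit alongside it.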
"Question" threads, in which each comment is a "suggestedAnswer" and the thread also provides the "acceptedAnswer" (potentially useful for some form of QA data?)
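A rough sketch of how such threads could be turned into QA pairs, assuming each thread is available as a schema.org-style `Question` object (for example, from JSON-LD embedded in the page). Whether the forum actually exposes its data this way is an assumption; the `name` and `text` fields follow the schema.org vocabulary, and `qa_pairs` and the sample thread are made up for illustration.

```python
import json


def qa_pairs(question):
    """Collect (question, answer, is_accepted) tuples from one Question dict."""

    def as_list(value):
        # A property may hold a single object, a list of objects, or be absent.
        if value is None:
            return []
        return value if isinstance(value, list) else [value]

    title = question.get("name", "")
    pairs = []
    for answer in as_list(question.get("acceptedAnswer")):
        pairs.append((title, answer.get("text", ""), True))
    for answer in as_list(question.get("suggestedAnswer")):
        pairs.append((title, answer.get("text", ""), False))
    return pairs


# Made-up sample thread, not taken from the forum.
sample = json.loads("""
{
  "@type": "Question",
  "name": "Which PSU should I buy?",
  "acceptedAnswer": {"@type": "Answer", "text": "A 650 W unit is enough."},
  "suggestedAnswer": [
    {"@type": "Answer", "text": "Check the GPU power draw first."}
  ]
}
""")

for title, text, accepted in qa_pairs(sample):
    print(title, "|", text, "|", "accepted" if accepted else "suggested")
```

Keeping the accepted flag as a separate field leaves the choice open between using only accepted answers or all suggested answers later.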
Title
Dataset URL - LinusTechTip
Does the dataset exist in a scraped format? No
Description
This is a well-known programming forum; a quick scan shows more than 10,000 topics dating from 2013.
Procedure
Tests