Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

📦 Support for packing tokenized datasets for SFT #2011

Merged
merged 3 commits into from
Nov 25, 2024

Commits on Nov 25, 2024

  1. feat: add support for packing tokenized datasetS

    Signed-off-by: Mehant Kammakomati <[email protected]>
    kmehant committed Nov 25, 2024
    Configuration menu
    Copy the full SHA
    c4508dd View commit details
    Browse the repository at this point in the history
  2. fix: address review comments

    Signed-off-by: Mehant Kammakomati <[email protected]>
    kmehant committed Nov 25, 2024
    Configuration menu
    Copy the full SHA
    5167c51 View commit details
    Browse the repository at this point in the history
  3. feat: add tests for pretokenized dataset packing

    Signed-off-by: Mehant Kammakomati <[email protected]>
    kmehant committed Nov 25, 2024
    Configuration menu
    Copy the full SHA
    685009f View commit details
    Browse the repository at this point in the history