ShareGPT appending #4

Kquant03 · 2024-06-30T22:37:47Z

How is the ShareGPT format handled with this workflow? I'm currently developing a dataset that could be greatly benefited from this technique. However, I hate training on "User" and "Assistant" tokens. It goes against my intentions when working with language models. With Axolotl, there's a way to change the header IDs for sharegpt datasets. I was wondering if there was something similar I could do here, or perhaps I could just do some data processing to change the format...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ShareGPT appending #4

ShareGPT appending #4

Kquant03 commented Jun 30, 2024

ShareGPT appending #4

ShareGPT appending #4

Comments

Kquant03 commented Jun 30, 2024