I'm running the Llama-3-Instruct-8B-SPPO-Iter3 model locally and am very impressed by the improvement in quality over the original model. I can't help but wonder what the results would be if this finetuning process were run on larger models.
Is it possible to run the code on these larger models, or are the smaller versions too different from their larger counterparts, requiring a rework of the training scripts?
Thank you for what you have contributed, this is great stuff!
Thank you! We've trained a slightly larger model (UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3), which achieved an LC win rate of 53.27 using the same parameters and scripts.
As long as your GPU has sufficient VRAM, the training script should work well on larger models too. We will keep you updated as we proceed to train larger models.
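For reference, here is a minimal sketch (not the SPPO training code itself) of what pointing the pipeline at a larger checkpoint looks like with the Hugging Face `transformers` API. The model id and the memory arithmetic below are illustrative assumptions, only meant to show why VRAM is the main constraint:

```python
# Hedged sketch: loading a larger base model before fine-tuning.
# The actual SPPO runs are driven by the repo's own scripts/configs;
# this only illustrates the memory implications of scaling up.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical larger checkpoint; substitute whichever model you want to try.
model_id = "meta-llama/Meta-Llama-3-70B-Instruct"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # ~2 bytes per parameter for the weights
    device_map="auto",           # shard weights across all visible GPUs
)

# Rough feasibility check: a 70B model needs ~140 GB just for bf16 weights,
# before optimizer states and activations during training.
num_params = sum(p.numel() for p in model.parameters())
print(f"{num_params / 1e9:.1f}B parameters, ~{num_params * 2 / 1e9:.0f} GB in bf16")
```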