Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add bfloat16 support for ConcatTraining and SplitTraining ops #18280

Merged
merged 2 commits into from
Nov 7, 2023

Conversation

prathikr
Copy link
Contributor

@prathikr prathikr commented Nov 4, 2023

Description

Updates input/output type constraints on training operators ConcatTraining and SplitTraining to include bfloat16 which was introduced in IR version 4.

Motivation and Context

Enabling meta-llama/Llama-2-70b to be finetuned with ONNX Runtime training.

@prathikr prathikr changed the title add bfloat16 support add bfloat16 support for ConcatTraining and SplitTraining ops Nov 4, 2023
@prathikr prathikr added the stale issues that have not been addressed in a while; categorized by a bot label Nov 6, 2023
@prathikr prathikr merged commit 83c0275 into main Nov 7, 2023
90 of 91 checks passed
@prathikr prathikr deleted the prathikrao/concat-training-bfloat16-2 branch November 7, 2023 18:10
kleiti pushed a commit to kleiti/onnxruntime that referenced this pull request Mar 22, 2024
…oft#18280)

### Description
<!-- Describe your changes. -->

Updates input/output type constraints on training operators
ConcatTraining and SplitTraining to include bfloat16 which was
introduced in IR version 4.

### Motivation and Context
<!-- - Why is this change required? What problem does it solve?
- If it fixes an open issue, please link to the issue here. -->

Enabling `meta-llama/Llama-2-70b` to be finetuned with ONNX Runtime
training.

Co-authored-by: Prathik Rao <[email protected]@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
stale issues that have not been addressed in a while; categorized by a bot
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants