Enabled Dynamo exporter #21713
Conversation
[Review comments on onnxruntime/python/tools/transformers/models/llama/convert_to_onnx.py, llama_inputs.py, llama_parity.py, and llama_torch.py were marked outdated and resolved. The branch was force-pushed from 033f6ee to 0f0ef37, from ef421df to 174888c, and from 70f06de to f1472d8.]
/azp run Big Models, Linux Android Emulator QNN CI Pipeline, Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux OpenVINO CI Pipeline, Linux QNN CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline
Commenter does not have sufficient privileges for PR 21713 in repo microsoft/onnxruntime
/azp run Big Models, Linux Android Emulator QNN CI Pipeline, Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux OpenVINO CI Pipeline, Linux QNN CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline
Azure Pipelines successfully started running 10 pipeline(s).
/azp run Windows ARM64 QNN CI Pipeline, Windows CPU CI Pipeline, Windows GPU CUDA CI Pipeline, Windows GPU DML CI Pipeline, Windows GPU Doc Gen CI Pipeline, Windows GPU TensorRT CI Pipeline, Windows x64 QNN CI Pipeline
Azure Pipelines successfully started running 7 pipeline(s).
/azp run onnxruntime-binary-size-checks-ci-pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed
Azure Pipelines successfully started running 4 pipeline(s).
Description
This PR modifies the run_dynamo_export function so that it mirrors the behavior of run_torchscript_merged_export rather than run_torchscript_separate_export. It also adjusts the main function so that run_dynamo is invoked correctly.
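For context, below is a minimal, standalone sketch of what a merged-style Dynamo export can look like. It is not the code from convert_to_onnx.py or llama_inputs.py; the model name, shapes, and save path are illustrative assumptions, and it presumes a PyTorch version (2.1+) where torch.onnx.dynamo_export is available.

```python
# Illustrative sketch only (not the PR's convert_to_onnx.py): export a decoder-style
# model with the Dynamo exporter, passing past_key_values so the exported graph covers
# the merged (prompt + token-generation) use case. Model ID and shapes are assumptions.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf", torch_dtype=torch.float32)
model.eval()

config = model.config
batch, past_len, seq_len = 2, 8, 1
head_dim = config.hidden_size // config.num_attention_heads

input_ids = torch.randint(0, config.vocab_size, (batch, seq_len))
attention_mask = torch.ones(batch, past_len + seq_len, dtype=torch.int64)
# One (key, value) pair per layer, each shaped (batch, num_kv_heads, past_len, head_dim).
past_key_values = [
    (
        torch.rand(batch, config.num_key_value_heads, past_len, head_dim),
        torch.rand(batch, config.num_key_value_heads, past_len, head_dim),
    )
    for _ in range(config.num_hidden_layers)
]

onnx_program = torch.onnx.dynamo_export(
    model,
    input_ids,
    attention_mask=attention_mask,
    past_key_values=past_key_values,
    export_options=torch.onnx.ExportOptions(dynamic_shapes=True),
)
onnx_program.save("llama2_dynamo.onnx")
```

The actual script drives this through its own helpers and argument parsing; the sketch only shows the shape of the call.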
Motivation and Context
The main motivation for this change is to enable successful export of LLaMA-2 and LLaMA-3 models to ONNX using the Dynamo exporter. Previously, the exporter saved two copies of the weights, which is inefficient. The modified approach saves only one copy of the weights, and the exported model supports both scenarios (prompt processing and token generation with past key/values); a shape-level illustration follows below. These changes improve the exporter's compatibility with LLaMA models, and by extension other models, and streamline the export process.
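To illustrate the "both scenarios" point, the sketch below shows, purely at the input-shape level, how a single merged model covers prefill and token generation. The names, vocabulary size, and shapes are assumptions for a hypothetical configuration, not the exact inputs produced by llama_inputs.py.

```python
# Illustrative only: the two call patterns a single merged decoder model must serve.
import torch

batch, num_layers, num_kv_heads, head_dim = 1, 32, 32, 128

# Scenario 1: prompt processing (prefill) - full prompt, empty KV cache.
prompt_len = 16
prefill_inputs = {
    "input_ids": torch.randint(0, 32000, (batch, prompt_len)),
    "attention_mask": torch.ones(batch, prompt_len, dtype=torch.int64),
    "past_key_values": [
        (torch.zeros(batch, num_kv_heads, 0, head_dim),
         torch.zeros(batch, num_kv_heads, 0, head_dim))
        for _ in range(num_layers)
    ],
}

# Scenario 2: token generation (decode) - one new token, populated KV cache.
past_len = 16
decode_inputs = {
    "input_ids": torch.randint(0, 32000, (batch, 1)),
    "attention_mask": torch.ones(batch, past_len + 1, dtype=torch.int64),
    "past_key_values": [
        (torch.rand(batch, num_kv_heads, past_len, head_dim),
         torch.rand(batch, num_kv_heads, past_len, head_dim))
        for _ in range(num_layers)
    ],
}
# One exported model with dynamic sequence/past lengths serves both input sets,
# so only one copy of the weights needs to be saved.
```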