Improve speed in combining per-channel data #21563

duanshengliu · 2024-07-30T13:43:28Z

Description

Improve the speed of combining per-channel data by using a single np.concatenate call instead of calling np.concatenate repeatedly inside a for loop.
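A minimal sketch of the idea, not the actual diff in this PR: the function names, array shapes, dtype, and concatenation axis below are assumptions chosen for illustration. Concatenating inside a loop reallocates and copies the accumulated array on every iteration, so the total copying grows roughly quadratically with the number of channels, whereas a single call copies each channel once.

```python
import numpy as np


def combine_in_loop(per_channel_arrays):
    """Grow the result with one np.concatenate call per channel (the slow pattern)."""
    combined = per_channel_arrays[0]
    for channel_data in per_channel_arrays[1:]:
        # Each call allocates a new array and copies everything accumulated so far.
        combined = np.concatenate((combined, channel_data), axis=0)
    return combined


def combine_once(per_channel_arrays):
    """Join all channels with a single np.concatenate call."""
    return np.concatenate(per_channel_arrays, axis=0)


# Hypothetical stand-in for per-channel quantized weights: 8 channels of shape (1, 4).
channels = [np.full((1, 4), i, dtype=np.int8) for i in range(8)]
looped = combine_in_loop(channels)
single = combine_once(channels)
```

Both functions should produce the same array; only the number of intermediate allocations and copies differs.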

Motivation and Context

Fixes issue #21562.

duanshengliu · 2024-07-30T13:51:20Z

@yihonglyu @xadupre @adrianlizarraga @yufenglee PTAL

yufenglee · 2024-07-31T17:31:54Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

azure-pipelines · 2024-07-31T17:32:31Z

Azure Pipelines successfully started running 9 pipeline(s).

yufenglee · 2024-07-31T17:57:55Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Android CI Pipeline

yufenglee · 2024-07-31T17:58:17Z

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-07-31T17:58:30Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2024-07-31T17:58:34Z

Azure Pipelines successfully started running 10 pipeline(s).

yufenglee · 2024-07-31T20:21:45Z

/azp run Linux Android Emulator QNN CI Pipeline

azure-pipelines · 2024-07-31T20:21:54Z

Azure Pipelines successfully started running 1 pipeline(s).

yufenglee · 2024-07-31T20:22:41Z

Hi @duanshengliu, you need to sign the license/cla to move forward.

duanshengliu · 2024-08-05T15:37:50Z

> Hi @duanshengliu, you need to sign the license/cla to move forward.

Hi @yufenglee, I have updated the signature and the license/cla check has passed now. Could you please run CI again? Thanks.

yufenglee · 2024-08-05T16:52:35Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

yufenglee · 2024-08-05T16:52:49Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Android CI Pipeline

azure-pipelines · 2024-08-05T16:53:13Z

Azure Pipelines successfully started running 9 pipeline(s).

azure-pipelines · 2024-08-05T16:53:29Z

Azure Pipelines successfully started running 10 pipeline(s).

yufenglee · 2024-08-05T16:53:47Z

/azp run Linux Android Emulator QNN CI Pipeline

azure-pipelines · 2024-08-05T16:53:56Z

Azure Pipelines successfully started running 1 pipeline(s).

yufenglee · 2024-08-05T16:57:47Z

/azp run Windows GPU CUDA CI Pipeline, Windows GPU DML CI Pipeline, Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2024-08-05T16:57:53Z

No pipelines are associated with this pull request.

yufenglee · 2024-08-05T17:00:12Z

/azp run Windows GPU DML CI Pipeline

azure-pipelines · 2024-08-05T17:00:21Z

No pipelines are associated with this pull request.

yufenglee · 2024-08-05T17:01:07Z

Could you please sync to the main branch? It looks like the Windows GPU CUDA CI Pipeline is not in your branch.

duanshengliu · 2024-08-05T17:30:38Z

> Could you please sync to the main branch? Looks like Windows GPU CUDA CI Pipeline is not in your branch.

Done, please try again.

yufenglee · 2024-08-05T17:32:10Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

yufenglee · 2024-08-05T17:32:23Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Android CI Pipeline

azure-pipelines · 2024-08-05T17:32:47Z

Azure Pipelines successfully started running 9 pipeline(s).

yufenglee · 2024-08-05T17:33:00Z

/azp run Windows GPU CUDA CI Pipeline, Windows GPU DML CI Pipeline, Windows GPU Doc Gen CI Pipeline, Linux Android Emulator QNN CI Pipeline

azure-pipelines · 2024-08-05T17:33:03Z

Azure Pipelines successfully started running 10 pipeline(s).

azure-pipelines · 2024-08-05T17:33:18Z

Azure Pipelines successfully started running 1 pipeline(s).

yufenglee · 2024-08-05T17:34:01Z

/azp run Windows GPU CUDA CI Pipeline, Windows GPU DML CI Pipeline, Windows GPU Doc Gen CI Pipeline, Linux Android Emulator QNN CI Pipeline

azure-pipelines · 2024-08-05T17:34:10Z

Azure Pipelines successfully started running 1 pipeline(s).

duanshengliu · 2024-08-06T15:47:58Z

Hi @yufenglee, could you please review it again? By the way, please take a look at #21347 (comment)

duanshengliu mentioned this pull request Jul 30, 2024

[Performance] When using per-channel quantization for models with large weight sizes, the process can be extremely slow #21562

Closed

yufenglee previously approved these changes Jul 31, 2024

Improve speed in combining per-channel data

00fcfdb

Signed-off-by: duansheng.liu <[email protected]>

duanshengliu force-pushed the improve-combine-per-channel-data-speed branch 2 times, most recently from 00fcfdb to 3b77cf5 · August 5, 2024 13:49

duanshengliu dismissed yufenglee’s stale review via 00fcfdb August 5, 2024 17:12

duanshengliu force-pushed the improve-combine-per-channel-data-speed branch 2 times, most recently from ef8c09e to 00fcfdb · August 5, 2024 17:19

Merge branch 'main' into improve-combine-per-channel-data-speed

4fbb11a

duanshengliu requested a review from yufenglee August 5, 2024 17:31

yufenglee approved these changes Aug 6, 2024

yufenglee merged commit b95aa05 into microsoft:main Aug 6, 2024
78 of 80 checks passed
