support/optimize ASR on HPU #280

Spycsh · 2024-06-12T08:50:12Z

Description

In this PR Whisper ASR inference is enabled on HPU, and at the same time optimized with one warmup with a long enough audio. (<The Whisper short-form case 30s limit). After this warmup, future shorter generation should follow the same subset of cached HPU graph and be fast. The e2e perf gain HPU compared to CPU can be up to roughly 2~3x.

Language is updated to be required as explicitly specified: (e.g. "english", "chinese"). Check the Whisper compatible language list. The default language is "english". We deprecate the auto language detection (where AudioSpeechRecognition.language is None) until #1049 is considered to be handled. It is not a must except for special usage. The default language is "english".

Issues

List the issue or RFC link this PR is working on. If there is no such link, please mark it as n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds new functionality)
Breaking change (fix or feature that would break existing design and interface)

Dependencies

None

Tests

None

Signed-off-by: Spycsh <[email protected]>

for more information, see https://pre-commit.ci

* optimize asr on hpu Signed-off-by: Spycsh <[email protected]>

Spycsh and others added 2 commits June 12, 2024 01:25

optimize asr on hpu

d234c21

Signed-off-by: Spycsh <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

32504bb

for more information, see https://pre-commit.ci

chensuyue assigned lvliang-intel Jun 12, 2024

chensuyue approved these changes Jun 12, 2024

View reviewed changes

hshen14 approved these changes Jun 12, 2024

View reviewed changes

hshen14 merged commit 2a48601 into opea-project:main Jun 12, 2024
8 checks passed

Spycsh added a commit to Spycsh/GenAIExamples that referenced this pull request Jun 19, 2024

support/optimize ASR on HPU (opea-project#280)

449c106

* optimize asr on hpu Signed-off-by: Spycsh <[email protected]>

yogeshmpandey pushed a commit to hteeyeoh/GenAIExamples that referenced this pull request Aug 12, 2024

support/optimize ASR on HPU (opea-project#280)

68e552c

* optimize asr on hpu Signed-off-by: Spycsh <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support/optimize ASR on HPU #280

support/optimize ASR on HPU #280

Spycsh commented Jun 12, 2024 •

edited

Loading

support/optimize ASR on HPU #280

support/optimize ASR on HPU #280

Conversation

Spycsh commented Jun 12, 2024 • edited Loading

Description

Issues

Type of change

Dependencies

Tests

Spycsh commented Jun 12, 2024 •

edited

Loading