Enable runtime gpu_arch auto-select based on devices where kernels are executing for gemm_int4 tests; enable device-specific compilation using USE_XETLA (xe_lpg, xe_hpg, xe_hpc). #302

qgao007 · 2024-06-25T22:10:14Z

Type of Change

Change #1: Enable runtime gpu_arch auto-select based on devices where kernels are executing for gemm_int4 tests.
Change #2: Enable device-specific compilation using USE_XETLA (xe_lpg, xe_hpg, xe_hpc) to fix the current problem of tests not matching the device type.

Description

Add a wrapper class that takes a template template parameter over <gpu_arch, mma_engine>, so that gpu_arch is selected at runtime based on the device where the kernels execute.
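For reviewers unfamiliar with the pattern, here is a minimal, standalone sketch of runtime dispatch through a template template parameter. It is not the code in this PR: the enums are stand-ins for XeTLA's own, `gemm_int4_test` and `arch_dispatcher` are hypothetical names, and the arch-to-engine pairing (fpu for xe_lpg, xmx for xe_hpg and xe_hpc) is an assumption made for illustration. In a real test the `gpu_arch` value would come from querying the device; here it is passed in directly.

```cpp
#include <stdexcept>
#include <string>

// Stand-ins for XeTLA's enums (assumption: the real ones live in the library).
enum class gpu_arch { XeLpg, XeHpg, XeHpc };
enum class mma_engine { fpu, xmx };

constexpr const char *arch_name(gpu_arch a) {
    return a == gpu_arch::XeLpg ? "XeLpg"
         : a == gpu_arch::XeHpg ? "XeHpg"
                                : "XeHpc";
}

constexpr const char *engine_name(mma_engine e) {
    return e == mma_engine::fpu ? "fpu" : "xmx";
}

// Hypothetical test body, parameterized at compile time on <gpu_arch, mma_engine>.
// A real test would launch the int4 GEMM kernel; this one only reports its configuration.
template <gpu_arch Arch, mma_engine Engine>
struct gemm_int4_test {
    static std::string run() {
        return std::string(arch_name(Arch)) + "/" + engine_name(Engine);
    }
};

// Hypothetical dispatcher: accepts any class template with the
// <gpu_arch, mma_engine> signature and picks the instantiation at runtime.
template <template <gpu_arch, mma_engine> class Test>
struct arch_dispatcher {
    static std::string run(gpu_arch arch) {
        switch (arch) {
            // Assumed pairing: xe_lpg uses the FPU path, the others use XMX.
            case gpu_arch::XeLpg:
                return Test<gpu_arch::XeLpg, mma_engine::fpu>::run();
            case gpu_arch::XeHpg:
                return Test<gpu_arch::XeHpg, mma_engine::xmx>::run();
            case gpu_arch::XeHpc:
                return Test<gpu_arch::XeHpc, mma_engine::xmx>::run();
        }
        throw std::runtime_error("unsupported gpu_arch");
    }
};
```

With this sketch, a test would call `arch_dispatcher<gemm_int4_test>::run(detected_arch)`. Every branch of the switch is instantiated at compile time, which is one reason restricting the build to specific devices can matter.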

Add USE_XETLA options to compile for specific devices (xe_lpg, xe_hpg, xe_hpc).

Expected Behavior & Potential Risk

No foreseeable risk related to CMake compilation or code execution.

How has this PR been tested?

Tested on MTL and DG2.

Dependency Change?

No libraries changed.

qgao007 requested a review from DDEle June 25, 2024 22:10

DDEle added the XeTLA label Jul 8, 2024

Enable runtime gpu_arch auto-select based on devices where kernels ar…

12bafdc

…e executing for gemm_int4 tests; enable device-specific compilation using USE_XETLA (xe_lpg, xe_hpg, xe_hpc). Signed-off-by: Qun Gao <[email protected]>

qgao007 force-pushed the qg/xetla branch from 6327396 to 12bafdc Compare July 12, 2024 04:11

qgao007 closed this Jul 12, 2024

qgao007 reopened this Jul 12, 2024
