Skip to content
This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

Enable runtime gpu_arch auto-select based on devices where kernels are executing for gemm_int4 tests; enable device-specific compilation using USE_XETLA (xe_lpg, xe_hpg, xe_hpc). #302

Open
wants to merge 1 commit into
base: xetla
Choose a base branch
from

Conversation

qgao007
Copy link

@qgao007 qgao007 commented Jun 25, 2024

Type of Change

Change #1: Enable runtime gpu_arch auto-select based on devices where kernels are executing for gemm_int4 tests.
Change #2: enable device-specific compilation using USE_XETLA (xe_lpg, xe_hpg, xe_hpc) to address the current messy issue of "tests not matching device type"

Description

template template class to wrap <gpu_arch, mma_engine> for runtime gpu_arch auto-select based on devices where kernels are executing.

USE_XETLA options for compilation on different devices (xe_lpg, xe_hpg, xe_hpc).

Expected Behavior & Potential Risk

No foreseeable risk related to CMake Compliation / code execution.

How has this PR been tested?

tested on mtl/dg2

Dependency Change?

No Libraries changed.

@qgao007 qgao007 requested a review from DDEle June 25, 2024 22:10
@DDEle DDEle added the XeTLA label Jul 8, 2024
…e executing for gemm_int4 tests; enable device-specific compilation using USE_XETLA (xe_lpg, xe_hpg, xe_hpc).

Signed-off-by: Qun Gao <[email protected]>
@qgao007 qgao007 changed the title Enable device-specific compilation feature MACRO using Option DEVICE with target_compile_definitions (GPU_ARCH, MMA_ENGINE) Enable runtime gpu_arch auto-select based on devices where kernels are executing for gemm_int4 tests; enable device-specific compilation using USE_XETLA (xe_lpg, xe_hpg, xe_hpc). Jul 12, 2024
@qgao007 qgao007 closed this Jul 12, 2024
@qgao007 qgao007 reopened this Jul 12, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants