Adding cuda kernel (optimized for sm80) for block-wise 4b quantized float 16 GEMM. #18619
Azure Pipelines / Linux Android Emulator QNN CI Pipeline
succeeded
Feb 28, 2024 in 10m 24s
Build #20240228.26 succeeded
Loading