Skip to content

Latest commit

 

History

History
18 lines (13 loc) · 561 Bytes

readme.md

File metadata and controls

18 lines (13 loc) · 561 Bytes

Awesome-Cute

Cute is new programming model in Nvidia GPU introduced from Cutlass3.0. Cute provides efficient abstraction for GPU programming to help developpers implement high performance CUDA code and more flexibly extend operator, such as epilogue or OP fusion.

This repo aims to implement some basic operators in deep learning by Cute from scratch.

Support Matrix

  • gemm_multstage
  • gemm_streamk
  • gemm_naive_warp_specialization
  • more implementation to update

Build project

git submodule update --init --recursive
bash ./build.sh