NVIDIA / cutlass Public

Notifications You must be signed in to change notification settings
Fork 980
Star 5.7k

Code
Issues 173
Pull requests 33
Discussions
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Issues: NVIDIA/cutlass

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

173 Open 981 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

how does the threadblock_tile_offset read the global memory in gemm_splitk_parallel.h ? ? - Needs Triage question

Question

#1957 opened Nov 21, 2024 by pily1

[QST] make_tiled_copy_B generates incompatible layouts ? - Needs Triage question

Question

#1953 opened Nov 20, 2024 by phantaurus

[BUG] Example 09_turing_tensorop_conv2dfprop does not work ? - Needs Triage bug

Something isn't working

#1952 opened Nov 19, 2024 by IzanCatalan

[QST] Modify how to load Activations and Filters ? - Needs Triage question

Question

#1950 opened Nov 18, 2024 by IzanCatalan

[BUG] Wrong assertion in integer_subbyte.h ? - Needs Triage bug

Something isn't working

#1949 opened Nov 18, 2024 by Algy

[QST]Question about vectorized memory accesses. ? - Needs Triage question

Question

#1946 opened Nov 15, 2024 by leizhao1234

[QST] Can hopper_int4_fp8_gemm support Scale with zero-point mode? ? - Needs Triage question

Question

#1944 opened Nov 15, 2024 by ZZBoom

[QST] Does TMA overlap memory copy from/to global memory address from another GPU return by cudaIpcGetMemHandle? ? - Needs Triage question

Question

#1943 opened Nov 15, 2024 by umiswing

[QST] What does "l" in "mnkl" mean in cutlass? ? - Needs Triage question

Question

#1939 opened Nov 13, 2024 by umiswing

[QST] FP8 with row-wise scaling on Ada-Lovelace ? - Needs Triage question

Question

#1937 opened Nov 11, 2024 by vgoklani

[QST] bfloat16 x int8 GEMM ? - Needs Triage question

Question

#1936 opened Nov 11, 2024 by sycz00

[QST] How to define a new custom kernel ? - Needs Triage question

Question

#1930 opened Nov 8, 2024 by IzanCatalan

[QST] Why tma_load.get_slice(0) here always need 0? ? - Needs Triage question

Question

#1929 opened Nov 8, 2024 by ziyuhuang123

[QST] Does CUTLASS 3.5.1 support int4 x float16 GEMMs natively? ? - Needs Triage question

Question

#1928 opened Nov 7, 2024 by SimpleTheoryOfTypes

[QST] Question Regarding To The Use Of Swizzle ? - Needs Triage question

Question

#1927 opened Nov 7, 2024 by Yanksi

[QST] Why did I get a wrong result from GemmGrouped? ? - Needs Triage question

Question

#1924 opened Nov 7, 2024 by WangNorthSea

[QST] Is there a Cutlass GEMM example to read inputs with custom padding? ? - Needs Triage question

Question

#1922 opened Nov 6, 2024 by ghostplant

[FEA] Better grid size for H100 GPU with SXM5 ? - Needs Triage feature request

New feature or request

#1921 opened Nov 6, 2024 by zhipeng93

[BUG] Cutlass python does not detect GPU ? - Needs Triage bug

Something isn't working

#1919 opened Nov 5, 2024 by IzanCatalan

[QST] Modifyinf a conv2d kernel and using it with python and pytorch ? - Needs Triage question

Question

#1918 opened Nov 5, 2024 by IzanCatalan

[BUG] TMA Cooperative GeMM with Stream-K scheduler hangs ? - Needs Triage bug

Something isn't working

#1917 opened Nov 4, 2024 by NihalPotdar

[QST] Is cutlass::bfloat16_t x cutlass::int2b_t GEMM possible? ? - Needs Triage question

Question

#1915 opened Nov 3, 2024 by areddy2022

[BUG] Unused variable ? - Needs Triage bug

Something isn't working

#1913 opened Oct 31, 2024 by r-barnes

[QST] Inconsistency in Rounding Implementations: Round-to-Nearest for TFloat32 vs. Round-to-Nearest-Even for BFloat16 ? - Needs Triage question

Question

#1908 opened Oct 30, 2024 by shanliang1992

[QST]Synchronizing Threads Between Loading Q/K and V in WASP ? - Needs Triage question

Question

#1900 opened Oct 27, 2024 by ziyuhuang123

Previous 1 2 3 4 5 6 7 Next

Previous Next

ProTip! Type g i on any issue or pull request to go back to the issue listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly