Merge pull request #143 from YichengDWu/docs
Update docs, fix bugs
YichengDWu authored May 1, 2024
2 parents 4caf3a2 + 390f789 commit 0e5a43b
Showing 16 changed files with 12,728 additions and 7,161 deletions.
8,036 changes: 8,036 additions & 0 deletions docs/src/assets/TF32F32.svg
7,796 changes: 746 additions & 7,050 deletions docs/src/assets/smem_copy_A.svg
3,376 changes: 3,376 additions & 0 deletions docs/src/assets/smem_copy_B.svg
2 changes: 1 addition & 1 deletion docs/src/manual/async.md
@@ -58,7 +58,7 @@ function matmul_kernel(A, sA_layout, copy_A,

cp_async_wait()

- @gc_preserve gemm!(mma_C, tCsA, tCsB, tCrC)
+ @gc_preserve gemm!(mma_C, tCrC, tCsA, tCsB, tCrC)
sync_threads()
end

4 changes: 2 additions & 2 deletions docs/src/manual/matmul.md
@@ -76,7 +76,7 @@ end
```
You can also call [`gemm!`] to perform the same operation:
```julia
- gemm!(tCsA, tCsB, tCrC)
+ gemm!(tCrC, tCsA, tCsB, tCrC)
```

The complete kernel code is as follows:
@@ -121,7 +121,7 @@ function matmul_kernel(A, sA_layout, tA,
cp_async_wait()
sync_threads()

- @gc_preserve gemm!(tCsA, tCsB, tCrC)
+ @gc_preserve gemm!(tCrC, tCsA, tCsB, tCrC)
sync_threads()
end

4 changes: 2 additions & 2 deletions docs/src/manual/pipeline.md
@@ -67,7 +67,7 @@ function matmul_kernel(A, sA_layout, copy_A,
copyto!(copy_B, tBsB, view(tBgB, :, :, :, k+1))
end

- @gc_preserve gemm!(mma_C, tCrA, tCrB, tCrC)
+ @gc_preserve gemm!(mma_C, tCrC, tCrA, tCrB, tCrC)
end

copyto!(tCgC, tCrC)
@@ -161,7 +161,7 @@ for the next tile. We prefetch the next tile from global memory to shared memory
smem_read, smem_write = smem_write, smem_read
end

- @gc_preserve gemm!(mma_C, tCrA[:, :, k_block], tCrB[:, :, k_block], tCrC)
+ @gc_preserve gemm!(mma_C, tCrC, tCrA[:, :, k_block], tCrB[:, :, k_block], tCrC)
end
end
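
Note on the hunks above: every change to `gemm!` adds a leading destination argument, and each call passes `tCrC` as both that destination and the trailing accumulator. A minimal sketch of what this call shape suggests, assuming (this commit does not confirm it) that the four-operand form computes `D = A * B + C`. `gemm_sketch!` is a hypothetical helper on plain arrays, not the library's `gemm!`, and the operand shapes used here are illustrative only:

```julia
# Hypothetical stand-in, NOT the library's gemm!.
# Assumed semantics: D = A * B + C, with A of size (M, K) and B of size (K, N).
function gemm_sketch!(D, A, B, C)
    M, K = size(A)
    N = size(B, 2)
    for n in 1:N, m in 1:M
        acc = C[m, n]              # start from the accumulator value
        for k in 1:K
            acc += A[m, k] * B[k, n]
        end
        D[m, n] = acc              # each element is read before it is written,
                                   # so D and C may be the same array
    end
    return D
end

A = [1.0 2.0; 3.0 4.0]
B = [1.0 0.0; 0.0 1.0]
C = ones(2, 2)

# Passing C in both positions mirrors the updated calls,
# e.g. gemm!(tCrC, tCsA, tCsB, tCrC): the accumulator is updated in place.
gemm_sketch!(C, A, B, C)
```

Under that assumption, the old three-operand calls and the new four-operand calls accumulate into the same register fragment; the new form only makes the destination explicit.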
