Add support of CUDA builtins #1092
Conversation
clang-tidy review says "All clean, LGTM! 👍"
Codecov Report
Attention: Patch coverage is …

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1092      +/-   ##
==========================================
- Coverage   94.26%   94.24%   -0.03%
==========================================
  Files          55       55
  Lines        8445     8447       +2
==========================================
  Hits         7961     7961
- Misses        484      486       +2
Looks great!
clang-tidy review says "All clean, LGTM! 👍"
test/CUDA/GradientKernels.cu (Outdated)
cudaMalloc(&d_in, 5 * sizeof(int));

auto add = clad::gradient(add_kernel, "in, out");
add.execute_kernel(dim3(1), dim3(5, 1, 1), dummy_out, dummy_in, d_out, d_in);
Shouldn't the dummy_out and dummy_in pointers be of size 5 ints? Currently, they are of size 1 int.
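For illustration, a sizing that matches the dim3(5, 1, 1) launch could look like this (a minimal sketch; the surrounding test code and how the buffers are used are assumptions, not the PR's actual test):

```cuda
#include <cuda_runtime.h>

int main() {
  // The kernel is launched with 5 threads per block, so the dummy
  // buffers need room for 5 ints each rather than 1.
  int *dummy_in = nullptr, *dummy_out = nullptr;
  cudaMalloc(&dummy_in, 5 * sizeof(int));
  cudaMalloc(&dummy_out, 5 * sizeof(int));

  // ... launch add.execute_kernel(dim3(1), dim3(5, 1, 1), ...) here ...

  cudaFree(dummy_in);
  cudaFree(dummy_out);
  return 0;
}
```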
cudaMemcpy(d_out, out, 5 * sizeof(int), cudaMemcpyHostToDevice);

int *d_in;
cudaMalloc(&d_in, 5 * sizeof(int));
Please initialize the d_in values to 0 to avoid any undefined behavior.
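A minimal sketch of the requested zero-initialization (assuming the CUDA runtime API; the rest of the test is omitted):

```cuda
#include <cuda_runtime.h>

int main() {
  int *d_in = nullptr;
  cudaMalloc(&d_in, 5 * sizeof(int));
  // Zero the device buffer so the gradient accumulates into a
  // well-defined state instead of reading uninitialized memory.
  cudaMemset(d_in, 0, 5 * sizeof(int));

  // ... copy inputs, launch the gradient kernel, copy results back ...

  cudaFree(d_in);
  return 0;
}
```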
clang-tidy review says "All clean, LGTM! 👍"
Looks good to me.
Can you please squash all the commits into one?
This will be done automatically when merging this PR.
The primary goal of squashing the commits manually in this case is to properly write and structure the commit message. For example, the commit message should describe which CUDA builtins are enabled by this pull request.
out[threadIdx.x] += in[threadIdx.x];
}

// CHECK: void add_kernel_2_grad(int *out, int *in, int *_d_out, int *_d_in) {
Can you investigate why we don't have __attribute__((device)) here?
Clang prints __attribute__((device)) if we replace the clang::CUDADeviceAttr::CreateImplicit call in m_Derivative->addAttr(clang::CUDADeviceAttr::CreateImplicit(m_Context)); with m_Derivative->addAttr(clang::CUDADeviceAttr::Create(m_Context));
Yes, I've seen that. When the attribute is created implicitly, it is "hidden": it is visible to the compiler but not printed. I can change that to explicit creation (Create instead of CreateImplicit) for clarity in another PR, if necessary.
My concern is that it may confuse the user into thinking that it is not a kernel actually being executed (as we don't print the overload)
Please create an issue for using Create instead of CreateImplicit.

> My concern is that it may confuse the user into thinking that it is not a kernel actually being executed (as we don't print the overload)

I think it is more misleading to not show any attribute at all. Having the attribute present is also necessary for customers wanting to independently use Clad-generated derivatives.
> Having the attribute present is also necessary for customers wanting to independently use Clad-generated derivatives.

But the attribute is not the correct one if users want to execute the function themselves, as they would expect the multi-threaded execution offered by a global kernel. The attribute should be __global__ instead of __device__. A work-around would be to prepend the string "__global__ " before printing or dumping to a source file, if we want this to be intuitive.
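To illustrate the distinction being discussed (a standalone sketch with placeholder names and bodies, not Clad's actual output): a __global__ function can be launched from the host, while a __device__ function can only be called from device code, so a derivative printed as __device__ could not be launched directly by the user.

```cuda
#include <cuda_runtime.h>

// Placeholder bodies standing in for a Clad-generated derivative.
__device__ void add_grad_device(int *out, int *in, int *_d_out, int *_d_in) {
  // Callable only from other device code; cannot be launched with <<<...>>>.
  _d_in[threadIdx.x] += _d_out[threadIdx.x];
}

__global__ void add_grad_global(int *out, int *in, int *_d_out, int *_d_in) {
  // Launchable from the host, giving the multi-threaded execution a user
  // of the generated derivative would expect.
  _d_in[threadIdx.x] += _d_out[threadIdx.x];
}

int main() {
  int *d_out, *d_in, *d_dout, *d_din;
  cudaMalloc(&d_out, 5 * sizeof(int));
  cudaMalloc(&d_in, 5 * sizeof(int));
  cudaMalloc(&d_dout, 5 * sizeof(int));
  cudaMalloc(&d_din, 5 * sizeof(int));
  cudaMemset(d_din, 0, 5 * sizeof(int));

  add_grad_global<<<1, 5>>>(d_out, d_in, d_dout, d_din);     // OK: __global__
  // add_grad_device<<<1, 5>>>(d_out, d_in, d_dout, d_din);  // error: __device__

  cudaDeviceSynchronize();
  cudaFree(d_out); cudaFree(d_in); cudaFree(d_dout); cudaFree(d_din);
  return 0;
}
```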
Force-pushed from 2824439 to 9a6924e.
clang-tidy review says "All clean, LGTM! 👍"
Force-pushed from 9a6924e to aaf593c.
clang-tidy review says "All clean, LGTM! 👍"
The commit message says: …
The pull request does not seem to have tests for …
clang-tidy review says "All clean, LGTM! 👍"
Added support of CUDA grid configuration builtin variables. Builtins tested: threadIdx, blockIdx, blockDim, gridDim, warpSize
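For context, a kernel touching these builtin variables might look like the following (an illustrative sketch, not one of the PR's tests):

```cuda
#include <cuda_runtime.h>

// Each thread computes its global index from the grid configuration
// builtins and scales its elements; warpSize is read as a plain value.
__global__ void scale_kernel(double *data, int n) {
  int idx = blockIdx.x * blockDim.x + threadIdx.x;
  int stride = gridDim.x * blockDim.x;
  for (int i = idx; i < n; i += stride)
    data[i] *= warpSize;  // warpSize is 32 on current NVIDIA GPUs
}

int main() {
  const int n = 128;
  double *d_data = nullptr;
  cudaMalloc(&d_data, n * sizeof(double));
  cudaMemset(d_data, 0, n * sizeof(double));
  scale_kernel<<<4, 32>>>(d_data, n);
  cudaDeviceSynchronize();
  cudaFree(d_data);
  return 0;
}
```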
Force-pushed from 15881d9 to 6cd24b3.
clang-tidy review says "All clean, LGTM! 👍"