Dense shared memory workspaces #302
base: gpu-workspaces
Conversation
Wanted to add Changwan as a reviewer, but I don't know his GitHub username.
Oh, I saw it just now. My GitHub username is "hochawa". Could you add me?
For whatever reason, it's not letting me add you as a reviewer, weird. I think you can review it anyway, though.
This commit adds a function `debugCompileSource` to a `Tensor` that allows for the `Tensor` to use a kernel from a provided source file instead of generating a new one. This allows developers to add prints/assertions to TACO generated code to debug faster. Inspired by Amalee's PR (tensor-compiler#302), I would have found a command like this very useful for debugging generated code.
This PR adds support for dense shared memory workspaces on GPUs. There's still a little bit of work to be done for the case where multiple precomputed temporaries are used in one kernel, though none of the tests explicitly check this case. It would be nice to get feedback on the choice to add the GPUWorkspace enums, especially since I'm attaching them to Var (similar to is_ptr and is_tensor, except the Var is assigned a GPUWorkspace enum). Another important question I have is whether the precomputed temporary (at least in the GPU case) needs the loop that initializes it to zero.
Also, this PR includes some debugging functions that I set up for myself but thought might be useful to others. After generating a kernel / taco temporary file, I wanted to be able to edit the just-generated file and then compile and run everything again with my handwritten changes.