
Generalizes TACO Merge lattice construction #390

Merged
merged 34 commits into tensor-compiler:master on Feb 4, 2022

Conversation

@rawnhenry (Collaborator) commented Feb 1, 2021

This PR should now be largely functional for the TACO CPU build.

This PR generalizes TACO's merge lattice construction. It largely implements the concepts discussed in this Master's thesis. In short, this PR adds:

  1. A generalized theory for merge lattice construction - TACO can now construct lattices for any iteration space instead of just intersections and unions.
  2. The concept of a fill value in TACO.
  3. Operator properties, from which TACO can infer some basic optimizations.

Why this is still in draft (TODOs in priority order; ETA mid-February):

  1. Fix the bug introduced by the merge :(
  2. Fix the segfault caused by fill values in the GPU backend :(
  3. Fix the compiler crash when indexVars that have been scheduled are used as operands to funcs
  4. Fix the segfault when running the GPU spmv schedule - looks like this is the same issue as scheduling_eval.spmvGPU test failure #389, so will not fix here
  5. The merge lattice implementation does not match the theory discussed in the thesis. The new theory is much simpler, requiring none of the complicated bookkeeping currently done in the code.
  6. The code responsible for extracting iteration algebra from index expressions does not currently handle nested Funcs.
  7. The code responsible for short-circuiting will fail if attempting to break out of multiple loops. Need to add a mechanism to detect this and insert goto statements when necessary. This is just a performance bug.

Current Limitations:

  1. Currently, TACO defaults to a fill value of zero whenever one is not specified (this is done in the TensorVar constructor). It would be nicer to do this only for RHS operands and let TACO set the fill value of LHS tensors during computation.

…w propagation of indexVar type to generated code
…omputing with IndexVars when using split command. Started implementing boilerplate code for iteration space algebra
…so users can create vectors of properties which are actually a vector of pointers. This is a work around to avoid object slicing when storing properties in a vector
…ith lowering generic tensorOp. Changed spec of special definition to take an IR function.
…ttice optimizations to the end of lattice construction. Conditions to apply these optimizations still not quite right so all tests pass for now. Need to also check that no producer regions have a special definition. Lowerer also needs to be altered to handle compute regions more generally instead of just sparse. Lowerer needs to be altered to handle explicit zeros.
…ice in lowerer. Added explicit zero checks in bottom loop of compute
…ing explicit zero checks before bottommost loop. Got Masked BFS optimization working for pull bfs!
…tice construction introduce in the merge. Seems to always include the dimension iterator in the merge points even when the iterators are sparse
@rawnhenry rawnhenry changed the title Array algebra Generalizes TACO Merge lattice theory Feb 1, 2021
@rawnhenry rawnhenry changed the title Generalizes TACO Merge lattice theory Generalizes TACO Merge lattice construction Feb 1, 2021
…orVar was merged incorrectly so all formats were dense inside the lattice
…the GPU backend to be completely non-functional
@rawnhenry (Collaborator, Author) commented:

The slice multiple ways test fails after the most recent merge from master. If you have a chance, would you mind taking a look @rohany?

It also seems that WindowedIndexVars should inherit from IndexExpr so that they can be used in computations.

This commit fixes a bug caused by merging together the windowing and
array algebra work. In particular, the newly introduced deep equality
defined on `Access` types did not include additions for the newly added
components used in windowing.
@rohany (Contributor) commented Feb 18, 2021

> The slice multiple ways test fails after the most recent merge from master. If you have a chance, would you mind taking a look @rohany?

I looked into it and opened a PR here: rawnhenry#1. I believe that merging it on your repository will cause this PR to get updated. I double-checked that make test passes on that commit. This wasn't a fun one to track down, though.

> It also seems that WindowedIndexVars should also inherit from IndexExpr so that they can be used in computations.

The way I have this set up is that anything windowing-related folds into an Access and is handled in lowering when looking at Access objects, rather than dealing with WindowedIndexVars everywhere.

@rohany (Contributor) commented Feb 18, 2021

Once that PR is merged, we can set up the new branch. However, I don't have write permission here, so I wouldn't be able to make it. @stephenchouca either you can make the branch or add me and I'll make it.

@stephenchouca (Contributor) commented Feb 18, 2021

I've created a new array_algebra branch based off the current master. @RawnH, you should be able to modify this PR to merge into that branch.

lower: fix a bug introduced by merging windowing and array algebra
@weiya711 weiya711 merged commit 7d84d5d into tensor-compiler:master Feb 4, 2022
4 participants