Restructure differentiation schedule into a breadth first traversal #848

vaithak · 2024-04-01T11:54:49Z

No description provided.

codecov · 2024-04-03T00:56:10Z

Codecov Report

Attention: Patch coverage is 96.74419% with 7 lines in your changes are missing coverage. Please review.

Project coverage is 94.77%. Comparing base (7b2b713) to head (72fc487).
Report is 7 commits behind head on master.

❗ Current head 72fc487 differs from pull request most recent head 5c896e8. Consider uploading reports for the commit 5c896e8 to get more accurate results

Additional details and impacted files

@@           Coverage Diff           @@
##           master     #848   +/-   ##
=======================================
  Coverage   94.77%   94.77%           
=======================================
  Files          49       49           
  Lines        7499     7563   +64     
=======================================
+ Hits         7107     7168   +61     
- Misses        392      395    +3

Files	Coverage Δ
include/clad/Differentiator/DerivativeBuilder.h	`100.00% <ø> (ø)`
include/clad/Differentiator/DiffPlanner.h	`100.00% <100.00%> (ø)`
lib/Differentiator/DiffPlanner.cpp	`98.63% <100.00%> (+0.04%)`	⬆️
lib/Differentiator/HessianModeVisitor.cpp	`99.47% <100.00%> (+0.01%)`	⬆️
lib/Differentiator/ReverseModeForwPassVisitor.cpp	`100.00% <100.00%> (ø)`
tools/ClangPlugin.h	`94.07% <100.00%> (+0.18%)`	⬆️
tools/DerivedFnInfo.cpp	`100.00% <100.00%> (ø)`
tools/DerivedFnInfo.h	`100.00% <ø> (ø)`
lib/Differentiator/BaseForwardModeVisitor.cpp	`98.98% <98.86%> (-0.09%)`	⬇️
tools/ClangPlugin.cpp	`95.95% <66.66%> (-0.57%)`	⬇️
... and 1 more

Files	Coverage Δ
include/clad/Differentiator/DerivativeBuilder.h	`100.00% <ø> (ø)`
include/clad/Differentiator/DiffPlanner.h	`100.00% <100.00%> (ø)`
lib/Differentiator/DiffPlanner.cpp	`98.63% <100.00%> (+0.04%)`	⬆️
lib/Differentiator/HessianModeVisitor.cpp	`99.47% <100.00%> (+0.01%)`	⬆️
lib/Differentiator/ReverseModeForwPassVisitor.cpp	`100.00% <100.00%> (ø)`
tools/ClangPlugin.h	`94.07% <100.00%> (+0.18%)`	⬆️
tools/DerivedFnInfo.cpp	`100.00% <100.00%> (ø)`
tools/DerivedFnInfo.h	`100.00% <ø> (ø)`
lib/Differentiator/BaseForwardModeVisitor.cpp	`98.98% <98.86%> (-0.09%)`	⬇️
tools/ClangPlugin.cpp	`95.95% <66.66%> (-0.57%)`	⬇️
... and 1 more

github-actions

clang-tidy made some suggestions

lib/Differentiator/BaseForwardModeVisitor.cpp

lib/Differentiator/HessianModeVisitor.cpp

tools/ClangPlugin.h

github-actions

clang-tidy made some suggestions

lib/Differentiator/DiffPlanner.cpp

github-actions

clang-tidy made some suggestions

github-actions · 2024-04-03T18:24:01Z

tools/ClangPlugin.h

@@ -288,6 +292,11 @@ namespace clad {
      return P.ProcessDiffRequest(request);
    }

+    void AddRequestToSchedule(CladPlugin& P,


warning: function 'AddRequestToSchedule' defined in a header file; function definitions in header files can lead to ODR violations [misc-definitions-in-headers]

void AddRequestToSchedule(CladPlugin& P, ^

Additional context

tools/ClangPlugin.h:294: make as 'inline'

void AddRequestToSchedule(CladPlugin& P, ^

vgvassilev

Thank you! This is a major improvement. We have discussed that now we can process diff schedules separately from the differentiation process. That is, we first plan and then act. This PR does not get us there just yet, and this is why it relies on a forward declarations because it only appends to the diff plan instead of inserting.

That's an incremental change but we should also try to make the next step maybe in a separate PR.

demos/Arrays.cpp

include/clad/Differentiator/DerivativeBuilder.h

lib/Differentiator/DiffPlanner.cpp

vgvassilev · 2024-04-06T19:01:20Z

lib/Differentiator/ReverseModeVisitor.cpp

+            if (arg->getName() == "p")
+              m_Variables[arg] = m_Result;
+            idx += 1;
+            continue;


Can we come up with a test here?

This is for Jacobian computation and the particular case is for something related to ROOT as captured here: #478 (comment).
I will look into this when I work on the Jacobain array-related issue: #472

github-actions

clang-tidy made some suggestions

github-actions · 2024-04-10T22:56:30Z

lib/Differentiator/VisitorBase.cpp

    CXXScopeSpec SS;
    bool isArrow = Base->getType()->isPointerType();
    auto ME = m_Sema
-                  .ActOnMemberAccessExpr(getCurrentScope(), Base, noLoc,
+                  .ActOnMemberAccessExpr(getCurrentScope(), Base, Loc,


warning: 3rd argument 'Loc' (passed to 'OpLoc') looks like it might be swapped with the 6th, 'noLoc' (passed to 'TemplateKWLoc') [readability-suspicious-call-argument]

.ActOnMemberAccessExpr(getCurrentScope(), Base, Loc, ^

Additional context

llvm/include/clang/Sema/Sema.h:5307: in the call to 'ActOnMemberAccessExpr', declared here

ExprResult ActOnMemberAccessExpr(Scope *S, Expr *Base, ^

This one seems irrelevant, it is using some heuristic to deduce that noLoc is more similar to OpLoc, maybe because of the Levenshtein distance as mentioned here: https://releases.llvm.org/13.0.0/tools/clang/tools/extra/docs/clang-tidy/checks/readability-suspicious-call-argument.html#levenshtein-distance-as-levenshtein.

lib/Differentiator/VisitorBase.cpp

parth-07

Looks really good.

vgvassilev

Looks like the clang-tidy reports are relevant. Can you address them?

.github/workflows/ci.yml

vgvassilev

LGTM! Do you want the merge the 6 commits the way they are or you want to squash or something else?

vaithak · 2024-04-12T14:34:59Z

I think the current order of commits is fine.

Plan for dynamic graph - The relations between different differentiation requests can be modelled as a graph. For example, if `f_a` calls `f_b`, there will be two differentiation requests `df_a` and `df_b`, the edge between them can be understood as `created_because_of`. This also means that the functions called by the users to be explicitly differentiated (or `DiffRequests` created because of these) are the source nodes, i.e. no incoming edges. In most cases, this graph aligns with the call graph, but in some cases, the graph depends on the internal implementation, like the Hessian computation, which requires creating multiple `fwd_mode` requests followed by a `rev_mode` request. - We can use this graph to order the computation of differentiation requests. This was already being done implicitly in the initial recursive implementation. Whenever we encountered a call expression, we started differentiation of the called function; this was sort of like a depth-first search strategy. - This had problems, as `Clang` reported errors when it encountered a new function scope (of the derivative of the called function) in the middle of the old function scope (of the derivative of the callee function). It treated the nested one like a lambda expression. The issue regarding this: vgvassilev#745. - To fix this, an initial strategy was to eliminate the recursive approach. Hence, a queue-based breadth-first approach was implemented in this PR: vgvassilev#848. - Although it fixed the problem, the graph traversal was still implicit. We needed some way to compute/store the complete graph and possibly optimize it, such as converting edges to model the `requires_derivative_of` relation. Using this, we could proceed with differentiation in a topologically sorted ordering. - It also required one caveat: although we don't differentiate the called function completely in a recursive way, we still need to declare it so that we can have the call expression completed (i.e. `auto t0 = f_pushforward(...)`). - To move towards the final stage of having the complete graph computed before starting the differentiation, we need the complete information on how the `DiffRequest` will be formed inside the visitors (including arguments or `DVI` info). This whole approach will require activity analysis in the first pass. - As an incremental improvement, the first requirement was to implement infrastructure to support explicit modelling of the graph and use that to have a breadth-first traversal (and eventually topological ordering). This is the initial PR for capturing the differentiation plan in a graphical format. However, the traversal order is still breadth-first, as we don't have the complete graph in the first pass - mainly because of a lack of information about the args required for `pushforward` and `pullbacks`. This can be improved with the help of activity analysis to capture the complete graph in the first pass, processing the plan in a topologically sorted manner and pruning the graph for user-defined functions. I started this with this approach, and the initial experimental commit is available here for future reference: 82c0b42.

Plan for dynamic graph - The relations between different differentiation requests can be modelled as a graph. For example, if `f_a` calls `f_b`, there will be two differentiation requests `df_a` and `df_b`, the edge between them can be understood as `created_because_of`. This also means that the functions called by the users to be explicitly differentiated (or `DiffRequests` created because of these) are the source nodes, i.e. no incoming edges. In most cases, this graph aligns with the call graph, but in some cases, the graph depends on the internal implementation, like the Hessian computation, which requires creating multiple `fwd_mode` requests followed by a `rev_mode` request. - We can use this graph to order the computation of differentiation requests. This was already being done implicitly in the initial recursive implementation. Whenever we encountered a call expression, we started differentiation of the called function; this was sort of like a depth-first search strategy. - This had problems, as `Clang` reported errors when it encountered a new function scope (of the derivative of the called function) in the middle of the old function scope (of the derivative of the callee function). It treated the nested one like a lambda expression. The issue regarding this: #745. - To fix this, an initial strategy was to eliminate the recursive approach. Hence, a queue-based breadth-first approach was implemented in this PR: #848. - Although it fixed the problem, the graph traversal was still implicit. We needed some way to compute/store the complete graph and possibly optimize it, such as converting edges to model the `requires_derivative_of` relation. Using this, we could proceed with differentiation in a topologically sorted ordering. - It also required one caveat: although we don't differentiate the called function completely in a recursive way, we still need to declare it so that we can have the call expression completed (i.e. `auto t0 = f_pushforward(...)`). - To move towards the final stage of having the complete graph computed before starting the differentiation, we need the complete information on how the `DiffRequest` will be formed inside the visitors (including arguments or `DVI` info). This whole approach will require activity analysis in the first pass. - As an incremental improvement, the first requirement was to implement infrastructure to support explicit modelling of the graph and use that to have a breadth-first traversal (and eventually topological ordering). This is the initial PR for capturing the differentiation plan in a graphical format. However, the traversal order is still breadth-first, as we don't have the complete graph in the first pass - mainly because of a lack of information about the args required for `pushforward` and `pullbacks`. This can be improved with the help of activity analysis to capture the complete graph in the first pass, processing the plan in a topologically sorted manner and pruning the graph for user-defined functions. I started this with this approach, and the initial experimental commit is available here for future reference: vaithak@82c0b42.

vaithak marked this pull request as ready for review April 1, 2024 12:12

vaithak marked this pull request as draft April 1, 2024 12:12

Change differentiation schedule for forward mode

993a719

vaithak force-pushed the diff-plans branch from ed695ad to 770faa8 Compare April 3, 2024 00:48

This was linked to issues Apr 3, 2024

Possible memory leak when differentiating call expressions in forward mode #745

Closed

Undo partially disable demo commit #821

Closed

github-actions bot reviewed Apr 3, 2024

View reviewed changes

Fix hessian with new schedule plan and add tests

8788574

vaithak force-pushed the diff-plans branch 2 times, most recently from bde43e1 to 8788574 Compare April 3, 2024 01:57

github-actions bot reviewed Apr 3, 2024

View reviewed changes

lib/Differentiator/DiffPlanner.cpp Outdated Show resolved Hide resolved

vaithak force-pushed the diff-plans branch 3 times, most recently from 7bfd4aa to 6918417 Compare April 3, 2024 18:03

Fix DVI info for pullback methods

c1fea0b

github-actions bot reviewed Apr 3, 2024

View reviewed changes

Modify ordering of pullbacks in reverse mode

076babf

vaithak force-pushed the diff-plans branch from 6918417 to 72fc487 Compare April 4, 2024 18:57

vaithak marked this pull request as ready for review April 5, 2024 12:41

vaithak requested review from vgvassilev, parth-07 and PetroZarytskyi April 5, 2024 12:41

vgvassilev reviewed Apr 6, 2024

View reviewed changes

Reorder filechecks in tests for pullbacks

785ba1f

vaithak force-pushed the diff-plans branch 5 times, most recently from d0b651e to 509ec63 Compare April 10, 2024 20:56

vaithak force-pushed the diff-plans branch from 509ec63 to 4927735 Compare April 10, 2024 21:50

vaithak mentioned this pull request Apr 10, 2024

Avoid access of plugin methods from Visitors #857

Open

vaithak force-pushed the diff-plans branch 2 times, most recently from a7236d0 to bfb2c7c Compare April 10, 2024 22:32

github-actions bot reviewed Apr 10, 2024

View reviewed changes

vaithak force-pushed the diff-plans branch 3 times, most recently from 4f5fabf to 541d126 Compare April 11, 2024 14:48

parth-07 approved these changes Apr 11, 2024

View reviewed changes

vaithak force-pushed the diff-plans branch 2 times, most recently from ba3b914 to 2b7334c Compare April 11, 2024 18:59

vgvassilev reviewed Apr 11, 2024

View reviewed changes

.github/workflows/ci.yml Outdated Show resolved Hide resolved

Fix failures in debug mode

5c896e8

vaithak force-pushed the diff-plans branch from 2b7334c to 5c896e8 Compare April 12, 2024 09:50

vgvassilev approved these changes Apr 12, 2024

View reviewed changes

vgvassilev merged commit 4f8292c into vgvassilev:master Apr 12, 2024
87 checks passed

vaithak mentioned this pull request Apr 30, 2024

Compute and process Differentiation Request graph #873

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Restructure differentiation schedule into a breadth first traversal #848

Restructure differentiation schedule into a breadth first traversal #848

vaithak commented Apr 1, 2024

codecov bot commented Apr 3, 2024 •

edited

Loading

github-actions bot left a comment

github-actions bot left a comment

github-actions bot left a comment

github-actions bot Apr 3, 2024

vgvassilev left a comment

vgvassilev Apr 6, 2024

vaithak Apr 10, 2024

github-actions bot left a comment

github-actions bot Apr 10, 2024

vaithak Apr 12, 2024 •

edited

Loading

parth-07 left a comment

vgvassilev left a comment

vgvassilev left a comment

vaithak commented Apr 12, 2024

Restructure differentiation schedule into a breadth first traversal #848

Restructure differentiation schedule into a breadth first traversal #848

Conversation

vaithak commented Apr 1, 2024

codecov bot commented Apr 3, 2024 • edited Loading

Codecov Report

github-actions bot left a comment

Choose a reason for hiding this comment

github-actions bot left a comment

Choose a reason for hiding this comment

github-actions bot left a comment

Choose a reason for hiding this comment

github-actions bot Apr 3, 2024

Choose a reason for hiding this comment

vgvassilev left a comment

Choose a reason for hiding this comment

vgvassilev Apr 6, 2024

Choose a reason for hiding this comment

vaithak Apr 10, 2024

Choose a reason for hiding this comment

github-actions bot left a comment

Choose a reason for hiding this comment

github-actions bot Apr 10, 2024

Choose a reason for hiding this comment

vaithak Apr 12, 2024 • edited Loading

Choose a reason for hiding this comment

parth-07 left a comment

Choose a reason for hiding this comment

vgvassilev left a comment

Choose a reason for hiding this comment

vgvassilev left a comment

Choose a reason for hiding this comment

vaithak commented Apr 12, 2024

codecov bot commented Apr 3, 2024 •

edited

Loading

vaithak Apr 12, 2024 •

edited

Loading