Update `trust_region_step_exact_qr` #1363

YigitElma · 2024-11-13T07:06:39Z

Updates the initial guess for $\alpha$ to be 0 instead of 0.001* $\alpha$ _upper. I found out that the solution of $\alpha$ is usually pretty small. This reduces the total amount of QR's in the while loop.
Removes the last QR outside the while loop of trust_region_step_exact_qr. The loop condition already satisfies the step norm to be around trust radius.

Misc

Adds execute_on_cpu flag to desc.examples.get

github-actions · 2024-11-13T08:18:31Z

|             benchmark_name             |         dt(%)          |         dt(s)          |        t_new(s)        |        t_old(s)        | 
| -------------------------------------- | ---------------------- | ---------------------- | ---------------------- | ---------------------- |
 test_build_transform_fft_midres         |     +0.98 +/- 3.72     | +6.16e-03 +/- 2.34e-02 |  6.36e-01 +/- 1.9e-02  |  6.29e-01 +/- 1.3e-02  |
 test_build_transform_fft_highres        |     +0.98 +/- 2.65     | +9.67e-03 +/- 2.60e-02 |  9.94e-01 +/- 1.6e-02  |  9.84e-01 +/- 2.1e-02  |
 test_equilibrium_init_lowres            |     -1.54 +/- 3.78     | -6.41e-02 +/- 1.57e-01 |  4.10e+00 +/- 6.1e-02  |  4.16e+00 +/- 1.5e-01  |
 test_objective_compile_atf              |     +0.74 +/- 4.11     | +6.01e-02 +/- 3.31e-01 |  8.13e+00 +/- 2.6e-01  |  8.07e+00 +/- 2.0e-01  |
 test_objective_compute_atf              |     -1.80 +/- 2.28     | -1.97e-04 +/- 2.50e-04 |  1.07e-02 +/- 1.3e-04  |  1.09e-02 +/- 2.1e-04  |
 test_objective_jac_atf                  |     +0.91 +/- 3.45     | +1.75e-02 +/- 6.65e-02 |  1.94e+00 +/- 5.0e-02  |  1.93e+00 +/- 4.4e-02  |
 test_perturb_1                          |     -0.35 +/- 2.76     | -5.20e-02 +/- 4.06e-01 |  1.47e+01 +/- 2.6e-01  |  1.47e+01 +/- 3.2e-01  |
 test_proximal_jac_atf                   |     -0.57 +/- 1.00     | -4.66e-02 +/- 8.29e-02 |  8.20e+00 +/- 7.8e-02  |  8.25e+00 +/- 2.7e-02  |
 test_proximal_freeb_compute             |     +1.03 +/- 1.23     | +2.02e-03 +/- 2.41e-03 |  1.99e-01 +/- 1.6e-03  |  1.97e-01 +/- 1.8e-03  |
 test_solve_fixed_iter_compiled          |     -3.42 +/- 1.95     | -6.02e-01 +/- 3.44e-01 |  1.70e+01 +/- 1.4e-01  |  1.76e+01 +/- 3.1e-01  |
 test_build_transform_fft_lowres         |     -0.66 +/- 6.35     | -3.43e-03 +/- 3.29e-02 |  5.15e-01 +/- 2.1e-02  |  5.19e-01 +/- 2.5e-02  |
 test_equilibrium_init_medres            |     +0.83 +/- 1.00     | +3.37e-02 +/- 4.05e-02 |  4.07e+00 +/- 3.4e-02  |  4.04e+00 +/- 2.2e-02  |
 test_equilibrium_init_highres           |     +0.21 +/- 1.04     | +1.10e-02 +/- 5.55e-02 |  5.35e+00 +/- 3.8e-02  |  5.33e+00 +/- 4.0e-02  |
 test_objective_compile_dshape_current   |     +0.28 +/- 0.93     | +1.07e-02 +/- 3.55e-02 |  3.83e+00 +/- 2.2e-02  |  3.82e+00 +/- 2.8e-02  |
 test_objective_compute_dshape_current   |     +1.05 +/- 1.44     | +3.77e-05 +/- 5.15e-05 |  3.62e-03 +/- 2.9e-05  |  3.58e-03 +/- 4.2e-05  |
 test_objective_jac_dshape_current       |     +4.25 +/- 6.94     | +1.66e-03 +/- 2.71e-03 |  4.07e-02 +/- 1.4e-03  |  3.91e-02 +/- 2.3e-03  |
 test_perturb_2                          |     -0.33 +/- 1.47     | -6.18e-02 +/- 2.78e-01 |  1.89e+01 +/- 2.1e-01  |  1.89e+01 +/- 1.9e-01  |
 test_proximal_freeb_jac                 |     -0.71 +/- 1.04     | -5.24e-02 +/- 7.66e-02 |  7.35e+00 +/- 6.2e-02  |  7.40e+00 +/- 4.5e-02  |
 test_solve_fixed_iter                   |     -1.10 +/- 1.85     | -3.07e-01 +/- 5.17e-01 |  2.76e+01 +/- 4.0e-01  |  2.79e+01 +/- 3.3e-01  |
 test_LinearConstraintProjection_build   |     +0.13 +/- 0.89     | +2.84e-02 +/- 1.99e-01 |  2.25e+01 +/- 1.4e-01  |  2.25e+01 +/- 1.4e-01  |

f0uriest · 2024-11-13T17:37:31Z

isn't this the same as the cho option but using LU instead of cholesky?

YigitElma · 2024-11-13T17:57:11Z

isn't this the same as the cho option but using LU instead of cholesky?

Yes, pretty similar. jax.scipy.linlag.solve with symmetric A should call LAPACK ?sysv (docs), and it is something like LU. For finding alpha, I used Secant method.

Cholesky is also really fast (maybe faster than this one), but maybe this is more accurate, that's why I wanted to test it. If I remember correctly the problem with Cholesky was the accuracy.

f0uriest · 2024-11-13T18:02:26Z

Generally the innaccuracy comes from explitly forming J^T J when J is poorly conditioned, so I don't think you'll see any major improvements with this. Also cholesky should be faster.

…e-10 factor

…e/tr_direct

dpanici · 2024-11-20T18:36:21Z

run on multiple threads

…e/tr_direct

codecov · 2024-11-25T03:55:09Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 95.58%. Comparing base (17d4939) to head (e64f764).

Additional details and impacted files

@@           Coverage Diff           @@
##           master    #1363   +/-   ##
=======================================
  Coverage   95.58%   95.58%           
=======================================
  Files          96       96           
  Lines       24597    24598    +1     
=======================================
+ Hits        23512    23513    +1     
  Misses       1085     1085

Files with missing lines	Coverage Δ
desc/examples/__init__.py	`100.00% <100.00%> (ø)`
desc/optimize/tr_subproblems.py	`99.44% <100.00%> (-0.01%)`	⬇️

… r_tr

f0uriest · 2024-12-03T21:34:32Z

desc/optimize/tr_subproblems.py

-                alpha,
-            )
+            alpha = jnp.where((alpha < alpha_lower), alpha_lower, alpha)
+            alpha = jnp.where((alpha > alpha_upper), alpha_upper, alpha)


do these changes help reduce the number of iterations of the subproblem? If so by how much?

We may want to make the same changes to the other trust region subproblems (svd, cho etc) to keep them consistent.

also this could be simplified to alpha = jnp.clip(alpha, alpha_lower, alpha_upper)

I don't know the exact effect of clipping alpha rather than setting it to 0.001*alpha_upper. My reasoning for these changes was to make the initial guess of alpha 0, and whenever alpha goes out of the range setting it to 0.001*alpha_upper seemed to be taking alpha away from the actual solution. So, it seemed necessary to clip alpha when I changed the initial guess.

The total number iterations for root finding depends on the case but these changes help to reduce it. Previously a root finding was taking 5-10 iterations, now around 3-6.

ok in that case can you make the same changes to trust_region_step_exact_{svd,cho}

add new trust region subproblem solver

ee24eeb

YigitElma added performance New feature or request to make the code faster enhancement General label for enhancement. Please also tag with "Speed", "Interface", "Functionality", etc labels Nov 13, 2024

YigitElma self-assigned this Nov 13, 2024

YigitElma added 3 commits November 13, 2024 02:33

update initial alpha

ae15c72

apply limits to alpha

ad5b29b

add execute_on_cpu decorator to des.example.get

141523d

YigitElma added 2 commits November 13, 2024 12:48

fix alpha updating

24f7b42

update docs

cce520e

YigitElma and others added 5 commits November 13, 2024 15:18

reduce rtol, not a fix but improves some tests

2854dac

use qr for main decomposition, use direct for tr subproblem, remove 1…

6b02b61

…e-10 factor

Merge branch 'master' into yge/tr_direct

4986cce

reduce rtol again

cac7612

Merge branch 'yge/tr_direct' of github.com:PlasmaControl/DESC into yg…

433768e

…e/tr_direct

dpanici and others added 9 commits November 20, 2024 13:36

Merge branch 'master' into yge/tr_direct

1a1b977

full direct

4954c73

Merge branch 'yge/tr_direct' of github.com:PlasmaControl/DESC into yg…

0c63bf7

…e/tr_direct

probably a stupid idea but anyway

fa6a733

at least try regularized

2b29060

Merge branch 'master' into yge/tr_direct

1e27266

update

22421ac

change initial alpha for qr

bd9fab3

fix typo

aa62a0a

remove test method

2a2e57c

YigitElma added 2 commits November 25, 2024 00:36

pass p in while loop, get rid of last qr, only make the norm equal to…

62ec64e

… r_tr

temporarily skip the test

1e036cf

YigitElma added the skip_changelog No need to update changelog on this PR label Nov 25, 2024

update coil test to use average change

3c3dadc

YigitElma changed the title ~~Add new trust region subproblem solver~~ Update trust_region _exact_qr Nov 25, 2024

YigitElma changed the title ~~Update trust_region _exact_qr~~ Update trust_region_step_exact_qr Nov 25, 2024

YigitElma removed the enhancement General label for enhancement. Please also tag with "Speed", "Interface", "Functionality", etc label Nov 25, 2024

YigitElma marked this pull request as ready for review November 27, 2024 06:38

dpanici requested review from a team, rahulgaur104, f0uriest, ddudt, dpanici, kianorr, sinaatalay and unalmis and removed request for a team November 27, 2024 18:40

f0uriest reviewed Dec 3, 2024

View reviewed changes

dpanici and others added 5 commits December 3, 2024 17:05

Merge branch 'master' into yge/tr_direct

47f00ca

Merge branch 'master' into yge/tr_direct

2165797

Merge branch 'master' into yge/tr_direct

0956783

Merge branch 'master' into yge/tr_direct

ba92240

Merge branch 'master' into yge/tr_direct

e64f764

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update `trust_region_step_exact_qr` #1363

Update `trust_region_step_exact_qr` #1363

YigitElma commented Nov 13, 2024 •

edited

Loading

github-actions bot commented Nov 13, 2024 •

edited

Loading

f0uriest commented Nov 13, 2024

YigitElma commented Nov 13, 2024 •

edited

Loading

f0uriest commented Nov 13, 2024

dpanici commented Nov 20, 2024

codecov bot commented Nov 25, 2024 •

edited

Loading

f0uriest Dec 3, 2024

f0uriest Dec 3, 2024

YigitElma Dec 3, 2024

YigitElma Dec 3, 2024

f0uriest Dec 5, 2024

Update trust_region_step_exact_qr #1363

Are you sure you want to change the base?

Update trust_region_step_exact_qr #1363

Conversation

YigitElma commented Nov 13, 2024 • edited Loading

Misc

github-actions bot commented Nov 13, 2024 • edited Loading

f0uriest commented Nov 13, 2024

YigitElma commented Nov 13, 2024 • edited Loading

f0uriest commented Nov 13, 2024

dpanici commented Nov 20, 2024

codecov bot commented Nov 25, 2024 • edited Loading

Codecov Report

f0uriest Dec 3, 2024

Choose a reason for hiding this comment

f0uriest Dec 3, 2024

Choose a reason for hiding this comment

YigitElma Dec 3, 2024

Choose a reason for hiding this comment

YigitElma Dec 3, 2024

Choose a reason for hiding this comment

f0uriest Dec 5, 2024

Choose a reason for hiding this comment

Update `trust_region_step_exact_qr` #1363

Update `trust_region_step_exact_qr` #1363

YigitElma commented Nov 13, 2024 •

edited

Loading

github-actions bot commented Nov 13, 2024 •

edited

Loading

YigitElma commented Nov 13, 2024 •

edited

Loading

codecov bot commented Nov 25, 2024 •

edited

Loading