Sparse pullback for big performance gain by unalmis · Pull Request #2170 · PlasmaControl/DESC

unalmis · 2026-04-17T22:59:32Z

Resolves Sparsity preserving pullbacks #2168 .
- Yields factor of $N \times$ improvement in memory and compute speed to get Jacobian $R^{N \times M}$ of map $f \colon R^M \to R^N$. Basically if you had some expensive function, with proper application throughout you could get Jacobian for around the cost you'd get a single row before.
- $N^2$ for check pointed funs
- Even if you are only doing a single vjp, it is much faster and more memory effecient due to reasons discussed in links threads.
- could do sparse lin alg on the cotangent update too, but i didn't make assumption that cotangent is sparse
- Functions added are sparse_pullback and sparse_pullback_map
Removes
- bounce1d optimization.
- Unneeded flags (is_reshaped, is_fourier) that users said were confusing (backwards compatible) as well as the developer flags Bref, Lref that should not be there.
Previously, pitch_batch_size was getting ignored. This fixes that by adding strip_dim0 flag to batch_map.
Switch resolution to per field period to simplify use and analysis #2182
Resolves the fixme comment so that gradients are consistent #2185

notes

They are decorator functions, so it is non-invasive and can improves many objective such as bounce, balloon, qs, omni, surface integral, free surface etc, but more generally can apply to any atomic computation. Trivial to use now that I have done all the math and resolved implementation sharp bits.
for example the 4d singular integral kernel can be reduced to 2d pullback
For this PR, I only apply the decorator at one point in bounce. Because itsdone at intermediate point I needed to change some code. Note the CI benchmarks won't show that improvement since those objectives were onto $R^1$, but benchmarking with tensor-board indeed shows the factor of $N$ improvement in speed and memory computing the Jacobian. And this is not yet applied to the full pipeline. See Single contangent pullback through compute pipeline #2171 .
Unrelated to this PR since it's always been like this, but computing the proximal projection Jacobian with Force balance constraint is more expensive that computing the bigger unconstrained Jacobian since the svd solve is way more expensive now than computing a vjp of bounce integrals. Perhaps the decorator could be applied to stuff there; I didn't look.
If the partial summation in Poloidal FFT Implementation #1508 is done so that the the transforms are factorized then we can extend the pullback wrappers to include the spectral to real space transform on each surface to by decorating the larger function with a sparse_pullback as well. Or use Fourier-Chebysev as i had suggested in Making transforms more efficient #1243 to make it simpler. This would have thr advantage of further reducing memory.
This should renew interest in Upsample data above midplane to full grid assuming stellarator symmetry #1206 and Compute function and gradient simultaneously for reverse mode #1872 and Generalize toroidal angle beyond phi cylindrical #465.

Co-authored-by: Kaya Unalmis <kayaunalmis@proton.me>

unalmis · 2026-05-07T20:06:50Z

Please just review desc batching.py and the derivatives.py then.

Yea so those are the only things that need reviewing before approval which should take like 10 min. Other stuff is just stuff reviewers requested on #2157 and #2147.

unalmis · 2026-05-15T13:43:35Z

when is this getting merged

f0uriest

Still about 1/2 to go but leaving these here for now.

One big point is that if we're messing with custom AD stuff I think it would be good practice to add tests comparing AD of the relevant objectives to finite differences (can use very low res, don't care about physics convergence), both to check that the implementation is correct and also to guard against us accidentally applying the sparse pullback in places where its not strictly correct.

unalmis · 2026-05-17T21:19:35Z

Still about 1/2 to go but leaving these here for now.

One big point is that if we're messing with custom AD stuff I think it would be good practice to add tests comparing AD of the relevant objectives to finite differences (can use very low res, don't care about physics convergence), both to check that the implementation is correct and also to guard against us accidentally applying the sparse pullback in places where its not strictly correct.

Locally I tested with the jacobian equivalent of test_compute_everything. It worked to machine precision.
I tested finite difference a bunch for these objectives, and it works well.
I prefer to keep the tests I add as independent unit tests that test correctness of code.
There are enough monolithic smoke tests. Part of the reason theses pr's are big is because I have to plumb through changes for the 100 different tests I have for these objectives. Adding more smoke tests makes it harder to maintain, annoys reviewers for pr leght, and my hunch is someone would eventually decide to delete for Reduce testing time/memory #914

(can use very low res, don't care about physics convergence),

That is not true/possible. See the supplementary information in publications. Briefly, For nontrivial computational problems where not everything is C^infinty, an algorithm to solve a problem needs to have amazing convergence properties, and be robust to topology changes, for the duscretization error to be correlated enough nearby a given point in the optimization space for the finite difference derivative to have any chance if being accurate. (Again explained better in the pdf).

You can see that finite difference derivatives only make sense at high resolution computations of the algorithm. Auto diff makes sense at any resolution because it estimates the derivative from only information at a single point in the optimization landscape. (Of course if discretization error is high then over an optimization it could still stall as varying discretization error can affect the decent direction,but that's unrelated for this discussion). In general you'll need high res to get finite diff to match auto diff.

unalmis and others added 30 commits April 9, 2026 19:03

Add fft grid and raz grid to test against master

8fb2b38

remove noise by tighten tolerance

25164d0

final attempt

0d23c66

rory comment

5e70567

fix last commit

945f1af

Increase correlation in discretization error for optimization

c2ecc4b

Merge branch 'master' into ku/test

bb8ac6a

.

7489317

increase tol for test

e240249

remove not implemented todo

be41c58

.

a55b170

add back short-circuit

2cff860

collect redundant docs

1727fba

Fix if statements

339643b

Co-authored-by: Kaya Unalmis <kayaunalmis@proton.me>

Merge branch 'master' into ku/test

bda562a

Resolves #2162

ff53f80

loosen tol on test

83ffee6

flake8

ccf228f

flake8 blank line space

f8a3515

future proof

d05bda1

daniel comments

c06687d

fix render

d5a682f

Apply suggestions from code review

f325bbf

Co-authored-by: Kaya Unalmis <kayaunalmis@proton.me>

Apply suggestions from code review

1792e91

Apply suggestions from code review

89479dc

dan comment v2

947641b

dan v2

13b6870

more dan

a58c075

last dan

ad86912

last commit to desc

f4faed4

unalmis requested review from ddudt and f0uriest May 4, 2026 20:32

Merge branch 'master' into ku/sparse_pullback

b39ab31

unalmis requested review from ddudt, dpanici, f0uriest and rahulgaur104 and removed request for ddudt, dpanici, f0uriest and rahulgaur104 May 9, 2026 19:37

unalmis added the P∞ P_infty. Ready to merge > 1 years. Top priority to merge to prevent further delay of research. label May 9, 2026

Merge branch 'master' into ku/sparse_pullback

1fd2258

unalmis mentioned this pull request May 16, 2026

Neo wrapper #1472

Closed

f0uriest reviewed May 17, 2026

View reviewed changes

address rory

81cb1a9

unalmis requested a review from f0uriest May 18, 2026 05:25

unalmis added 2 commits May 18, 2026 22:16

Merge branch 'master' into ku/sparse_pullback

eaaaff7

Merge branch 'master' into ku/sparse_pullback

4955144

unalmis requested review from ddudt, dpanici, f0uriest and rahulgaur104 and removed request for ddudt, dpanici, f0uriest and rahulgaur104 May 27, 2026 23:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sparse pullback for big performance gain#2170

Sparse pullback for big performance gain#2170
unalmis wants to merge 87 commits into
masterfrom
ku/sparse_pullback

unalmis commented Apr 17, 2026 •

edited

Loading

Uh oh!

unalmis commented May 7, 2026 •

edited

Loading

Uh oh!

unalmis commented May 15, 2026

Uh oh!

f0uriest left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

unalmis commented May 17, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

unalmis commented Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

notes

Uh oh!

unalmis commented May 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

unalmis commented May 15, 2026

Uh oh!

f0uriest left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

unalmis commented May 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

unalmis commented Apr 17, 2026 •

edited

Loading

unalmis commented May 7, 2026 •

edited

Loading

unalmis commented May 17, 2026 •

edited

Loading