
TAO interface #143

Merged · 72 commits · Oct 3, 2024

Conversation

@jrmaddison jrmaddison commented May 2, 2024

Optimisation using PETSc/TAO.

Uses the pyadjoint OverloadedType interface to provide a generic implementation. However, these interfaces don't provide enough information about the data decomposition under MPI parallelism. This is worked around in this PR using OverloadedType._ad_assign_numpy, but that requires a temporary, global NumPy array. Fixing this requires a change to OverloadedType and then further updates elsewhere (in particular in Firedrake).

Related, OverloadedType._ad_to_list returns a list rather than a NumPy ndarray.

Not added:

  • MaximisationProblem support.
  • Equality or inequality constraints.
  • Any optimisations to prevent extra forward or first adjoint reruns (e.g. with second order methods which might call TAOObjective.objective_gradient followed by TAOObjective.hessian).
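The NumPy-mediated packing described above can be sketched with a toy overloaded type. This is a pure-Python illustration of the pattern, not pyadjoint's implementation; `ToyFloat`, `pack_controls` and `unpack_controls` are hypothetical names, though the `_ad_to_list`/`_ad_assign_numpy` shapes follow the interfaces discussed in this PR:

```python
import numpy as np

class ToyFloat:
    """Minimal stand-in for an OverloadedType with the NumPy interface."""
    def __init__(self, value):
        self.value = float(value)

    def _ad_to_list(self):
        # Note: a Python list, not an ndarray (see the point above)
        return [self.value]

    @staticmethod
    def _ad_assign_numpy(dst, src, offset):
        # Consume entries of the flat array; return the new offset
        dst.value = float(src[offset])
        return dst, offset + 1

def pack_controls(controls):
    """Flatten all controls into one (global) NumPy array."""
    data = []
    for c in controls:
        data.extend(c._ad_to_list())
    return np.asarray(data)

def unpack_controls(controls, array):
    """Write a flat array back into the controls, tracking the offset."""
    offset = 0
    for c in controls:
        c, offset = c._ad_assign_numpy(c, array, offset)

controls = [ToyFloat(1.0), ToyFloat(2.0)]
flat = pack_controls(controls)          # the temporary, global array
unpack_controls(controls, flat * 10.0)  # controls now hold 10.0 and 20.0
```

The flat array here is the global temporary the PR description warns about: under MPI it holds every process's degrees of freedom.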

@jrmaddison jrmaddison marked this pull request as ready for review May 20, 2024 17:47
@jrmaddison jrmaddison requested a review from dham May 20, 2024 17:47
dham commented Jun 19, 2024

I think that even if you are going to go via Numpy, this should be done by giving OverloadedType an ._ad_as_petsc_vec method so that we can go back and do it properly for Firedrake Functions afterwards. As it stands, I think that the TAO interface would have to be substantially rewritten in order to support OverloadedTypes that had a native mechanism for casting to PETSc Vec, and that seems unfortunate.

@jrmaddison replied:

> I think that even if you are going to go via Numpy, this should be done by giving OverloadedType an ._ad_as_petsc_vec method so that we can go back and do it properly for Firedrake Functions afterwards. As it stands, I think that the TAO interface would have to be substantially rewritten in order to support OverloadedTypes that had a native mechanism for casting to PETSc Vec, and that seems unfortunate.

I think the 'one-or-more' pyadjoint interfaces mean that copying is going to be needed somewhere. e.g. there's nothing to stop a mixture of AdjFloats and firedrake.Function controls. Maybe those could be PETSc copy operations if the backend supplies PETSc Vecs -- I suggested NumPy as an interface layer as that's close to what OverloadedTypes already supply.
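Since controls may mix types (e.g. an AdjFloat alongside a Function), some copy layer with per-control offsets is hard to avoid. A pure-NumPy sketch of the idea; `pack`/`unpack` are illustrative names, not the PR's API:

```python
import numpy as np

def control_sizes(controls):
    """Degrees of freedom contributed by each control."""
    return [1 if np.isscalar(c) else np.asarray(c).size for c in controls]

def pack(controls):
    """Copy a mixed list of controls into one contiguous buffer."""
    sizes = control_sizes(controls)
    buf = np.empty(sum(sizes))
    offset = 0
    for c, size in zip(controls, sizes):
        buf[offset:offset + size] = c  # scalar broadcasts, array copies
        offset += size
    return buf

def unpack(controls, buf):
    """Split a buffer back into per-control values."""
    out, offset = [], 0
    for size in control_sizes(controls):
        chunk = buf[offset:offset + size]
        out.append(chunk[0] if size == 1 else chunk.copy())
        offset += size
    return out

# An AdjFloat-like scalar control and a Function-like vector control:
mixed = [3.0, np.array([1.0, 2.0, 4.0])]
buf = pack(mixed)   # one flat buffer: [3., 1., 2., 4.]
```

The same offset bookkeeping would apply if the copies were PETSc Vec operations rather than NumPy slices.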

dham commented Jun 20, 2024

OK, I can see what's going on here. I actually think it would be an easy fix to do this without the global gather; it just requires a few very simple methods to be added to OverloadedType, but I guess that can be delayed. Please add a health warning about the global gather in the class docstring, because the comment on the PR will not be visible to users.

@jrmaddison replied:

> OK, I can see what's going on here. I actually think it would be an easy fix to do this without the global gather; it just requires a few very simple methods to be added to OverloadedType, but I guess that can be delayed. Please add a health warning about the global gather in the class docstring, because the comment on the PR will not be visible to users.

It's not even a global gather: we just need to know the number of process-local (owned) degrees of freedom, and OverloadedType doesn't provide this except indirectly via _ad_convert_numpy.

dham commented Jun 20, 2024

> OK, I can see what's going on here. I actually think it would be an easy fix to do this without the global gather; it just requires a few very simple methods to be added to OverloadedType, but I guess that can be delayed. Please add a health warning about the global gather in the class docstring, because the comment on the PR will not be visible to users.

> It's not even a global gather: we just need to know the number of process-local (owned) degrees of freedom, and OverloadedType doesn't provide this except indirectly via _ad_convert_numpy.

OK, I'm confused again. I think the current implementation does a global gather onto a single numpy array. Is that not correct?

jrmaddison commented Jun 20, 2024

> OK, I'm confused again. I think the current implementation does a global gather onto a single numpy array. Is that not correct?

You're correct. I'd missed (or forgotten) how OverloadedType._ad_assign_numpy and OverloadedType._ad_to_list behave -- e.g. the use of a gather in OverloadedType._ad_to_list.

This obviously won't scale, but I also don't see an alternative via the current OverloadedType interface.
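To illustrate the scalability point: constructing a distributed vector needs only each process's owned dof count, from which the ownership ranges follow by an exclusive prefix sum, with no gather of the data itself. A serial sketch (`ownership_ranges` is a hypothetical helper, not pyadjoint or PETSc API):

```python
def ownership_ranges(local_sizes):
    """Exclusive prefix sum of per-rank owned dof counts.

    Returns one (start, end) pair per rank; rank r owns [start, end).
    In MPI terms this is what an allgather of a single integer (the
    local size) provides, versus gathering all of the data.
    """
    ranges, start = [], 0
    for n in local_sizes:
        ranges.append((start, start + n))
        start += n
    return ranges

# Four ranks owning 3, 4, 2 and 5 dofs respectively:
print(ownership_ranges([3, 4, 2, 5]))
# [(0, 3), (3, 7), (7, 9), (9, 14)]
```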

jrmaddison commented Jul 19, 2024

Firedrake test requires firedrakeproject/firedrake#3657

The method should implement a routine for assigning other to `self`.

Note:
This method should not be employed for immutable types.
@dham (Member):

I think this line here highlights an issue with this approach. OverloadedType actually conflates two sets of functionality:

  1. Things needed in order to tape a variable.
  2. Things needed in order to work with variables when computing taped operations (replay, tlm, adjoint etc.) and optimisation.

This functionality is part of category 2 (it's not needed for taping). At that stage of operations, all variables are treated as immutable, so assignment looks like a strange operation at best. The fact that it doesn't work for some types is a further red flag.

Assignment itself is also not really what is meant here. The use case in question seems to be assigning a scalar to a vector, which is not well-defined (think about non-point-evaluation FunctionSpaces, for example). I think what is really meant here is interpolate.

On the mutability point, I think this should be more like _ad_init_object and return a new value. I.e. the signature would be:

    @classmethod
    def _ad_interpolate(cls, obj, value):

and the required semantics would be to return a new object which is identical to obj except that its value approximates value.
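A minimal sketch of those proposed semantics for a toy NumPy-backed type. `ToyField` is hypothetical, and `_ad_interpolate` was only proposed in this thread, so this is an illustration of the suggested contract, not existing pyadjoint API:

```python
import numpy as np

class ToyField:
    """Toy NumPy-backed overloaded type."""
    def __init__(self, data, name=None):
        self.data = np.asarray(data, dtype=float)
        self.name = name

    @classmethod
    def _ad_interpolate(cls, obj, value):
        # Return a NEW object identical to obj except that its value
        # approximates `value` -- no in-place mutation of obj.
        return cls(np.full_like(obj.data, value), name=obj.name)

u = ToyField([1.0, 2.0, 3.0], name="u")
lb = ToyField._ad_interpolate(u, 0.5)  # new object, value 0.5 everywhere
```

The key property is that `u` is untouched, which sidesteps the immutability objection above.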

@jrmaddison (Author):

_ad_iadd and _ad_imul are existing in-place operations. However, I agree that an in-place operation is not needed here. I think the easiest fix is to change VecInterface.to_petsc to handle the scalar case using VecSetValues.

I also don't think scalar bounds are clearly defined. These could mean

@dham (Member):

This would bring us back to the option of requiring the user to specify bounds of the control type. I think this is probably the right thing - Pyadjoint can't interpret the meaning of variables beyond basic Hilbert space operations (i.e. vector space operations + inner product).

I guess the corner case is the no bounds case. We could either make that a user problem or we'd have to add _ad methods for producing variables that are element-wise minimal or maximal. That would be convenient for users but I'm not sure it smells good.

@jrmaddison (Author):

> requiring the user to specify bounds of the control type

I'd definitely prefer that. Existing pyadjoint examples use scalars, though.

> I guess the corner case is the no bounds case.

That at least can be handled with VecSetValues and np.finfo(PETSc.ScalarType).min/np.finfo(PETSc.ScalarType).max.
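A sketch of that fallback, mapping None to the extreme representable values. This assumes a real, double-precision PETSc.ScalarType and uses pure NumPy for illustration; `bound_array` is a hypothetical helper, not the PR's API:

```python
import numpy as np

# Stand-in for PETSc.ScalarType on a real, double-precision build.
ScalarType = np.float64

def bound_array(bound, n):
    """Expand one side of a (lb, ub) pair to a length-n array.

    None means unbounded, encoded here as the most negative
    representable value for a lower bound (an upper bound would use
    np.finfo(ScalarType).max instead); a scalar broadcasts to all
    dofs; an array is used as given.
    """
    if bound is None:
        return np.full(n, np.finfo(ScalarType).min)
    return np.broadcast_to(np.asarray(bound, dtype=ScalarType), (n,)).copy()

lb = bound_array(None, 4)  # effectively unbounded below
ub = bound_array(1.0, 4)   # scalar bound applied to every dof
```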

@jrmaddison (Author) commented Jul 30, 2024:

I've implemented the 'scalar bound = bound on all dofs' case using VecSet. This might still lead to some surprises depending on the basis, but I think is consistent with pyadjoint behaviour elsewhere.
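The 'scalar bound = bound on all dofs' behaviour can be illustrated with a pure-NumPy stand-in for the VecSet path; `to_flat` is a hypothetical helper, not the PR's VecInterface:

```python
import numpy as np

def to_flat(dst, value):
    """Copy `value` into the flat destination array.

    A scalar fills every entry (the VecSet case); an array must match
    the destination size (the ordinary copy case).
    """
    if np.isscalar(value):
        dst[:] = value
    else:
        value = np.asarray(value)
        if value.shape != dst.shape:
            raise ValueError("size mismatch")
        dst[:] = value
    return dst

x = np.zeros(4)
to_flat(x, 2.5)                        # scalar bound on all dofs
to_flat(x, np.array([1., 2., 3., 4.]))  # per-dof values
```

As noted above, filling dofs with a constant is only a pointwise bound in a nodal (point-evaluation) basis, which is where the possible surprises come from.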

@jrmaddison (Author):

Current pyadjoint behaviour is described here.

#: bounds: lower and upper bounds for the control (optional). None means
#: unbounded. if not None, then it must be a list of the same length as
#: the number controls for the reduced_functional. Each entry in the list
#: must be a tuple (lb, ub), where ub and lb are floats, or objects
#: of the same kind as the control.

@jrmaddison (Author):

The actual check allows None and int as well.

    for b in bound:
        b = create_overloaded_object(b, suppress_warning=True)
        klass = control.tape_value().__class__
        if not isinstance(b, (int, float, type(None), klass)):
            raise TypeError("This pair (lb, ub) should be None, a float, or a %s." % klass)

@jrmaddison (Author):

As discussed in the meeting, this should be fixed separately from this PR.

    def finalize_callback(*args):
        for arg in args:
            if arg is not None:
                arg.destroy()
@dham (Member):

@JDBetteridge says this needs to be .delayed_destroy.

@jrmaddison (Author):

Not available, but the possible deadlock is now documented.

@dham (Member):

@JDBetteridge is unconvinced. Let's discuss this next week.

@jrmaddison (Author) commented Sep 11, 2024:

This is intended to work around the following, which leaks memory on my machine (Ubuntu 24.04, run on two processes)

from firedrake import *
from firedrake.petsc import PETSc, garbage_cleanup

import numpy as np

from itertools import count
import gc
import weakref


class Mat:
    def __init__(self, space):
        self._x = Function(space)
        self._test = TestFunction(space)
        self._memory = np.full(1000000, 1, dtype=np.uint8)

    def mult(self, x, y):
        with self._x.dat.vec_wo as x_v:
            x.copy(result=x_v)
        y_c = assemble(inner(self._x, self._test) * dx)
        with y_c.dat.vec_ro as y_v:
            y_v.copy(result=y)


def attach_destroy_finalizer(obj, *args):
    def finalize_callback(*args):
        for arg in args:
            if arg is not None:
                arg.destroy()

    finalize = weakref.finalize(obj, finalize_callback,
                                *args)
    finalize.atexit = False


mesh = UnitSquareMesh(10, 10)
space = FunctionSpace(mesh, "Lagrange", 1)
u = Function(space, name="u")
with u.dat.vec_ro as u_v:
    n, N = u_v.getSizes()


class Test:
    pass


for _ in count():
    t = Test()
    A = Mat(space)
    mat = PETSc.Mat().createPython(((n, N), (n, N)), A, comm=u.comm)
    # attach_destroy_finalizer(t, mat)  # Uncomment to fix leak

    del t
    gc.collect()
    garbage_cleanup(u.comm)

Arguably it would be better to track down the source of the leak -- but the case where I've hit this was much harder to debug than the above, hence the defensive use of destroy. The main downside is if properties are accessed and retained, e.g.

tao_solver = TAOSolver(...)
tao = tao_solver.tao
del tao_solver
# TaoDestroy has been called for tao

@jrmaddison (Author) commented Sep 19, 2024:

The previous example leaks because garbage_cleanup(u.comm) doesn't call PETSc.garbage_cleanup(u.comm) (is this a separate bug?).

n = sum(vec.getLocalSize() for vec in vecs)
N = sum(vec.getSize() for vec in vecs)
_, isets = PETSc.Vec().concatenate(vecs)
for vec in vecs:
@dham (Member):

If the destructor is really not necessary, let's not do it.

If it is necessary, then _ad_to_petsc() should probably be a context manager so as to remove the footgun.
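The context-manager suggestion would scope the Vec's lifetime to the with-block. A sketch with a dummy resource standing in for a PETSc Vec; `ad_to_petsc` here reflects the behaviour proposed in this comment, not the merged implementation:

```python
from contextlib import contextmanager

class DummyVec:
    """Stand-in for a PETSc Vec whose destroy() must pair with creation."""
    def __init__(self):
        self.destroyed = False

    def destroy(self):
        self.destroyed = True

@contextmanager
def ad_to_petsc(obj):
    """Yield a Vec view of `obj`, guaranteeing destroy() on exit.

    Scoping the lifetime removes the footgun of a caller retaining a
    Vec that a finalizer later destroys behind its back.
    """
    vec = DummyVec()  # real code would create/copy into a PETSc Vec here
    try:
        yield vec
    finally:
        vec.destroy()

with ad_to_petsc(object()) as v:
    assert not v.destroyed  # safe to use inside the block
# after the block the Vec has been destroyed; it cannot be retained
```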

@jrmaddison (Author):

Removed.

x_sub.restoreSubVector(iset, x_sub)


class PETScOptions:
@dham (Member):

Pirate the PETSc Options handler from Firedrake.

@jrmaddison (Author):

Done.

to_petsc(ub_vec, ubs)
tao.setVariableBounds(lb_vec, ub_vec)
lb_vec.destroy()
ub_vec.destroy()
@dham (Member):

please don't destroy unless needed.

@jrmaddison (Author):

Done.

self, tao, H_matrix, M_inv_matrix, B_0_matrix_pc, B_0_matrix, x)

@property
def taoobjective(self):
@dham (Member):

PEP8 name should have underscore.

@jrmaddison (Author):

Fixed, although I followed ROLSolver.rolobjective.

self.tao.solve()
self._vec_interface.from_petsc(self.x, M)
if self.tao.getConvergedReason() <= 0:
raise RuntimeError("Convergence failure")
@dham (Member):

Is this the conventional exception for solver fails?

@jrmaddison (Author) commented Aug 8, 2024:

Updated to use a TAOConvergenceError with a message similar to Firedrake linear solve failures.

pyadjoint doesn't seem to raise exceptions for optimizer failures in the SciPy or ROL interfaces. I'm pretty sure that's unsafe for the SciPy interface, #166.

@dham dham merged commit b9574fb into dolfin-adjoint:master Oct 3, 2024
2 checks passed
@jrmaddison jrmaddison deleted the jrmaddison/tao branch October 3, 2024 13:07