
Conversation

@vlad17 (Contributor) commented Jun 3, 2022

This initial version is f32-only for accelerators, since it relies on an eigh call (which itself is f32 at most) in its inner loop.

For details, see the jax.experimental.linalg.standard_lobpcg documentation.
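A hypothetical usage sketch: the argument names `A`, `X`, `m`, and `tol` come from the review discussion below, and the two return values are an assumption; see the standard_lobpcg docstring for the actual signature.

```python
import jax
import jax.numpy as jnp
from jax.experimental import linalg

# Hypothetical sketch: A is a symmetric matrix, X holds k initial
# search directions; return arity is an assumption, not the real API.
n, k = 100, 4
A = jnp.diag(jnp.linspace(1.0, 2.0, n))                 # simple SPD test matrix
X = jax.random.normal(jax.random.PRNGKey(0), (n, k))    # random starting block
w, V = linalg.standard_lobpcg(A, X, m=100, tol=1e-5)    # top-k eigenpairs
```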

@google-cla bot commented Jun 3, 2022

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up-to-date status, view the checks section at the bottom of the pull request.

@vlad17 force-pushed the topk branch 3 times, most recently from 3d3cfc5 to 18b1d8e on June 3, 2022
@vlad17 (Contributor, Author) commented Jun 3, 2022

@shoyer @tabakg @lobpcg would any of you be interested in reviewing?

@vlad17 force-pushed the topk branch 2 times, most recently from 21173cc to 9e0ed50 on June 3, 2022
@jakevdp requested a review from shoyer on June 3, 2022
'jax_traceback_filtering': 'off',
}

# TODO(vladf): add f64 tests just to verify it compiles?

I think that would be good -- you could probably check whether it's running on CPU and disable them otherwise?

@vlad17 (Contributor, Author) commented Jun 5, 2022

It looks like there are f64 tests auto-triggered by the github action matrix (via env var), so I think all I should really need to do here is adjust the epsilons.

@shoyer (Collaborator) commented Jun 4, 2022

A couple of high-level thoughts:

  1. It would be nice to support matrix-free linear operators defined on pytrees, like the solvers in jax.scipy.sparse.linalg. For example, imagine solving for the largest eigenvalue pairs of a function that evaluates a linearized neural network.
  2. It would be nice to support an iterative interface with init and update functions, similar to the optimizers in Optax. This sort of inversion of control provides valuable flexibility in cases where a full eigenvalue solve from scratch is prohibitively expensive. For example, imagine performing a single LOBPCG step to re-estimate the top eigenvalue after each gradient descent step when training a neural net, as is done in spectral normalization.

Neither of these is a deal breaker for the first iteration, though.
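For concreteness, here is a purely hypothetical sketch of what suggestion 2 could look like. None of these names exist in this PR, and the update step is drastically simplified (plain QR in place of the PR's orthonormalization routine, and a naive P update):

```python
import jax.numpy as jnp
from typing import NamedTuple

class LOBPCGState(NamedTuple):
    X: jnp.ndarray  # (n, k) current orthonormal eigenvector estimates
    P: jnp.ndarray  # (n, k) previous search directions

def lobpcg_init(X0):
    Q, _ = jnp.linalg.qr(X0)
    return LOBPCGState(X=Q, P=jnp.zeros_like(Q))

def lobpcg_update(A_mul, state):
    X, P = state
    k = X.shape[1]
    theta = jnp.sum(X * A_mul(X), axis=0)        # Rayleigh quotients
    R = A_mul(X) - X * theta                     # block residuals
    S, _ = jnp.linalg.qr(jnp.hstack([X, R, P]))  # orthonormal search basis
    w, Q = jnp.linalg.eigh(S.T @ A_mul(S))       # Rayleigh-Ritz step
    V = S @ Q[:, -k:]                            # top-k Ritz vectors
    new_state = LOBPCGState(X=V, P=V - X @ (X.T @ V))
    return (w[-k:], V), new_state
```

Carrying P in the state is what would make a single warm-started update cheaper than restarting a full solve, per the exchange below.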

@vlad17 (Contributor, Author) commented Jun 4, 2022

> It would be nice to support matrix-free linear operators defined on pytrees, like the solvers in jax.scipy.sparse.linalg. For example, imagine solving for the largest eigenvalue pairs of a function that evaluates a linearized neural network.

Great idea; this should be easy to add. Now that I think about it, Shankar (skrishnan@google.com) is another user who'd immediately benefit from an interface like that.

> It would be nice to support an iterative interface with init and update functions, similar to the optimizers in Optax. This sort of inversion of control provides valuable flexibility in cases where a full eigenvalue solve from scratch is prohibitively expensive. For example, imagine performing a single LOBPCG step to re-estimate the top eigenvalue after each gradient descent step when training a neural net, as is done in spectral normalization.

Could you elaborate on this? I'd be very willing to do this as a follow-on if I had a user to work with on their ideal API, but as stated I don't quite see why setting the initial X to the previous value with maximum iteration m=1 wouldn't work.

@shoyer (Collaborator) commented Jun 5, 2022

> It would be nice to support an iterative interface with init and update functions, similar to the optimizers in Optax. This sort of inversion of control provides valuable flexibility in cases where a full eigenvalue solve from scratch is prohibitively expensive. For example, imagine performing a single LOBPCG step to re-estimate the top eigenvalue after each gradient descent step when training a neural net, as is done in spectral normalization.
>
> Could you elaborate on this? I'd be very willing to do this as a follow-on if I had a user to work with on their ideal API, but as stated I don't quite see why setting the initial X to the previous value with maximum iteration m=1 wouldn't work.

Wouldn't we want to calculate the matrix P from the previous iteration?

@vlad17 (Contributor, Author) commented Jun 5, 2022

Ah, gotcha. That makes sense, and it seems like it'd be a matter of exposing the body function of the current method. Maybe it'd be best to wait until the API solidifies a little, though, since a largest=False option is incoming, and it'd be good to figure out how an iterative interface would work with a preconditioner.

@vlad17 (Contributor, Author) commented Jun 8, 2022

@shoyer @tabakg just updated the PR on a flight home; let me know what you think of the new interface and tests.

@vlad17 (Contributor, Author) commented Jun 14, 2022

Here are some cool curves (which can be generated from the tests) of JAX vs SciPy f32 on 1000x1000 versions of the test matrices for top-10 eigs. (I set the convergence tol to 0, which is why nothing converges.)

[Plots: linear, geometric, and ring Laplacian]

[Plot: clustered eigenvalues]

@lobpcg commented Jun 14, 2022

> Here are some cool curves (which can be generated from the tests) of JAX vs SciPy f32 on 1000x1000 versions of the test matrices for top-10 eigs. (I set the convergence tol to 0, which is why nothing converges.)

I would like to reproduce these tests standalone in SciPy and check whether some bugs need to be fixed there, since the runs appear too unstable. Could you please upload the code that calls SciPy for these tests? Which version of SciPy did you use to get these plots?

@vlad17 (Contributor, Author) commented Jun 14, 2022

@lobpcg I used 1.8.0 (did anything change in 1.8.1?). The examples are all the same as the unit tests in the PR, but at size 1000.

The actual colab to make the viz depends on some Google-internal features for unrelated things, but I can try to find some time to clean up the notebook for a public-facing version. Would it be more appropriate to post that as a SciPy GitHub issue, just to keep this thread focused on JAX?

@lobpcg commented Jun 14, 2022

> @lobpcg I used 1.8.0 (did anything change in 1.8.1?). The examples are all the same as the unit tests in the PR, but at size 1000.
>
> The actual colab to make the viz depends on some Google-internal features for unrelated things, but I can try to find some time to clean up the notebook for a public-facing version. Would it be more appropriate to post that as a SciPy GitHub issue, just to keep this thread focused on JAX?

1.8.0 is representative. I made changes in lobpcg specifically for float32, but that was before.

Of course, if you create a reproducible issue in SciPy, that would be ideal. No need to include the code that makes the actual plots; just please add a reference to your post with the plots above, and a ping to me.

If SciPy fails on smaller sizes like 100, all of those would be good examples to include in SciPy as unit tests, if I find a fix so that they all run.

@vlad17 (Contributor, Author) commented Jun 15, 2022

@lobpcg filed scipy/scipy#16408 with just the cases, no viz.

@rmlarsen (Contributor) left a comment


- Despite increased iteration cost, we always maintain an orthonormal basis
for the block search directions.
- We change the convergence criterion; see the `tol` argument.
- Soft locking is intentionally not implemented; it relies on choosing an
@rmlarsen (Contributor):

Maybe refer to a paper where soft locking is introduced here?

@vlad17 (Contributor, Author):

There isn't a canonical link (a very long researchgate URL is all I could find since the original host is gone). So I left a DOI.

@vlad17 (Contributor, Author):

In particular, I believe the following link is the original source, but it's broken:

http://math.cudenver.edu/%CB%9Caknyazev/research/conf/cm04%20soft%20locking/cm04.pdf

ResearchGate has a not-so-pretty URL, but it works:
https://www.researchgate.net/publication/343530965_Hard_and_soft_locking_Hard_and_soft_locking_in_iterative_methods_for_symmetric_eigenvalue_problems

Perhaps @lobpcg might have a better reference for this?

action.
X : An `(n, k)` array representing the initial search directions for the `k`
desired top eigenvectors. This need not be orthogonal, but must be
linearly independent.
@rmlarsen (Contributor):

"linearly independent" is not a very precise term in finite precision. I assume the method gradually breaks down as cond(X) increases?

@vlad17 (Contributor, Author):

Yeah, I orthonormalize it upon entry, so technically it's "linearly independent enough, such that the orthogonalization routine does not decide it's rank-deficient according to its cutoffs". I'll try to phrase that concisely.

subspace of) the Krylov basis `{X, A X, A^2 X, ..., A^m X}`.
tol : A float convergence tolerance; an eigenpair `(lambda, v)` is converged
when its residual L2 norm `r = |A v - lambda v|` is below
`tol * 10 * n * (lambda + |A v|)`, which
@rmlarsen (Contributor):

The factor n is probably quite conservative. Using a probabilistic argument, it might be worth trying to scale as sqrt(n) instead. But there might be counterexamples.

@vlad17 (Contributor, Author):

Discussed offline: n just happened to work best for making the tolerances in the unit tests all land around floating-point epsilon (and SciPy usually uses this factor for eigenvalue routines).
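For reference, the documented criterion amounts to something like the following sketch; `A_mul`, `V`, and `lam` are hypothetical names, with `lam` holding the current eigenvalue estimates.

```python
import jax.numpy as jnp

def converged_sketch(A_mul, V, lam, tol):
    # Sketch of the documented test: eigenpair (lambda, v) is converged
    # when ||A v - lambda v|| < tol * 10 * n * (lambda + ||A v||).
    n = V.shape[0]
    AV = A_mul(V)
    resid = jnp.linalg.norm(AV - V * lam, axis=0)
    return resid < tol * 10 * n * (lam + jnp.linalg.norm(AV, axis=0))
```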

return w[::-1], V[:, ::-1]


def _svqb(X):
@rmlarsen (Contributor) commented Jun 28, 2022

Please add a docstring for this method; it is quite non-trivial. Please write out the math instead of just giving a reference.

@vlad17 (Contributor, Author):

Done.
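Roughly, SVQB boils down to something like the sketch below. This is a compressed illustration, not the PR's `_svqb`; in particular, the eigenvalue floor used to guard against rank deficiency here is an assumption.

```python
import jax.numpy as jnp

def svqb_sketch(X):
    # Scale columns so the Gram matrix has a unit diagonal.
    X = X / jnp.linalg.norm(X, axis=0, keepdims=True)
    C = X.T @ X                    # k x k Gram matrix: one big matmul
    w, V = jnp.linalg.eigh(C)      # small, accelerator-friendly eigh
    # Floor tiny eigenvalues so near-dependent columns don't blow up.
    w = jnp.maximum(w, jnp.finfo(X.dtype).eps * X.shape[1] * w[-1])
    return X @ (V / jnp.sqrt(w))   # orthonormal basis for range(X)
```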



def _project_out(basis, U):
# "twice is enough" from shoyer's reference:
@rmlarsen (Contributor):

docstring?

@vlad17 (Contributor, Author):

done
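A minimal sketch of the "twice is enough" projection, with hypothetical names and assuming `basis` has orthonormal columns:

```python
import jax.numpy as jnp

def project_out_sketch(basis, U):
    # Remove the components of U lying in span(basis). Repeating the
    # projection ("twice is enough") cleans up the floating-point
    # cancellation error left behind by the first pass.
    for _ in range(2):
        U = U - basis @ (basis.T @ U)
    return U
```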

#
# Usually it requires solving the complicated standard eigensystem
# U^-T S^T A S U^-1 @ Q = w * Q and then backsolving V = U^-1 Q,
# but if S is standard orthonormal then we just need to find
@rmlarsen (Contributor) commented Jun 28, 2022

I think this comment is probably more confusing than helpful. Maybe just mention that we keep S orthonormal, in which case we just need to solve the projected eigenvalue problem for S.T @ A @ S to obtain the Ritz values and vectors of A w.r.t. span(S).

@vlad17 (Contributor, Author):

done
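In code, the suggested simplification reads roughly as follows (a sketch with hypothetical names; `A_mul` stands in for multiplication by A):

```python
import jax.numpy as jnp

def rayleigh_ritz_sketch(A_mul, S):
    # With S orthonormal, the Ritz pairs of A w.r.t. span(S) come from
    # the small projected eigenproblem directly; no backsolve is needed.
    w, Q = jnp.linalg.eigh(S.T @ A_mul(S))  # ascending Ritz values
    return w, S @ Q                         # Ritz vectors in the original space
```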

#
# https://epubs.siam.org/doi/abs/10.1137/0725014
# https://www.jstage.jst.go.jp/article/ipsjdc/2/0/2_0_298/_article
n, k = X.shape
@rmlarsen (Contributor):

Please write out the math here instead of just citing the papers. The code is not particularly readable with all the concatenations etc.

@vlad17 (Contributor, Author):

done

"""Derives a truncated orthonormal basis for `X`.
SVQB [1] is an accelerator-friendly orthonormalization procedure, which
squares the matrix `C = X.T @ X` and computes an eigenbasis for a smaller
@rmlarsen (Contributor):

I think you could use QR instead of an eigendecomposition if X is full rank, and even Cholesky if X is sufficiently well-conditioned. Both of these options would be more accelerator-friendly.

See, e.g.: https://epubs.siam.org/doi/abs/10.1137/18M1218212

However, I don't think you can know that these are safe to use. But it's worth thinking about, as it might allow you to work with larger k.

@rmlarsen (Contributor) commented Jun 29, 2022

Rank-revealing QR would be a nice option, if it were available in JAX. It might be worth adding. A version based on the randomized-projection approach could be made accelerator-friendly.

@vlad17 (Contributor, Author):

By QR here, you mean QR-ing X itself, not its square, right? At one point early in the algorithm-development process I tried that but decided against it because the speed wasn't satisfactory. Cholesky on X.T @ X would be faster, but rank deficiency is specifically a case that needs to be handled here.

That said, I'll put trying the QR approach back on my TODO list. It's worth revisiting, I don't have hard numbers saying it's too slow, and it avoids squaring.

I'm really intrigued by the randomized projection you're mentioning. Do you have a reference?

@rmlarsen (Contributor):

Here is a reference for the RRQR with randomized projection: https://arxiv.org/abs/2008.04447

You are right that computing the QR decomposition of X is likely slower, since X.T@X is so fast on an accelerator. But avoiding the squaring would be nice. Just some things to experiment with, I guess.
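To make the trade-off concrete, here is a sketch of the CholeskyQR alternative discussed above (a hypothetical helper, not part of the PR):

```python
import jax.numpy as jnp
from jax.scipy.linalg import solve_triangular

def cholesky_qr_sketch(X):
    # CholeskyQR: orthonormalize X through its Gram matrix. Only the
    # k x k Cholesky and the triangular solve are non-matmul work, so it
    # is very accelerator-friendly, but it fails outright when X is rank
    # deficient, which is exactly the case _svqb is designed to survive.
    L = jnp.linalg.cholesky(X.T @ X)               # X.T @ X = L @ L.T
    return solve_triangular(L, X.T, lower=True).T  # Q = X @ inv(L.T)
```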

are mutually orthonormal.
"""

# See Sec. 6.9 of The Symmetric Eigenvalue Problem by Beresford Parlett [1]
@rmlarsen (Contributor):

Thanks, very nice.

@google-ml-butler bot added the kokoro:force-run and pull ready (Ready for copybara import and testing) labels on June 29, 2022
@vlad17 force-pushed the topk branch 2 times, most recently from b7c5016 to 4bf25e5 on June 30, 2022
This is a partial implementation of the similar [scipy lobpcg
function](https://docs.scipy.org/doc/scipy/reference/generated/scipy.sparse.linalg.lobpcg.html).
@copybara-service bot merged commit 4446c73 into jax-ml:main on Jun 30, 2022