
ENH deal with tiny loss improvements in line search #724

Merged
2 commits merged into Quantco:main on Nov 8, 2023

Conversation

@lorentzenchr (Contributor) commented on Nov 2, 2023

This PR adds a second convergence check, based on gradients, to the line search. It can deal with tiny loss improvements; a rough sketch of the idea is shown below.

This is taken from https://github.com/scikit-learn/scikit-learn/blob/9621539a9defe86ff55c890d5f2475f42697604f/sklearn/linear_model/_glm/_newton_solver.py#L263.

This might help with some of the tests in #723.
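As a minimal sketch of the mechanism (illustrative only, not the actual glum or scikit-learn code; the function names, signatures, and the `tiny_loss_tol` tolerance are assumptions): a backtracking step is accepted either on a sufficient loss decrease (Armijo) or, as a fallback, when the loss change is within floating-point noise but the gradient norm shrinks.

```python
import numpy as np

def line_search(loss, grad, coef, step, loss_old, grad_old,
                sigma=1e-4, beta=0.5, max_iter=20, tiny_loss_tol=None):
    """Backtracking line search with a second, gradient-based acceptance check.

    `loss` and `grad` are callables returning the objective and its gradient;
    all names and tolerances here are illustrative assumptions.
    """
    if tiny_loss_tol is None:
        # loss improvements below this are indistinguishable from rounding error
        tiny_loss_tol = 16 * np.finfo(float).eps * abs(loss_old)

    armijo = sigma * grad_old @ step  # expected decrease for the full step (negative)
    t = 1.0
    for _ in range(max_iter):
        coef_new = coef + t * step
        loss_new = loss(coef_new)

        # 1) classic Armijo condition on the loss
        if loss_new <= loss_old + t * armijo:
            return coef_new, loss_new

        # 2) fallback: loss change is tiny, but the gradient got smaller
        if abs(loss_new - loss_old) <= tiny_loss_tol:
            grad_new = grad(coef_new)
            if np.linalg.norm(grad_new, ord=1) < np.linalg.norm(grad_old, ord=1):
                return coef_new, loss_new

        t *= beta  # backtrack
    return coef, loss_old  # no acceptable step found
```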

Checklist

  • Added a CHANGELOG.rst entry

@MarcAntoineSchmidtQC (Member) left a comment

I'm good with this change. It does not have any negative performance impact on our current benchmarks, and the logic makes sense. The new conditional branch is rarely taken in our current golden-master tests, but is used quite often in the test_glm.py tests that @lorentzenchr provided.

Two small requests before merging:

  • Is it possible to provide a source for the logic behind the tiny loss branch?
  • Can you add an entry to the changelog in the "Unreleased" section?

@lorentzenchr (Contributor, Author)

Is it possible to provide a source for the logic behind the tiny loss branch?

I developed this in scikit-learn/scikit-learn#24637, trying to satisfy my own (very strict) tests. I can't provide any literature on this.
The basic insight is as follows:

  • In general, the first-order condition, also known as the score equation, is a better criterion for checking convergence than loss improvements or step sizes (changes in coefficients), because we know its absolute scale: it should be zero.
  • Empirically, if we are already close to the minimum and are seeking a high-precision solution, the floating-point precision of the loss might not be enough while the gradient has not yet reached, say, 1e-12 or lower. The gradient therefore carries more information in such a setting.
  • The line search may then reject a (Newton) step that brings the gradient closer to zero but yields no improvement of the loss (and no worsening either, up to machine precision); the numerical sketch after this list illustrates the effect.
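
As a loose numerical illustration of that last point (not code from this PR; the shifted quadratic is a made-up example): near the minimum, an exact Newton step drives the gradient to zero while the loss does not change at all in float64.

```python
import numpy as np

# Shifted 1-D quadratic, f(x) = 1.0 + 0.5 * x**2, with minimum value 1.0 at x = 0.
def f(x):
    return 1.0 + 0.5 * x**2

def grad(x):
    return x

x_old = 1e-9                 # already very close to the optimum
x_new = x_old - grad(x_old)  # exact Newton step (f'' == 1) lands on the minimum

print(f(x_new) - f(x_old))                 # 0.0 -- below the float64 resolution of f
print(abs(grad(x_new)), abs(grad(x_old)))  # 0.0 vs 1e-09 -- a clear improvement
```

A pure loss-based (Armijo) check would see zero improvement here and reject the step, while a gradient-based check accepts it.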

@MarcAntoineSchmidtQC (Member)

Thanks @lorentzenchr!

@MarcAntoineSchmidtQC merged commit a304d55 into Quantco:main on Nov 8, 2023
13 checks passed
@lorentzenchr deleted the linear_search branch on November 10, 2023, 10:44