ENH: stats.gaussian_kde: replace use of inv_cov in logpdf #16987

mdhaber · 2022-09-08T21:42:45Z

Reference issue

What does this implement/fix?

gh-16692 used Cholesky decomposition to avoid the inversion of the covariance matrix in gaussian_kde.pdf.
We wanted to wait for gh-15493 to finish before implementing that change in gaussian_kde.logpdf.
Now that gh-15493 is done, this makes the change to gaussian_kde.logpdf.

It also simplifies what we did in gh-16692. There, we found a way to use Cholesky decomposition to compute $L^T x$, where $L L^T = \Sigma^{-1}$ (that is, $L$ is a Cholesky factor of the precision matrix / inverse covariance matrix).
Ultimately, we don't need $L^T x$; it was just one way to get to $x^T \Sigma^{-1} x$. There is a simpler way to get that while still avoiding the matrix inversion: $x^T \Sigma^{-1} x = y^T y$, where $C y = x$ and $CC^T = \Sigma$ (that is, $C$ is a Cholesky factor of the original covariance matrix rather than the precision matrix). The short calculation is shown here.
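
For readers following along, the identity can be checked in a few lines. Since $\Sigma = C C^T$, we have $\Sigma^{-1} = C^{-T} C^{-1}$, so $x^T \Sigma^{-1} x = (C^{-1} x)^T (C^{-1} x) = y^T y$ with $C y = x$. The snippet below is only a minimal NumPy/SciPy sketch of that identity, not the Cython code changed in this PR; the matrix `sigma` and the point `x` are made-up test data.

```python
import numpy as np
from scipy.linalg import cholesky, solve_triangular

rng = np.random.default_rng(0)

# Hypothetical test data: a random symmetric positive definite
# covariance matrix and a single point x.
A = rng.standard_normal((3, 3))
sigma = A @ A.T + 3 * np.eye(3)
x = rng.standard_normal(3)

# Lower Cholesky factor C, so that C @ C.T == sigma.
C = cholesky(sigma, lower=True)

# Solve C y = x by forward substitution; no matrix inverse is formed.
y = solve_triangular(C, x, lower=True)
quad_cholesky = y @ y

# Reference value using the explicit inverse, for comparison only.
quad_inverse = x @ np.linalg.inv(sigma) @ x

print(np.isclose(quad_cholesky, quad_inverse))
```

The explicit inverse appears here only as a cross-check; the point of the change is that the triangular solve alone is enough.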

@steppi I think someone else can review this since it's simpler than gh-16692, but I thought you might find it interesting that those permutations weren't necessary to get the end result.

mdhaber · 2022-09-08T22:51:35Z

Failures appear to be unrelated.

steppi · 2022-09-16T17:30:44Z

> @steppi I think someone else can review this since it's simpler than gh-16692, but I thought you might find it interesting that those permutations weren't necessary to get the end result.

Oh right, that makes perfect sense. If you start with the Cholesky decomposition of the covariance matrix, then you end up with something that isn’t quite a Cholesky decomposition of the precision matrix, because the left factor is upper triangular instead of lower. This factorization is still workable, though, because you can just call solve_triangular with lower=True in the end.
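
To make the "upper instead of lower" point concrete, here is a small sketch under assumed test data (the matrix `sigma` is invented for illustration and the variable names are not from the PR). Writing $U = C^{-T}$, the precision matrix factors as $\Sigma^{-1} = U U^T$ with $U$ upper triangular, which differs from the lower triangular Cholesky factor $L$ of $\Sigma^{-1}$.

```python
import numpy as np
from scipy.linalg import cholesky, solve_triangular

rng = np.random.default_rng(1)

# Hypothetical symmetric positive definite covariance matrix.
A = rng.standard_normal((4, 4))
sigma = A @ A.T + 4 * np.eye(4)
precision = np.linalg.inv(sigma)  # formed here only to check the claim

# Lower Cholesky factor of the covariance: C @ C.T == sigma.
C = cholesky(sigma, lower=True)

# C^{-1} via a triangular solve against the identity; its transpose
# U = C^{-T} is the left factor of the precision matrix.
C_inv = solve_triangular(C, np.eye(4), lower=True)
U = C_inv.T

# The left factor is upper triangular and reproduces the precision matrix.
left_is_upper = np.allclose(U, np.triu(U))
reproduces_precision = np.allclose(U @ U.T, precision)

# A true (lower) Cholesky factor of the precision matrix is a different matrix.
L = cholesky(precision, lower=True)
factors_differ = not np.allclose(U, L)

print(left_is_upper, reproduces_precision, factors_differ)
```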

steppi

Looks good to me.

mdhaber · 2022-09-16T18:25:11Z

Thanks @steppi.

@jjerphan does this look good to you from a Cython perspective? To avoid too many changes at once, I didn't release the GIL. That can be a separate PR, if desired.

jjerphan · 2022-09-19T08:12:01Z

> @jjerphan does this look good to you from a Cython perspective? To avoid too many changes at once, I didn't release the GIL. That can be a separate PR, if desired.

This LGTM from a Cython perspective, though I haven't looked at the maths yet. Doing another PR to improve this implementation w.r.t. Cython technicalities seems sensible to me. 👍

Let me know if you need another review for this PR generally.

mdhaber · 2022-09-19T13:48:51Z

@steppi would you like @jjerphan to review the math, too, or are you comfortable merging?

steppi · 2022-09-19T19:37:24Z

> @steppi would you like @jjerphan to review the math, too, or are you comfortable merging?

The math looks good to me. I’m comfortable with merging.

mdhaber added 2 commits September 8, 2022 13:57

MAINT: stats.kde: simplify calculation of PDF

b66e05f

MAINT: stats.kde: use cholesky decomposition in logpdf, too

69e0727

mdhaber added the scipy.stats and enhancement labels Sep 8, 2022

mdhaber mentioned this pull request Sep 9, 2022

ENH: stats.multivariate: introduce Covariance class and subclasses mdhaber/scipy#88

Open

steppi approved these changes Sep 16, 2022

steppi merged commit b5a8052 into scipy:main Sep 20, 2022

mdhaber added this to the 1.10.0 milestone Nov 29, 2022
