`TabulatedPhase`: better eval method #226

nollety · 2022-05-18T11:50:49Z

Description

Resolves eradiate/eradiate-issues#165

I improved how TabulatedPhase.eval maps the input phase function data to a regular grid of scattering angle cosine (mu) when the input data does not already map to a regular mu grid.

Changes

I removed the hidden _n_mu class attribute and updated eval_mono in the following way:

if the input phase function data maps to a mu regular grid, interpolate in wavelength and return the values
else, compute regular mu grid based on the smallest mu step found in the input phase function data, and interpolate in wavelength (using xarray.DataArray.interp) and on that regular mu grid (using numpy.interp for performance) and return the values. For further details, refer to a discussion about the choice of this regular mu grid in the comments down below.

I added 2 unit tests for this new implementation (hence the data submodule update).

If the user wants to control the number of points used along the mu axis to discretize a tabulated phase function, they would simply have to provide phase function data mapping directly to a regular mu grid, since in that case the input DataArray is left unchanged.

I added a converter for the data attribute, that ensures the mu coordinate is monotonically increasing.

I added a validator for the data attribute to check that the mu coordinate in the input data set goes from -1.0 to 1.0.

I added TabulatedPhaseFunction to the API Reference.

Checklist

The code follows the relevant coding guidelines
The code generates no new warnings
The code is appropriately documented
The code is tested to prove its function
The feature branch is rebased on the current state of the main branch
I updated the change log if relevant
I give permission that the Eradiate project may redistribute my contributions under the terms of its license

nollety · 2022-05-19T08:49:56Z

I found no need to update the regression tests with these changes ; they still pass.

nollety · 2022-05-19T11:33:17Z

src/eradiate/scenes/phase/_tabulated.py

+            # compute the smallest regular grid that does not loose information
+            # in the sense of Shannon theorem
+            dmu = np.abs(self.data.mu.diff(dim="mu")).values.min()
+            nmu = 2 * int(np.ceil(2.0 / dmu))


After discussion with @schunkes and @lucio-f, we could consider removing the outside factor 2 (from Shannon's theorem) which would result in twice lower memory footprint and lower interpolation times

Any thoughts about that? @leroyvn

I'm not a signal processing expert so I don't have strong arguments to oppose: if you guys have proper arguments in favour of tuning down the number of grid points, go for it. Just keep that in mind when analysing results ;)

Well Lucio made the point that the data is already sampled at a maximum frequency of dmu, so we don't really gain anything from resampling it at 2*dmu.

For the record, my reasoning is that the smallest mu step is likely to be found at the extreme right of the [-1, 1] mu interval since this is where the forward scattering peak is located. Therefore, if we buid the regular mu grid using:

dmu_min = np.abs(self.data.mu.diff(dim="mu")).values.min() nmu = int(np.ceil(2.0 / dmu_min)) + 1 mu = np.linspace(-1, 1, nmu)

the two far-right mu points in [-1, 1] will exactly match the mu points in the input data. From these points on, the data to the left will be interpolated, therefore we incur a loss accuracy. Arguably, the loss of accuracy is minimised where it is important to be accuracy, namely close to mu=1.0.

@schunkes I am not convinced that:

dmu is the minimal mu step

implies that:

1 / dmu is the maximal frequency in the phase function signal

If you take the Fourrier transform of a realistic particle phase function in the mu-space, you will find an infinite number of harmonics in my opinion

After more discussion with the whiteboard, we found that assuming the following:

the input data is propertly discretised, namely the smallest mu-step is found where the phase function has the sharpest variations

the sharpest feature occurs either at mu=-1 or at mu=1.0, or if it is located somewhere in the middle of the interval, the input data is well enough sampled in the mu space
the smallest delta mu regularisation method is the best.

As illustrated by the figure below, in the presumably most common case where the phase function has sharpest variations around mu=1.0, interpolating on the regular grid prescribed by the Shannon's theorem or on the regular grid using the smallest mu-step lead to the same result:

As a rule of thumb, the smallest the mu step in the input data, the lowest the interpolation errors.

We note that this regular grid tabulated phase function approach is not best suited for phase function with very sharp features, generally speaking, because of the overhead in memory and in interpolation time.

However, this approach is the only available to us at the moment...

nollety · 2022-05-19T11:45:28Z

I'll add a couple of tests

leroyvn

Hi @nollety, this is a nice PR. I made a couple of comments which I'd like to see addressed.

src/eradiate/scenes/phase/_tabulated.py

tests/02_eradiate/01_unit/scenes/phase/test_tabulated.py

nollety · 2022-05-19T16:48:22Z

Thanks for the feedback! @leroyvn

nollety · 2022-05-20T13:23:33Z

Alright, we are ready for another review round I think :)

leroyvn · 2022-05-21T19:28:42Z

Thanks @nollety, we're good to go: I'll merge this.

nollety added the enhancement 🦾 New feature or request label May 18, 2022

nollety commented May 19, 2022

View reviewed changes

nollety marked this pull request as ready for review May 19, 2022 13:14

nollety force-pushed the 165-tabphase branch 2 times, most recently from 5b3d62f to 4264958 Compare May 19, 2022 13:53

leroyvn requested changes May 19, 2022

View reviewed changes

nollety force-pushed the 165-tabphase branch from b8df618 to 9394d38 Compare May 20, 2022 13:19

nollety requested review from leroyvn and schunkes May 20, 2022 13:23

TabulatedPhaseFunction: better eval method

7e8db1f

nollety force-pushed the 165-tabphase branch from b49203e to 7e8db1f Compare May 20, 2022 13:27

leroyvn merged commit 4b759c5 into eradiate:main May 21, 2022

leroyvn mentioned this pull request May 30, 2022

Add irregularly gridded tabulated phase function support #229

Merged

7 tasks

nollety deleted the 165-tabphase branch October 7, 2022 10:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`TabulatedPhase`: better eval method #226

`TabulatedPhase`: better eval method #226

nollety commented May 18, 2022 •

edited

nollety commented May 19, 2022

nollety May 19, 2022 •

edited

nollety May 19, 2022

leroyvn May 19, 2022

schunkes May 20, 2022

nollety May 20, 2022

nollety May 20, 2022 •

edited

nollety May 20, 2022

nollety May 20, 2022

nollety May 20, 2022

nollety commented May 19, 2022

leroyvn left a comment

nollety commented May 19, 2022

nollety commented May 20, 2022

leroyvn commented May 21, 2022

TabulatedPhase: better eval method #226

TabulatedPhase: better eval method #226

Conversation

nollety commented May 18, 2022 • edited

Description

Changes

Checklist

nollety commented May 19, 2022

nollety May 19, 2022 • edited

Choose a reason for hiding this comment

nollety May 19, 2022

Choose a reason for hiding this comment

leroyvn May 19, 2022

Choose a reason for hiding this comment

schunkes May 20, 2022

Choose a reason for hiding this comment

nollety May 20, 2022

Choose a reason for hiding this comment

nollety May 20, 2022 • edited

Choose a reason for hiding this comment

nollety May 20, 2022

Choose a reason for hiding this comment

nollety May 20, 2022

Choose a reason for hiding this comment

nollety May 20, 2022

Choose a reason for hiding this comment

nollety commented May 19, 2022

leroyvn left a comment

Choose a reason for hiding this comment

nollety commented May 19, 2022

nollety commented May 20, 2022

leroyvn commented May 21, 2022

`TabulatedPhase`: better eval method #226

`TabulatedPhase`: better eval method #226

nollety commented May 18, 2022 •

edited

nollety May 19, 2022 •

edited

nollety May 20, 2022 •

edited