Implement np.isclose #7067

guilhermeleobas · 2021-05-28T17:52:22Z

As title.

jeertmans · 2021-05-28T20:26:28Z

Hello,
A few months ago I also proposed an implementation of allclose and isclose, but it was put in the PR Backlog milestone; see #6286. Maybe it would be worth merging my request with yours? Or taking the best one?

gmarkall · 2021-06-01T12:47:14Z

Is this still work-in-progress? @guilhermeleobas Do you plan to add this to the list of supported functions in the docs?

guilhermeleobas · 2021-06-01T14:17:50Z

@gmarkall, I am waiting for a review from @stuartarchibald. The code for this function was copied from another PR (#4610).

stuartarchibald · 2021-06-01T17:27:18Z

CC @jpivarski, xref #6074

jpivarski · 2021-06-01T18:34:57Z

As promised in the meeting, here's my implementation for cross-checking:

import numba
import numba.extending
import numpy

@numba.njit
def _isclose_item(x, y, rtol, atol, equal_nan):
    if numpy.isnan(x) and numpy.isnan(y):
        return equal_nan
    elif numpy.isinf(x) and numpy.isinf(y):
        return (x > 0) == (y > 0)
    elif numpy.isinf(x) or numpy.isinf(y):
        return False
    else:
        return abs(x - y) <= atol + rtol * abs(y)

@numba.extending.overload(numpy.isclose)
def isclose(a, b, rtol=1e-05, atol=1e-08, equal_nan=False):
    if (isinstance(a, numba.types.Array) and a.ndim > 0) or (
        isinstance(b, numba.types.Array) and b.ndim > 0
    ):
        def isclose_impl(a, b, rtol=1e-05, atol=1e-08, equal_nan=False):
            # FIXME: want to broadcast_arrays(a, b) here
            x = a.reshape(-1)
            y = b.reshape(-1)
            out = numpy.zeros(len(y), numpy.bool_)
            for i in range(len(out)):
                out[i] = _isclose_item(x[i], y[i], rtol, atol, equal_nan)
            return out.reshape(b.shape)

    elif isinstance(a, numba.types.Array) or isinstance(b, numba.types.Array):
        def isclose_impl(a, b, rtol=1e-05, atol=1e-08, equal_nan=False):
            return numpy.asarray(
                _isclose_item(a.item(), b.item(), rtol, atol, equal_nan)
            )

    else:
        def isclose_impl(a, b, rtol=1e-05, atol=1e-08, equal_nan=False):
            return _isclose_item(a, b, rtol, atol, equal_nan)

    return isclose_impl

The main difference is that @guilhermeleobas's implementation makes many passes over the data (following NumPy's implementation), while mine makes one pass and is specialized for non-arrays when given non-arrays. It has a weakness, though: for correctness, my a and b would have to be broadcast with

a, b = numpy.broadcast_arrays(a, b)

before the out.reshape(b.shape) would be fully general. As it is, it managed to pass the tests because all of the tests have the larger shape as the second argument.
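As a rough illustration of the broadcasting fix described above, here is a plain-NumPy sketch (not the Numba lowering, and not the code from this PR) that broadcasts both operands first and then makes a single pass. The names `isclose_item` and `isclose_broadcast` are hypothetical, and the inputs are assumed to be real-valued.

```python
import numpy as np


def isclose_item(x, y, rtol, atol, equal_nan):
    # Scalar comparison, mirroring the branches of _isclose_item above.
    if np.isnan(x) and np.isnan(y):
        return equal_nan
    if np.isinf(x) and np.isinf(y):
        return (x > 0) == (y > 0)
    if np.isinf(x) or np.isinf(y):
        return False
    return abs(x - y) <= atol + rtol * abs(y)


def isclose_broadcast(a, b, rtol=1e-05, atol=1e-08, equal_nan=False):
    # Broadcast first so both operands share one shape (assumes real inputs).
    a, b = np.broadcast_arrays(np.asarray(a, dtype=np.float64),
                               np.asarray(b, dtype=np.float64))
    out = np.zeros(a.shape, dtype=np.bool_)
    # Single pass over every index of the broadcast shape.
    for idx in np.ndindex(a.shape):
        out[idx] = isclose_item(a[idx], b[idx], rtol, atol, equal_nan)
    return out
```

Because the output takes the broadcast shape rather than `b.shape`, the result no longer depends on which argument is larger.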

The tests can be made more general by adding the following two items:

yield [atol, np.inf, -np.inf, np.nan], [0], kw
yield [atol, np.inf, -np.inf, np.nan], 0, kw

which are just reversing the argument order of the last two tests. Then my implementation fails (for lack of lowered broadcast_arrays, #4074 (comment)) and I believe @guilhermeleobas's implementation would pass.
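To make the shape problem in the reversed cases concrete, the sketch below compares the shape NumPy returns with the shape that a `b`-shaped output would have. It uses plain NumPy only; `atol = 1e-8` is an assumed value (the test suite's `atol` is not shown in this thread), and `kw` is left out.

```python
import numpy as np

atol = 1e-8  # assumed value for illustration

a = np.asarray([atol, np.inf, -np.inf, np.nan])
shapes = []
for b in (np.asarray([0]), np.asarray(0)):
    expected = np.isclose(a, b).shape  # NumPy broadcasts both operands
    naive = b.shape                    # shape implied by out.reshape(b.shape)
    shapes.append((expected, naive))
    print(f"b.shape={b.shape}: NumPy gives {expected}, b-shaped output gives {naive}")
```

In both cases the broadcast result has four elements while `b` has only one, which is why an output sized from `b` cannot be correct here.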

Meta-question: are the "single pass over arrays" or "specialized for non-array types" performance considerations significant?

guilhermeleobas · 2021-06-02T18:01:48Z

@jpivarski, your implementation compiles WAY faster than the one I did. For reference, I am using this script to benchmark both implementations.

Compare both implementations with:

IMPL=jim pytest -rs -sv --disable-warnings --tb=short -x -v --durations=0 a.py
IMPL=guilherme pytest -rs -sv --disable-warnings --tb=short -x -v --durations=0 a.py

jpivarski · 2021-06-02T18:18:27Z

The compilation speed probably depends strongly on types—if the arguments to my isclose function are scalars, then it resolves to a couple of if-statements. If the arguments are arrays, my implementation is wrong because it lacks broadcasting. (I think the array functions you use don't broadcast, but they at least fail with the right error message when the shapes don't match.)

A "best of both worlds" might be to check for scalar arguments and drop to _isclose_item for that case, which is (I'm guessing) what is compiling so quickly. It should also run faster because it won't have to create array structs to evaluate scalars—I doubt LLVM is smart enough to compile that away, but again, I'm just guessing.
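The "best of both worlds" idea can be sketched in plain Python as a dispatch on dimensionality. This is only a runtime analogy: in a Numba overload the choice would presumably be made at compile time from the argument types. The names `isclose_scalar` and `isclose_dispatch` are hypothetical, and this sketch treats 0-d arrays as scalars.

```python
import math
import numpy as np


def isclose_scalar(x, y, rtol, atol, equal_nan):
    # Pure-scalar path: a handful of branches, no array objects created.
    if math.isnan(x) and math.isnan(y):
        return equal_nan
    if math.isinf(x) and math.isinf(y):
        return (x > 0) == (y > 0)
    if math.isinf(x) or math.isinf(y):
        return False
    return abs(x - y) <= atol + rtol * abs(y)


def isclose_dispatch(a, b, rtol=1e-05, atol=1e-08, equal_nan=False):
    # Scalars (and, in this sketch, 0-d arrays) take the branch-only path.
    if np.ndim(a) == 0 and np.ndim(b) == 0:
        return isclose_scalar(float(a), float(b), rtol, atol, equal_nan)
    # Anything with dimensions falls back to the broadcasting array routine.
    return np.isclose(a, b, rtol=rtol, atol=atol, equal_nan=equal_nan)
```

With scalar inputs the function returns a plain `bool` and never touches an array, which is the behaviour the comment above guesses would compile and run fastest.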

stuartarchibald · 2021-12-14T12:37:06Z

/AzurePipelines run

azure-pipelines · 2021-12-14T12:37:16Z

Azure Pipelines successfully started running 1 pipeline(s).

guilhermeleobas · 2021-12-14T14:59:13Z

Although the current implementation seems correct (it is similar to NumPy's), it takes a while to compile. I'll try to rewrite it based on @jpivarski's implementation.

guilhermeleobas · 2022-04-12T15:34:59Z

@jpivarski, can you review the code? Once #7437 gets merged, one can remove the broadcast_shapes part.

jpivarski · 2022-04-12T15:38:52Z

> @jpivarski, can you review the code?

Sure, I'll review. As part of that, I'm checking out the code and I'm going to try running it (in Vector, which motivated my interest in it). It will take tens of minutes to set up a new environment for it.

jpivarski

In my test environment, I manually changed _min_llvmlite_version to (0, 38, 0) because I couldn't figure out how to install llvmlite 0.39.0 (it's not released).

I tried it out manually, and all of the arguments (rtol, atol, equal_nan) work and give me the values I'd expect. I may be manually reproducing your test suite, but okay.

And now the motivating case: can I remove our custom implementation of isclose in Vector?

https://github.com/scikit-hep/vector/blob/e493e6f77d589571f85dcdf4dc4b61b8ca16f991/src/vector/_backends/numba_object.py#L95-L125

With this custom implementation removed and Numba 0.55.1, the last line fails:

>>> import numpy as np
>>> import numba as nb
>>> import vector
>>> one = vector.obj(x=1.1, y=2.2)
>>> two = vector.obj(x=1.1+1e-12, y=2.2+1e-12)
>>> (lambda x, y: x.isclose(y))(one, two)
True
>>> nb.njit()(lambda x, y: x.isclose(y))(one, two)

because Vector's isclose relies on np.isclose being defined for numeric arguments. In the environment with this branch installed, the last line succeeds (returns True) because Vector is picking up your new, lowered np.isclose.

So it works!

And the implementation looks good to me.

guilhermeleobas · 2022-04-13T17:55:50Z

Thanks for the review, @jpivarski.

@stuartarchibald or @gmarkall, can one of you folks review this PR when possible?

guilhermeleobas · 2022-05-19T14:41:59Z

One cannot use np.broadcast_shapes inside np.isclose because the former is available only on NumPy >= (1, 20)

njriasan

Thanks @guilhermeleobas. I left some comments, but this looks pretty good.

numba/core/typing/arraydecl.py

njriasan · 2022-06-25T23:34:15Z

numba/np/arraymath.py

+            return np.broadcast_to(out, tup)
+
+    else:
+        def isclose_impl(a, b, rtol=1e-05, atol=1e-08, equal_nan=False):


It seems like this path can be reached for types that shouldn't be supported. In particular since type_can_asarray supports types.Number you could take this path with complex numbers.

guilhermeleobas · 2022-06-30

It looks like np.isclose supports complex numbers:

>>> import numpy as np
>>> print(np.isclose(2, 3j))
False
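A small plain-Python sketch of what complex support involves, under the assumption that only finite values are compared: the tolerance test carries over because `abs()` returns the modulus, but a sign check like the one in the `_isclose_item` sketch earlier in this thread does not, since ordering is undefined for complex values. The helper name `isclose_finite` is hypothetical.

```python
import numpy as np


def isclose_finite(x, y, rtol=1e-05, atol=1e-08):
    # abs() returns the modulus for complex inputs, so the same test applies.
    return abs(x - y) <= atol + rtol * abs(y)


print(isclose_finite(2, 3j), bool(np.isclose(2, 3j)))
print(isclose_finite(1 + 1e-9j, 1), bool(np.isclose(1 + 1e-9j, 1)))

# Ordering comparisons raise TypeError for Python complex values, so
# (x > 0) == (y > 0) cannot be applied to them as written.
try:
    (1 + 1j) > 0
    ordering_supported = True
except TypeError:
    ordering_supported = False
```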

numba/np/arraymath.py

njriasan · 2022-06-28T02:33:48Z

numba/np/arraymath.py

+            x = a
+            y = b.reshape(-1)
+            out = np.zeros(len(y), np.bool_)
+            for i in range(len(out)):


Should this be supported with parfors as well? It seems like this should be supported with parallel=True.

guilhermeleobas · 2022-06-30

So, you're saying to replace range with prange when parallel=True?

guilhermeleobas requested review from esc, sklam and stuartarchibald as code owners May 28, 2021 17:52

guilhermeleobas mentioned this pull request May 28, 2021

Implement np.is* functions #4610

Merged

9 tasks

stuartarchibald added the 2 - In Progress label May 28, 2021

gmarkall added the 3 - Ready for Review and Effort - medium (medium size effort needed) labels and removed the 2 - In Progress label Jun 1, 2021

guilhermeleobas force-pushed the np_isclose branch from 823f7b2 to 9ec6cd5 Compare June 14, 2021 23:58

stuartarchibald added 2 - In Progress and removed 3 - Ready for Review labels Jul 9, 2021

guilhermeleobas force-pushed the np_isclose branch 2 times, most recently from 9c94e21 to 0d795e2 Compare August 24, 2021 01:40

jpivarski previously approved these changes Apr 12, 2022

jpivarski mentioned this pull request Apr 12, 2022

Take Numba's np.isclose lowering instead of defining our own. scikit-hep/vector#185

Closed

stuartarchibald added 3 - Ready for Review and removed 2 - In Progress labels Apr 13, 2022

guilhermeleobas added 6 commits May 18, 2022 22:20

add np.isclose

8a95eac

add np.isclose to the list of supported functions

2780351

include more tests

acb31f2

remove print stmt

178698e

update np.isclose to use broadcast_to

23b660a

update comment

bd8c33a

guilhermeleobas dismissed jpivarski’s stale review via bd8c33a May 19, 2022 03:11

guilhermeleobas force-pushed the np_isclose branch from d6923fa to bd8c33a Compare May 19, 2022 03:11

guilhermeleobas added 2 commits May 19, 2022 00:51

Change the return type of np.broadcast_shapes to return a tuple and change np.isclose impl. to use np.broadcast_shapes

4c42ec7

Revert "Change the return type of np.broadcast_shapes to return a tuple and change np.isclose impl. to use np.broadcast_shapes" This reverts commit 4c42ec7.

b77ba5a

guilhermeleobas mentioned this pull request May 19, 2022

change return type of np.broadcast_shapes to a tuple #8077

Merged

Trigger CI

9a70948

guilhermeleobas mentioned this pull request Jun 22, 2022

Add support for math.isclose() and numpy.isclose() #6074

Closed

njriasan reviewed Jun 28, 2022

gmarkall added the needs initial review label (this PR needs an initial review to check the code change is well formed, documented, efficient etc.) Jun 28, 2022

guilhermeleobas added 2 commits June 29, 2022 21:40

address reviewer comments

265a9ae

Merge remote-tracking branch 'upstream/main' into np_isclose

bfe1e90

stuartarchibald removed the needs initial review label (this PR needs an initial review to check the code change is well formed, documented, efficient etc.) Jul 5, 2022

Merge remote-tracking branch 'upstream/main' into np_isclose

68062c4

apmasell self-assigned this Oct 19, 2022

import numpy_broadcast_shapes_list

0ebb0c7

apmasell approved these changes Oct 19, 2022

apmasell added the 5 - Ready to merge label (review and testing done, is ready to merge) and removed the 3 - Ready for Review label Oct 19, 2022

sklam added this to the Numba 0.57 RC milestone Oct 19, 2022

sklam approved these changes Oct 19, 2022

sklam merged commit 9b13ac7 into numba:main Oct 19, 2022

guilhermeleobas deleted the np_isclose branch October 19, 2022 23:36

stuartarchibald mentioned this pull request Mar 29, 2023

Fix broken np.allclose, add np.isclose #8857

Closed

njriasan Jun 25, 2022

guilhermeleobas Jun 30, 2022

njriasan Jun 28, 2022

guilhermeleobas Jun 30, 2022
