BLD: Do not use clang default to ignore floating point exceptions #18005

tomgoddard · 2020-12-16T02:26:05Z

Remarkably numpy.log() gives a runtime overflow warning on a float32 array containing values larger than 1e9. Numpy 1.19.4, macOS 10.15.7, MacBookPro15,3. Warning does not appear for float64 array even for much larger values (e.g. 1e35).

Reproducing code example:

import numpy as np
a = np.array([1e9], np.float32)
np.log(a)

Error message:

RuntimeWarning: overflow encountered in log

NumPy/Python version information:

1.19.4 3.8.6 (default, Nov 20 2020, 18:29:40)
[Clang 12.0.0 (clang-1200.0.32.27)]

seberg · 2020-12-16T03:07:30Z

@tomgoddard, I cannot reproduce this on linux. Since you are on 1.19.x, could you print:

np.core._multiarray_umath.__cpu_features__

Just so we have the information?

The most likely cause will be that you have AVX512 instructions available and our AVX512 code has an error here. If it is in the AV512 path, @r-devulap could you have a look?

Of course it could also be a clang bug in the glibc version...

tomgoddard · 2020-12-16T03:14:21Z

np.core._multiarray_umath.__cpu_features__ {'MMX': True, 'SSE': True, 'SSE2': True, 'SSE3': True, 'SSSE3': True, 'SSE41': True, 'POPCNT': True, 'SSE42': True, 'AVX': True, 'F16C': True, 'XOP': False, 'FMA4': False, 'FMA3': True, 'AVX2': True, 'AVX512F': False, 'AVX512CD': False, 'AVX512ER': False, 'AVX512PF': False, 'AVX5124FMAPS': False, 'AVX5124VNNIW': False, 'AVX512VPOPCNTDQ': False, 'AVX512VL': False, 'AVX512BW': False, 'AVX512DQ': False, 'AVX512VNNI': False, 'AVX512IFMA': False, 'AVX512VBMI': False, 'AVX512VBMI2': False, 'AVX512BITALG': False, 'AVX512_KNL': False, 'AVX512_KNM': False, 'AVX512_SKX': False, 'AVX512_CLX': False, 'AVX512_CNL': False, 'AVX512_ICL': False, 'VSX': False, 'VSX2': False, 'VSX3': False, 'NEON': False, 'NEON_FP16': False, 'NEON_VFPV4': False, 'ASIMD': False, 'FPHP': False, 'ASIMDHP': False, 'ASIMDDP': False, 'ASIMDFHM': False}

tomgoddard · 2020-12-16T03:17:54Z

Numpy 1.19.4 was installed using brew on macOS 10.15.7.

seiko2plus · 2020-12-16T05:29:50Z

could you run np.log(np.array([1e9]*8, np.float32)) and check if the overflow error still remains? newest versions of clang committing aggressive optimization to partial load operations lead to undefined the vector tail on AVX2.

r-devulap · 2020-12-16T05:31:30Z

I don't see this problem when using NumPy built with gcc 9.3 or clang 10.0. Makes me think it has to something to do with clang 12. Still figuring out how to install clang 12 on my Ubuntu to reproduce the error :/

tomgoddard · 2020-12-16T05:34:50Z

>> a = np.array([1e9]*8, np.float32) >> np.log(a)

<stdin>:1: RuntimeWarning: overflow encountered in log array([20.723267, 20.723267, 20.723267, 20.723267, 20.723267, 20.723267, 20.723267, 20.723267], dtype=float32)

…

could you run a = np.array([1e9]*8, np.float32) and check if the overflow error still remains? newest versions of clang committing aggressive optimization to partial load operations lead to undefined the vector tail on AVX2.

tomgoddard · 2020-12-16T05:39:24Z

No overflow warning for smaller values such as 1e8. Python 3.8.6 (default, Nov 20 2020, 18:29:40) [Clang 12.0.0 (clang-1200.0.32.27)] on darwin

>> import numpy as np >> a = np.array([1e8], np.float32) >> np.log(a)

array([18.420681], dtype=float32) No overflow warning with float64

>> import numpy as np >> a = np.array([1e9, 1e20, 1e30], np.float64) >> np.log(a)

array([20.72326584, 46.05170186, 69.07755279])

r-devulap · 2020-12-16T22:03:35Z

confirmed that this behavior is only with clang (clang 10 and clang 12), and does not happen with gcc. It decides to set an overflow flag for a blend instruction (which is weird and incorrect) x = _mm256_blendv_ps(x, temp, denormal_mask); here:

numpy/numpy/core/src/umath/simd.inc.src

Line 1554 in d7a75e8

x = _mm256_blendv_ps(x, temp, denormal_mask);

seberg · 2020-12-17T18:48:23Z

@r-devulap thanks for digging this down, I am assuming it is a clang bug. Can we file it there? I am not sure there is a good way for us to work around it in NumPy to begin with...

r-devulap · 2020-12-18T21:24:39Z

Trying to narrow down to a simple C program to replicate the bug, unsuccessful so far ..

r-devulap · 2020-12-19T00:26:56Z

Found the culprit! clang optimizes out a blend instruction which was precisely meant to avoid this reported issue of an overflow warning. While the compiler was being clever, it is still a bug.

PR #18030 has a simple work around for this issue. Let me know if that works.

EDIT: didn't mean to include the code snippet.

* BUG: make a variable volatile to work around clang compiler bug * Adding comments for relevance * TST: Adding test to check for no overflow warnings in log Fixes #18005

…y#18030) * BUG: make a variable volatile to work around clang compiler bug * Adding comments for relevance * TST: Adding test to check for no overflow warnings in log Fixes numpy#18005

matthew-brett · 2020-12-19T18:39:33Z

Just checking - is this bug in clang reported somewhere?

…y#18030) * BUG: make a variable volatile to work around clang compiler bug * Adding comments for relevance * TST: Adding test to check for no overflow warnings in log Fixes numpy#18005

r-devulap · 2020-12-20T01:35:10Z

@matthew-brett not sure, I couldn't find anything relevant but I didn't spend too much time searching. I am planning on submitting a bug report.

matthew-brett · 2020-12-20T09:01:45Z

@r-devulap - I'm sure that would be worthwhile, especially as the bug seems to have survived through at least two releases - right?

r-devulap · 2020-12-20T22:38:27Z

You can play with different version of compilers here https://godbolt.org/z/Yajjcs and looks like it got introduced somewhere between 7.0 (which produces the right code) and 8.0 and is still there in 12.0.

h-vetinari · 2020-12-21T11:59:17Z

@r-devulap
Thanks for your efforts on this! Could you please reference the clang ticket here once you open it (or let people know if you won't be pursuing it anymore)?

r-devulap · 2020-12-21T17:52:54Z

So, it turns out this is expected behavior in clang. Clang has a flag to force strict floating point exception compliance by setting -ffp-exception-behavior=strict. The default, however, is set to ignore which means:

The compiler assumes that the exception status flags will not be read and that floating point exceptions will be masked

No wonder I have had trouble with clang before too. The docs has a section on it which you can find by searching for ffp-exception-behavior here: https://clang.llvm.org/docs/UsersManual.html

mattip · 2020-12-21T17:57:18Z

Should we add that flag if we are on clang?

seberg · 2020-12-21T18:02:41Z

Sounds like we definitely have to. I wonder if we worked around other clang optimizations that could have been avoided with that flag. Probably just me, but I don't like compilers defaulting to fast-math or this kind of thing...

r-devulap · 2020-12-21T18:06:36Z

I would think so. Clang would end up disabling some floating point optimizations which will have some negative performance impact. Might be minimal though and probably necessary going forward to ensure FP exception compliance.

r-devulap · 2020-12-21T18:08:23Z

Probably just me, but I don't like compilers defaulting to fast-math or this kind of thing...

Fortunately clang doesn't default to fast-math. But yes, I don't like this either.

seberg · 2020-12-21T18:16:26Z

Reopened and changed title. We should probably have a quick look around for clang workaround, but I don't think we need to backport getting rid of those.

r-devulap · 2020-12-21T18:40:47Z

This was the other time clang give me an incorrect FP flag: #13586 (comment) and the associated PR #13623

seberg · 2021-01-25T19:37:11Z

Hmmm, I thought I would look into this. But I am honestly not sure where we should be adding this flag, or if we actually should (additionally) even prod Python itself to add this flag. Am I right to think that most clang specific flags currently are inherited from Python itself?

charris · 2021-01-29T20:07:16Z

What is the status of this?

seberg · 2021-01-29T20:14:15Z

I think this is important, but I am not very familiar with distutils, and am not sure where this flag should be inserted correctly. (It probably is also correct for SciPy, but do we want to set it by default for scipy as well?).
I am happy to check for the related code cleanups, but not too fond of digging through distutils to find the right place.

It shouldn't matter too much to push it off to 1.21 all known issues due to this are fixed after all. While the flag change is something we could do, the (probably) related code cleanup we do not want to backport anyway probably.

seberg · 2021-03-18T16:20:34Z

@rgommers I somewhat assume you just know where to put this. Can you point where we would add a clang specific compiler flag to distutils (maybe NumPy specific, although this flag is really also necessary for scipy, I bet).

rgommers · 2021-03-18T21:29:12Z

In CCompiler_customize: https://github.com/numpy/numpy/blob/main/numpy/distutils/ccompiler.py#L366

maybe NumPy specific, although this flag is really also necessary for scipy, I bet

It's hard to make it NumPy-specific, and I think other libraries should get this flag too, so I'd say just add it unconditionally.

seiko2plus · 2021-03-18T23:11:59Z

@seberb,

Can you point where we would add a clang specific compiler flag to distutils?

for a quickfix, we can link the flag with one of baseline features flags, etc SSE on x86.`

numpy/numpy/distutils/ccompiler_opt.py

Lines 309 to 314 in ddbff08

    
           on_x86 = self.cc_on_x86 or self.cc_on_x64 
        
           is_unix = self.cc_is_gcc or self.cc_is_clang 
        
           if on_x86 and is_unix: return dict( 
        
               SSE    = dict(flags="-msse"), 
        
               SSE2   = dict(flags="-msse2"),

seberg added 00 - Bug component: numpy.ufunc labels Dec 16, 2020

seberg added the 26 - Compiler label Dec 17, 2020

r-devulap mentioned this issue Dec 19, 2020

BUG: make a variable volatile to work around clang compiler bug #18030

Merged

charris added this to the 1.19.5 release milestone Dec 19, 2020

seberg closed this as completed in #18030 Dec 19, 2020

charris mentioned this issue Dec 19, 2020

BUG: make a variable volatile to work around clang compiler bug #18035

Merged

charris mentioned this issue Dec 19, 2020

BUG: make a variable volatile to work around clang compiler bug #18036

Merged

seberg reopened this Dec 21, 2020

seberg changed the title ~~log(1e9) gives RuntimeWarning: overflow encountered in log~~ BLD: Do not use clang default to ignore floating point exceptions Dec 21, 2020

charris mentioned this issue Dec 23, 2020

nan in matrix matrix multiplication on linux/arm64 guest on macos/arm64 host #18061

Closed

seberg modified the milestones: 1.19.5 release, 1.20.0 release Jan 5, 2021

charris modified the milestones: 1.20.0 release, 1.21.0 release Jan 29, 2021

seberg mentioned this issue Mar 5, 2021

RuntimeWarning: divide by zero encountered in reciprocal for np.ones(3) ** -1 #18555

Closed

seberg mentioned this issue Mar 13, 2021

np.mod ~3x slower in numpy 1.20 #18607

Closed

mattip added the sprintable Issue fits the time-frame and setting of a sprint label May 6, 2021

seberg mentioned this issue May 10, 2021

"invalid value encountered in left_shift" for broadcasted int16 array #18986

Open

seberg mentioned this issue May 19, 2021

BLD,API: (distutils) Force strict floating point error model on clang #19049

Merged

charris closed this as completed in #19049 May 20, 2021

charris mentioned this issue Jun 13, 2021

BLD,API: (distutils) Force strict floating point error model on clang #19229

Merged

seberg mentioned this issue Jul 15, 2021

BLD: Add clang -ftrapping-math also for compiler_so #19479

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BLD: Do not use clang default to ignore floating point exceptions #18005

BLD: Do not use clang default to ignore floating point exceptions #18005

tomgoddard commented Dec 16, 2020

seberg commented Dec 16, 2020

tomgoddard commented Dec 16, 2020 via email •

edited by seberg

tomgoddard commented Dec 16, 2020

seiko2plus commented Dec 16, 2020 •

edited

r-devulap commented Dec 16, 2020

tomgoddard commented Dec 16, 2020 via email

tomgoddard commented Dec 16, 2020 via email

r-devulap commented Dec 16, 2020

seberg commented Dec 17, 2020

r-devulap commented Dec 18, 2020

r-devulap commented Dec 19, 2020 •

edited

matthew-brett commented Dec 19, 2020

r-devulap commented Dec 20, 2020

matthew-brett commented Dec 20, 2020

r-devulap commented Dec 20, 2020

h-vetinari commented Dec 21, 2020

r-devulap commented Dec 21, 2020 •

edited

mattip commented Dec 21, 2020

seberg commented Dec 21, 2020

r-devulap commented Dec 21, 2020

r-devulap commented Dec 21, 2020

seberg commented Dec 21, 2020

r-devulap commented Dec 21, 2020

seberg commented Jan 25, 2021

charris commented Jan 29, 2021

seberg commented Jan 29, 2021

seberg commented Mar 18, 2021

rgommers commented Mar 18, 2021

seiko2plus commented Mar 18, 2021

BLD: Do not use clang default to ignore floating point exceptions #18005

BLD: Do not use clang default to ignore floating point exceptions #18005

Comments

tomgoddard commented Dec 16, 2020

Reproducing code example:

Error message:

NumPy/Python version information:

seberg commented Dec 16, 2020

tomgoddard commented Dec 16, 2020 via email • edited by seberg

tomgoddard commented Dec 16, 2020

seiko2plus commented Dec 16, 2020 • edited

r-devulap commented Dec 16, 2020

tomgoddard commented Dec 16, 2020 via email

tomgoddard commented Dec 16, 2020 via email

r-devulap commented Dec 16, 2020

seberg commented Dec 17, 2020

r-devulap commented Dec 18, 2020

r-devulap commented Dec 19, 2020 • edited

matthew-brett commented Dec 19, 2020

r-devulap commented Dec 20, 2020

matthew-brett commented Dec 20, 2020

r-devulap commented Dec 20, 2020

h-vetinari commented Dec 21, 2020

r-devulap commented Dec 21, 2020 • edited

mattip commented Dec 21, 2020

seberg commented Dec 21, 2020

r-devulap commented Dec 21, 2020

r-devulap commented Dec 21, 2020

seberg commented Dec 21, 2020

r-devulap commented Dec 21, 2020

seberg commented Jan 25, 2021

charris commented Jan 29, 2021

seberg commented Jan 29, 2021

seberg commented Mar 18, 2021

rgommers commented Mar 18, 2021

seiko2plus commented Mar 18, 2021

tomgoddard commented Dec 16, 2020 via email •

edited by seberg

seiko2plus commented Dec 16, 2020 •

edited

r-devulap commented Dec 19, 2020 •

edited

r-devulap commented Dec 21, 2020 •

edited