Add overflow-checking variants of arithmetic scalar dyn kernels #2713

viirya · 2022-09-12T19:44:32Z

Which issue does this PR close?

Closes #2712.

Rationale for this change

What changes are included in this PR?

Are there any user-facing changes?

viirya · 2022-09-12T22:09:48Z

cc @sunchao

sunchao · 2022-09-13T16:17:22Z

arrow/src/compute/kernels/arithmetic.rs

+///
+/// This detects overflow and returns an `Err` for that. For an non-overflow-checking variant,
+/// use `add_scalar_dyn` instead.
+pub fn add_scalar_checked_dyn<T>(array: &dyn Array, scalar: T::Native) -> Result<ArrayRef>


curious: do we have benchmark to track how much slower add_scalar_checked_dyn is comparing to add_scalar_dyn?

If it is anything like the non-scalar kernels, it is about 10x slower. Aside from the branching costs, it prevents LLVM from vectorising it correctly

I see. I wonder if we should point that out in the doc of this method, in case it's not obvious to the users.

Yea, it will be much slower. As by default (ansi-mode disabled) in our case, non-checked kernels will be used. So most of time users will use faster one, except they have special need to use checked kernels.

Yea, I'm going to add a few lines mentioning that.

sunchao · 2022-09-13T16:20:43Z

arrow/src/compute/kernels/arithmetic.rs

@@ -834,12 +834,34 @@ where
 /// Add every value in an array by a scalar. If any value in the array is null then the
 /// result is also null. The given array must be a `PrimitiveArray` of the type same as
 /// the scalar, or a `DictionaryArray` of the value type same as the scalar.
+///
+/// This doesn't detect overflow. Once overflowing, the result will wrap around.
+/// For an overflow-checking variant, use `add_scalar_checked_dyn` instead.


perhaps explain a bit when it will return Err

sunchao · 2022-09-13T16:28:07Z

arrow/src/compute/kernels/arity.rs

+    F: Fn(T::Native) -> Result<T::Native>,
+{
+    downcast_dictionary_array! {
+        array => try_unary_dict::<_, F, T>(array, op),


hmm how do we know the dictionary value type matches T?

Indeed, there is no type-bound for the dictionary value type. Just do a simple test. At runtime downcast_ref will fail in unary_dict. I will address it in other PR.

Normally as the op is provided by users, I suppose that users know dictionary value is same type as the scalar. But it is good to return a meaningful Err instead of runtime panic. I will do it in a follow-up.

Follow up sounds fine to me. Perhaps we can just check the type here:

downcast_dictionary_array! { array => if array.values().data_type() == &T::DATA_TYPE { try_unary_dict::<_, F, T>(array, op) } else { // throw error }, t => {

Oh, right, actually I thought to handle it at try_unary_dict. But this fix looks okay as try_unary_dict is currently used here and not public. I may fix at try_unary_dict at another followup.

…ic_scalar_dyn

sunchao · 2022-09-14T21:42:48Z

arrow/src/compute/kernels/arity.rs

+    F: Fn(T::Native) -> Result<T::Native>,
+{
+    downcast_dictionary_array! {
+        array => try_unary_dict::<_, F, T>(array, op),


Follow up sounds fine to me. Perhaps we can just check the type here:

downcast_dictionary_array! { array => if array.values().data_type() == &T::DATA_TYPE { try_unary_dict::<_, F, T>(array, op) } else { // throw error }, t => {

viirya · 2022-09-14T23:54:29Z

Thanks.

ursabot · 2022-09-15T00:04:01Z

Benchmark runs are scheduled for baseline = 2a0fc77 and contender = 7594db6. 7594db6 is a master commit associated with this PR. Results will be available as each benchmark for each run completes.
Conbench compare runs links:
[Skipped ⚠️ Benchmarking of arrow-rs-commits is not supported on ec2-t3-xlarge-us-east-2] ec2-t3-xlarge-us-east-2
[Skipped ⚠️ Benchmarking of arrow-rs-commits is not supported on test-mac-arm] test-mac-arm
[Skipped ⚠️ Benchmarking of arrow-rs-commits is not supported on ursa-i9-9960x] ursa-i9-9960x
[Skipped ⚠️ Benchmarking of arrow-rs-commits is not supported on ursa-thinkcentre-m75q] ursa-thinkcentre-m75q
Buildkite builds:
Supported benchmarks:
ec2-t3-xlarge-us-east-2: Supported benchmark langs: Python, R. Runs only benchmarks with cloud = True
test-mac-arm: Supported benchmark langs: C++, Python, R
ursa-i9-9960x: Supported benchmark langs: Python, R, JavaScript
ursa-thinkcentre-m75q: Supported benchmark langs: C++, Java

github-actions bot added the arrow Changes to the arrow crate label Sep 12, 2022

Add overflow-checking variants of arithmetic scalar dyn kernels

54d44d2

viirya force-pushed the overflow_arithmetic_scalar_dyn branch from 7a10d91 to 54d44d2 Compare September 12, 2022 19:53

sunchao reviewed Sep 13, 2022

View reviewed changes

viirya added 2 commits September 13, 2022 12:46

Update doc

f17a586

Merge remote-tracking branch 'upstream/master' into overflow_arithmet…

7dfd138

…ic_scalar_dyn

sunchao approved these changes Sep 14, 2022

View reviewed changes

For review

713b54f

viirya merged commit 7594db6 into apache:master Sep 14, 2022

alamb mentioned this pull request Sep 16, 2022

Add overflow-checking variants of arithmetic scalar dyn kernels #2712

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add overflow-checking variants of arithmetic scalar dyn kernels #2713

Add overflow-checking variants of arithmetic scalar dyn kernels #2713

viirya commented Sep 12, 2022

viirya commented Sep 12, 2022

sunchao Sep 13, 2022

tustvold Sep 13, 2022

sunchao Sep 13, 2022

viirya Sep 13, 2022

viirya Sep 13, 2022

sunchao Sep 13, 2022

sunchao Sep 13, 2022

viirya Sep 13, 2022

viirya Sep 13, 2022

sunchao Sep 14, 2022

viirya Sep 14, 2022

sunchao Sep 14, 2022

viirya commented Sep 14, 2022

ursabot commented Sep 15, 2022

Add overflow-checking variants of arithmetic scalar dyn kernels #2713

Add overflow-checking variants of arithmetic scalar dyn kernels #2713

Conversation

viirya commented Sep 12, 2022

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are there any user-facing changes?

viirya commented Sep 12, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

viirya commented Sep 14, 2022

ursabot commented Sep 15, 2022