Add `Decimal128` API and use it in DecimalArray and DecimalBuilder #1871

viirya · 2022-06-14T05:51:45Z

Which issue does this PR close?

Closes #1870.

Rationale for this change

What changes are included in this PR?

Are there any user-facing changes?

viirya · 2022-06-14T05:54:25Z

arrow/src/util/decimal.rs

+use std::cmp::Ordering;
+
+#[derive(Clone, Debug)]
+pub struct Decimal128 {


C++ Decimal128 has implemented some operators. I don't implement the same in this change. This change tries to be functionally equal with current i128 API. We can consider to have these operators if they are needed.

codecov-commenter · 2022-06-14T06:10:47Z

Codecov Report

Merging #1871 (ecb026f) into master (cedaf8a) will decrease coverage by 0.00%.
The diff coverage is 90.19%.

@@            Coverage Diff             @@
##           master    #1871      +/-   ##
==========================================
- Coverage   83.46%   83.46%   -0.01%     
==========================================
  Files         201      202       +1     
  Lines       57014    57069      +55     
==========================================
+ Hits        47586    47630      +44     
- Misses       9428     9439      +11

Impacted Files	Coverage Δ
arrow/src/util/decimal.rs	`79.59% <79.59%> (ø)`
arrow/src/array/array_binary.rs	`94.18% <100.00%> (-0.06%)`	⬇️
arrow/src/array/builder.rs	`86.98% <100.00%> (+0.09%)`	⬆️
arrow/src/array/equal_json.rs	`89.70% <100.00%> (ø)`
arrow/src/array/iterator.rs	`96.11% <100.00%> (ø)`
arrow/src/compute/kernels/cast.rs	`95.77% <100.00%> (ø)`
arrow/src/compute/kernels/sort.rs	`95.67% <100.00%> (ø)`
arrow/src/compute/kernels/take.rs	`95.27% <100.00%> (ø)`
parquet/src/arrow/arrow_reader.rs	`96.87% <100.00%> (ø)`
parquet/src/arrow/arrow_writer/mod.rs	`97.53% <100.00%> (ø)`
... and 1 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update cedaf8a...ecb026f. Read the comment docs.

viirya · 2022-06-14T06:35:01Z

arrow/src/util/decimal.rs

+}
+
+impl PartialEq<Self> for Decimal128 {
+    fn eq(&self, other: &Self) -> bool {


I'm a bit not sure about comparing Decimal128. Although C++ Decimal128 just compares two uint64 values. We also compare i128 directly currently (e.g., ord kernel).

But I'm still wondering it is correct to compare two values with different scale? E.g., 100_i128 (scale 2) and 100_i128 (scale 3)? Isn't it "1.00" and "0.100" respectively?

So I put an assert to check scale here.

I think the assert is needed.
If two decimal128 has diff type(precision or scale), we can't compare the value of i128.

tustvold

I like this, my only major question concerns how we generalise this to Decimal 256 without having to duplicate lots of code

tustvold · 2022-06-15T09:08:47Z

arrow/src/util/decimal.rs

+
+use std::cmp::Ordering;
+
+#[derive(Debug)]


Perhaps a doc comment or something

tustvold · 2022-06-15T09:10:50Z

arrow/src/util/decimal.rs

+        }
+    }
+
+    pub fn new_from_i128(precision: usize, scale: usize, value: i128) -> Self {


I'm presuming we will want to make Decimal256 and Decimal128 generic versions of the same impl, and so I wonder how methods like this which explicitly name the type will translate? Maybe new_from_raw?

I have considered making them generic versions. Because I only implement Decimal128 now, new_from_i128 is used to make Decimal128 fit into existing codes.

Next step I will implement Decimal256 and try generalise it with Decimal128. I think if it works, new_from_bytes (maybe rename to new_from_raw) will be the generalised API. new_from_i128 will be removed if the above idea works.

I think we have some time before next release to have Decimal256 and generalise the API.

liukun4515 · 2022-06-16T01:59:20Z

I want to take a look this pr, please hold it.

liukun4515 · 2022-06-16T03:24:39Z

arrow/src/util/decimal.rs

+
+impl Eq for Decimal128 {}
+
+impl Decimal128 {


Do we need a function to get the type of the decimal128 value?

Decimal(p, s)? We can have it. Because it is easy to add API but harder to remove. The API starts from minimalism. Currently I keep it as less as possible only to be on par with existing functionality.

liukun4515 · 2022-06-16T03:26:59Z

@viirya
a question which is not about this pr.
How to represent decimal256 in rust？How does c++ implement it?
I'm not familiar with arrow c++ version.

viirya · 2022-06-16T03:41:19Z

@viirya a question which is not about this pr. How to represent decimal256 in rust？How does c++ implement it? I'm not familiar with arrow c++ version.

Like C++ Arrow Decimal256, we can represent the integer in an array of parts of it. C++ Arrow Decimal256 uses an uint64_t array.

liukun4515

LGTM

viirya · 2022-06-16T06:26:36Z

Thank you @tustvold @liukun4515. I'm going to merge this and keeping working on Decimal256 and generalise them.

martin-g · 2022-06-16T07:53:08Z

arrow/src/util/decimal.rs

+        let as_array = bytes.try_into();
+        let value = match as_array {
+            Ok(v) if bytes.len() == 16 => i128::from_le_bytes(v),
+            _ => panic!("Input to Decimal128 is not 128bit integer."),


Wouldn't it be better to return a Result instead of panic-ing here ?

Good idea. I will change it to Result. Thanks.

Add Decimal128

946e9d4

github-actions bot added arrow Changes to the arrow crate parquet Changes to the parquet crate labels Jun 14, 2022

viirya commented Jun 14, 2022

View reviewed changes

viirya added the api-change Changes to the arrow API label Jun 14, 2022

Fix clippy

ecb026f

viirya commented Jun 14, 2022

View reviewed changes

tustvold approved these changes Jun 15, 2022

View reviewed changes

Add code comment

77ad6f3

liukun4515 reviewed Jun 16, 2022

View reviewed changes

liukun4515 approved these changes Jun 16, 2022

View reviewed changes

viirya merged commit f0bf7f9 into apache:master Jun 16, 2022

martin-g reviewed Jun 16, 2022

View reviewed changes

alamb changed the title ~~Add Decimal128 API and use it in DecimalArray and DecimalBuilder~~ Add Decimal128 API and use it in DecimalArray and DecimalBuilder Jun 23, 2022

alamb mentioned this pull request Jun 23, 2022

Update to arrow 17.0.0 apache/datafusion#2778

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `Decimal128` API and use it in DecimalArray and DecimalBuilder #1871

Add `Decimal128` API and use it in DecimalArray and DecimalBuilder #1871

viirya commented Jun 14, 2022

viirya Jun 14, 2022

codecov-commenter commented Jun 14, 2022 •

edited

viirya Jun 14, 2022 •

edited

liukun4515 Jun 16, 2022

tustvold left a comment

tustvold Jun 15, 2022

tustvold Jun 15, 2022

viirya Jun 15, 2022

viirya Jun 15, 2022

liukun4515 commented Jun 16, 2022

liukun4515 Jun 16, 2022

viirya Jun 16, 2022

liukun4515 commented Jun 16, 2022

viirya commented Jun 16, 2022

liukun4515 left a comment

viirya commented Jun 16, 2022 •

edited

martin-g Jun 16, 2022

viirya Jun 16, 2022

Add Decimal128 API and use it in DecimalArray and DecimalBuilder #1871

Add Decimal128 API and use it in DecimalArray and DecimalBuilder #1871

Conversation

viirya commented Jun 14, 2022

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are there any user-facing changes?

Choose a reason for hiding this comment

codecov-commenter commented Jun 14, 2022 • edited

Codecov Report

viirya Jun 14, 2022 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tustvold left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

liukun4515 commented Jun 16, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

liukun4515 commented Jun 16, 2022

viirya commented Jun 16, 2022

liukun4515 left a comment

Choose a reason for hiding this comment

viirya commented Jun 16, 2022 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Add `Decimal128` API and use it in DecimalArray and DecimalBuilder #1871

Add `Decimal128` API and use it in DecimalArray and DecimalBuilder #1871

codecov-commenter commented Jun 14, 2022 •

edited

viirya Jun 14, 2022 •

edited

viirya commented Jun 16, 2022 •

edited