Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add page index reader test and support empty index. #2011

Closed
wants to merge 2 commits into from

Conversation

Ted-Jiang
Copy link
Member

@Ted-Jiang Ted-Jiang commented Jul 6, 2022

Which issue does this PR close?

Closes #2010.

Rationale for this change

After apache/parquet-testing#25 merged, there will be standard page index test file in parquet-testing.All types support page index in parquet-format has add test in this pr.

Another change:
I found there is a situation one col has pageLocation but without min_max index(馃槀 i missed it). So add EMPTY_ARRAY in enum Index.

What changes are included in this PR?

Are there any user-facing changes?

@github-actions github-actions bot added arrow-flight Changes to the arrow-flight crate parquet Changes to the parquet crate labels Jul 6, 2022
//col6->bigint_col: INT64 UNCOMPRESSED DO:0 FPO:152326 SZ:71598/71598/1.00 VC:7300 ENC:BIT_PACKED,RLE,PLAIN ST:[min: 0, max: 90, num_nulls: 0]
if let Index::INT64(index) = &page_indexes[0][5] {
//Todo row_group_metadata.column(0).statistics().unwrap().min_bytes() only return 4 bytes
check_native_page_index(
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i try to use

row_group_metadata
                    .column(0)
                    .statistics()
                    .unwrap()
                    .min_bytes(),

get min values from one column chunk metadata in type In64, but it return only 4 bytes...

@Ted-Jiang Ted-Jiang closed this Jul 6, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
arrow-flight Changes to the arrow-flight crate parquet Changes to the parquet crate
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Enable column page index read test for all types
1 participant