Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support reading PageIndex from column metadata #1761

Closed
Tracked by #1749
Ted-Jiang opened this issue May 28, 2022 · 0 comments · Fixed by #1762
Closed
Tracked by #1749

Support reading PageIndex from column metadata #1761

Ted-Jiang opened this issue May 28, 2022 · 0 comments · Fixed by #1762
Labels
enhancement Any new improvement worthy of a entry in the changelog parquet Changes to the parquet crate

Comments

@Ted-Jiang
Copy link
Member

Ted-Jiang commented May 28, 2022

Is your feature request related to a problem or challenge? Please describe what you are trying to do.
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]
Read PageLocation and PageIndex from parquet.
which
PageLocation like in https://github.com/apache/parquet-format/blame/master/src/main/thrift/parquet.thrift#L909-L924

PageIndex like https://github.com/apache/parquet-format/blame/master/src/main/thrift/parquet.thrift#L938-L959
has
NativeIndex for fix length type
ByteIndex for un_length bytes
BoolIndex

Get this info in memory then we can apply page-level filter in future.
Describe the solution you'd like
A clear and concise description of what you want to happen.

Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.

Additional context
Add any other context or screenshots about the feature request here.

@Ted-Jiang Ted-Jiang added the enhancement Any new improvement worthy of a entry in the changelog label May 28, 2022
@alamb alamb added the parquet Changes to the parquet crate label Jun 9, 2022
@alamb alamb changed the title Prepare and construct index from col metadata for skipping pages at reading Support reading PageIndex from column metadata Jun 9, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement Any new improvement worthy of a entry in the changelog parquet Changes to the parquet crate
Projects
None yet
2 participants