Parquet: Support read process for PageIndex (ColumnIndex / OffsetIndex) #1749

Closed
1 of 5 tasks
Tracked by #1705
Ted-Jiang opened this issue May 26, 2022 · 0 comments
Labels
enhancement Any new improvement worthy of an entry in the changelog

Ted-Jiang commented May 26, 2022

Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Follow-up to #1705.
Goals:
Index granularity: per page (rather than per row group).
Make selective scans efficient on both sorted and unsorted columns (maybe).
No additional I/O or deserialization cost for non-selective filters.

Describe the solution you'd like
The PageIndex is already stored just above the footer, and its offset and length are recorded in the column metadata.
Using these indexes together with row alignment, we can filter out pages without decompressing or decoding them.
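
As a rough illustration of the idea, the sketch below prunes pages of a filter column using per-page min/max values, converts the surviving pages into row ranges, and then uses those row ranges to pick the pages of a second column whose page boundaries are not aligned with the first. The types `PageStats` and `PageLocation` and the functions `select_row_ranges` and `pages_to_read` are hypothetical, simplified stand-ins for the ColumnIndex / OffsetIndex entries, not the actual parquet crate API; only `i64` statistics and a closed-range predicate are assumed.

```rust
use std::ops::Range;

/// Hypothetical, simplified stand-in for one page's entry in a ColumnIndex.
struct PageStats {
    min: i64,
    max: i64,
}

/// Hypothetical, simplified stand-in for one page's entry in an OffsetIndex.
struct PageLocation {
    /// Index of the page's first row within the row group.
    first_row_index: u64,
}

/// Row range covered by page `i`; the last page ends at `num_rows`.
fn page_rows(locations: &[PageLocation], num_rows: u64, i: usize) -> Range<u64> {
    let end = locations.get(i + 1).map_or(num_rows, |next| next.first_row_index);
    locations[i].first_row_index..end
}

/// Returns the row ranges of pages whose [min, max] may overlap [lo, hi].
/// Only index metadata is consulted; no page is decompressed or decoded.
fn select_row_ranges(
    stats: &[PageStats],
    locations: &[PageLocation],
    num_rows: u64,
    lo: i64,
    hi: i64,
) -> Vec<Range<u64>> {
    assert_eq!(stats.len(), locations.len());
    let mut ranges: Vec<Range<u64>> = Vec::new();
    for (i, s) in stats.iter().enumerate() {
        // Skip pages that cannot contain a matching value.
        if s.max < lo || s.min > hi {
            continue;
        }
        let rows = page_rows(locations, num_rows, i);
        // Merge with the previous range when the pages are adjacent.
        let adjacent = matches!(ranges.last(), Some(last) if last.end == rows.start);
        if adjacent {
            ranges.last_mut().unwrap().end = rows.end;
        } else {
            ranges.push(rows);
        }
    }
    ranges
}

/// Row alignment: returns the indexes of pages in another column that
/// overlap any of the selected row ranges and therefore must be read.
fn pages_to_read(
    locations: &[PageLocation],
    num_rows: u64,
    ranges: &[Range<u64>],
) -> Vec<usize> {
    (0..locations.len())
        .filter(|&i| {
            let rows = page_rows(locations, num_rows, i);
            ranges.iter().any(|r| r.start < rows.end && rows.start < r.end)
        })
        .collect()
}
```

Calling `select_row_ranges` on the filter column first and then `pages_to_read` on every other projected column keeps the work proportional to the index size. A non-selective filter simply yields one range covering the whole row group.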

Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.

Additional context
Add any other context or screenshots about the feature request here.

Ted-Jiang added the enhancement label May 26, 2022
Ted-Jiang changed the title from "Parquet: Read support for PageIndex (ColumnIndex / OffsetIndex)" to "Parquet: Support read for PageIndex (ColumnIndex / OffsetIndex)" May 26, 2022
Ted-Jiang changed the title from "Parquet: Support read for PageIndex (ColumnIndex / OffsetIndex)" to "Parquet: Support read process for PageIndex (ColumnIndex / OffsetIndex)" May 26, 2022