Use Parquet OffsetIndex to prune IO with RowSelection #2473

thinkharderdev · 2022-08-16T22:00:08Z

Which issue does this PR close?

Closes #2426.

Rationale for this change

When we have a RowSelection and an OffsetIndex we can reduce IO by fetching only the pages selected.

This also builds on #2464 to remove InMemoryColumnChunk and unify everything to use SerializedPageReader.

What changes are included in this PR?

We can represent pre-fetched column chunks as either a "dense" encoding (just Bytes) or a "sparse" encoding which contains only the pages relevant to a given RowSelection.
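As a rough illustration of the dense/sparse split described above, here is a minimal sketch. The type name `ColumnChunkData`, its fields, and the `get` helper are assumptions made for this example, and `Vec<u8>` stands in for `Bytes` so the snippet stays self-contained; it is not necessarily the shape used in the PR.

```rust
/// Hypothetical stand-in for pre-fetched column chunk data (names are assumptions).
#[derive(Debug)]
enum ColumnChunkData {
    /// Only the selected pages, as (absolute file offset of the page, page bytes),
    /// kept sorted by offset so a page can be found with a binary search.
    Sparse { data: Vec<(usize, Vec<u8>)> },
    /// The whole column chunk, beginning at absolute file offset `offset`.
    Dense { offset: usize, data: Vec<u8> },
}

impl ColumnChunkData {
    /// Returns the buffered bytes starting at absolute file offset `start`,
    /// or `None` if that position was never fetched.
    fn get(&self, start: usize) -> Option<&[u8]> {
        match self {
            // Sparse: `start` must be exactly the beginning of a fetched page.
            ColumnChunkData::Sparse { data } => data
                .binary_search_by_key(&start, |(offset, _)| *offset)
                .ok()
                .map(|idx| data[idx].1.as_slice()),
            // Dense: translate the absolute offset into an index into the chunk.
            ColumnChunkData::Dense { offset, data } => {
                let relative = start.checked_sub(*offset)?;
                data.get(relative..)
            }
        }
    }
}
```

With this shape a page reader can ask for bytes by absolute offset without knowing which encoding backs the chunk; a sparse chunk simply answers `None` for pages that were pruned.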

Also remove InMemoryColumnChunk to help unify the sync and async parquet paths.

Are there any user-facing changes?

thinkharderdev · 2022-08-16T22:00:45Z

@tustvold

Leaving this in draft until #2464 is merged as this includes those changes.

thinkharderdev · 2022-08-16T22:05:53Z

parquet/src/file/page_index/index_reader.rs

@@ -65,7 +65,7 @@ pub fn read_pages_locations<R: ChunkReader>(
    let (offset, total_length) = get_location_offset_and_total_length(chunks)?;

    //read all need data into buffer
-    let mut reader = reader.get_read(offset, reader.len() as usize)?;
+    let mut reader = reader.get_read(offset, total_length)?;


Pretty sure this was a bug

Yup, I remember fixing it in a PR that got abandoned at some point
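To spell out why the old call looks wrong: judging from the diff, the second argument to `get_read` is a length, so passing `reader.len()` asks for a whole file's worth of bytes starting at `offset`. A minimal sketch over an in-memory slice (the `read_range` helper is hypothetical, and how a real `ChunkReader` reacts to an oversized length depends on the implementation):

```rust
/// Hypothetical helper: returns `length` bytes starting at `offset`,
/// or `None` if the requested range runs past the end of `file`.
fn read_range(file: &[u8], offset: usize, length: usize) -> Option<&[u8]> {
    let end = offset.checked_add(length)?;
    file.get(offset..end)
}
```

For a 100-byte file with a 15-byte index at offset 80, `read_range(&file, 80, 15)` succeeds, while `read_range(&file, 80, file.len())` asks for bytes 80..180 and returns `None`.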

tustvold · 2022-08-16T22:18:18Z

Thank you for this, I'll review first thing tomorrow. I like that you've found a way to allow sharing the page muxing logic in SerializedPageReader 👍

tustvold

Looking good, will review again once rebased, but mostly minor nits

tustvold · 2022-08-17T09:03:05Z

object_store/src/local.rs

@@ -1068,6 +1068,7 @@ mod tests {
        integration.head(&path).await.unwrap();
    }

+    #[ignore]


Sorry, this test fails on my machine because of a permissions issue. Meant to revert before submitting.

tustvold · 2022-08-17T09:05:31Z

parquet/src/arrow/arrow_reader/selection.rs

+        (mask, ranges)
+    }
+
+    pub fn selectors(&self) -> &[RowSelector] {


This doesn't appear to be used, so I think it can go. I've been trying to avoid exposing the internal layout of this type externally.

tustvold · 2022-08-17T09:07:02Z

parquet/src/arrow/arrow_reader/selection.rs

+        let (mask, ranges) = selection.page_mask(&index);
+
+        assert_eq!(mask, vec![false, true, true, false, true, true, false]);
+        assert_eq!(ranges, vec![10..20, 20..30, 40..50, 50..60]);


Could we get a test where the final PageLocation is selected?

tustvold · 2022-08-17T09:10:42Z

parquet/src/arrow/arrow_reader/selection.rs

@@ -116,6 +118,62 @@ impl RowSelection {
        Self { selectors }
    }

+    /// Given an offset index, return a mask indicating which pages are selected along with their locations by `self`
+    pub fn page_mask(


Suggested change

-    pub fn page_mask(
+    pub(crate) fn page_mask(

I don't think this is likely to be useful outside the crate

tustvold · 2022-08-17T09:16:39Z

parquet/src/arrow/arrow_reader/selection.rs

+    pub fn page_mask(
+        &self,
+        page_locations: &[PageLocation],
+    ) -> (Vec<bool>, Vec<Range<usize>>) {


It seems strange to me that this method would return Vec<Range<usize>> when it is called page_mask, and the caller clearly already has &[PageLocation] that can easily be combined with the mask...

The idea was to just do it in one shot to avoid iterating over the locations again to get the ranges, but perhaps it's better to avoid overloading.

Edit: Looking at this again, the mask was part of a previous design that is no longer relevant, so I think we can just rename this and only return the ranges.
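For reference, a ranges-only version could look roughly like the sketch below. The name `scan_ranges` is an assumption, and `RowSelector` and `PageLocation` here are simplified stand-ins (plain `usize` fields) rather than the crate's actual types. It also treats the last page as open-ended, which covers the case where the final `PageLocation` is selected.

```rust
use std::ops::Range;

/// Simplified stand-in for the crate's `RowSelector`: a run of rows to read or skip.
#[derive(Debug, Clone, Copy)]
struct RowSelector {
    row_count: usize,
    skip: bool,
}

/// Simplified stand-in for an offset index entry (field types are assumptions).
#[derive(Debug, Clone, Copy)]
struct PageLocation {
    offset: usize,
    compressed_page_size: usize,
    first_row_index: usize,
}

/// Returns the byte range of every page that contains at least one selected row.
fn scan_ranges(selectors: &[RowSelector], pages: &[PageLocation]) -> Vec<Range<usize>> {
    // Convert the selectors into half-open ranges of selected row indices.
    let mut selected_rows = Vec::new();
    let mut row = 0;
    for s in selectors {
        if !s.skip && s.row_count > 0 {
            selected_rows.push(row..row + s.row_count);
        }
        row += s.row_count;
    }

    let mut ranges = Vec::new();
    for (i, page) in pages.iter().enumerate() {
        // A page covers rows up to the next page's first row; the final page
        // has no successor, so it is treated as extending to the end.
        let page_end = pages
            .get(i + 1)
            .map(|p| p.first_row_index)
            .unwrap_or(usize::MAX);
        let selected = selected_rows
            .iter()
            .any(|r| r.start < page_end && r.end > page.first_row_index);
        if selected {
            ranges.push(page.offset..page.offset + page.compressed_page_size);
        }
    }
    ranges
}
```

This checks every selected run against every page for clarity; since both lists are ordered by row, a single forward pass over the two would avoid the nested scan.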

tustvold · 2022-08-17T09:33:31Z