Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix overlap so that set_index doesn't drop rows #9423

Merged
merged 3 commits into from Sep 15, 2022

Conversation

jsignell
Copy link
Member

@pavithraes
Copy link
Member

I've merged main to fix the test_parquet[fastparquet] failures :)

@jsignell
Copy link
Member Author

I am pretty confident of this change, so I will plan to merge this week unless there are comments.

@@ -1182,6 +1182,17 @@ def test_set_index_overlap_2():
assert ddf2.npartitions == 8


def test_set_index_overlap_3():
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I see that we have test_set_index_overlap and then _2 and _3. I thought the only thing changing was the number of partitions but it seems it's not just that.

Would it be better to have either a comment/docstring to these tests that explains better what each test is accomplishing?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yeah it's a lazy name, I did link out to the issue that has the full context. But I'm happy to change the name

@jsignell jsignell merged commit f45df2b into dask:main Sep 15, 2022
@jsignell jsignell deleted the fix-overlap branch September 15, 2022 20:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

ddf.set_index with sorted=True drops rows Results of dask.multi.merge_asof depends on npartitions
3 participants