Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix flaky ray tests #3430

Merged
merged 9 commits into from Mar 1, 2022
Merged

Fix flaky ray tests #3430

merged 9 commits into from Mar 1, 2022

Conversation

EnricoMi
Copy link
Collaborator

@EnricoMi EnricoMi commented Feb 28, 2022

Ray tests assert that available ray resources after the test are identical to before the test. This turns out to be flaky.

This removes the check_resources assertion as it has race conditions. Further, this restores the CUDA_VISIBLE_DEVICES environment variable after tests finish that modify it.

And this improves assertion context to further debug the issue of 8 GPUs though 4 workers have been started.

@github-actions
Copy link

github-actions bot commented Feb 28, 2022

Unit Test Results

     802 files   -   24       802 suites   - 24   9h 40m 4s ⏱️ + 24m 59s
     722 tests ±    0       679 ✔️ ±    0       43 💤 ±    0  0 ±0 
17 373 runs   - 474  12 381 ✔️  - 248  4 992 💤  - 226  0 ±0 

Results for commit d627a27. ± Comparison against base commit 7b5346e.

♻️ This comment has been updated with latest results.

@github-actions
Copy link

github-actions bot commented Feb 28, 2022

Unit Test Results (with flaky tests)

     886 files   -      74       886 suites   - 74   9h 59m 22s ⏱️ + 5m 27s
     722 tests ±       0       679 ✔️ ±    0       43 💤 ±    0  0 ±0 
19 365 runs   - 1 202  13 617 ✔️  - 572  5 748 💤  - 630  0 ±0 

Results for commit d627a27. ± Comparison against base commit 7b5346e.

♻️ This comment has been updated with latest results.

@EnricoMi EnricoMi force-pushed the branch-fix-flaky-ray-tests branch 5 times, most recently from 06e9969 to 4e9c44c Compare March 1, 2022 11:51
@EnricoMi EnricoMi mentioned this pull request Mar 1, 2022
Signed-off-by: Enrico Minack <github@enrico.minack.dev>
Signed-off-by: Enrico Minack <github@enrico.minack.dev>
This reverts commit 6f9e7f9.

Signed-off-by: Enrico Minack <github@enrico.minack.dev>
Signed-off-by: Enrico Minack <github@enrico.minack.dev>
Signed-off-by: Enrico Minack <github@enrico.minack.dev>
Signed-off-by: Enrico Minack <github@enrico.minack.dev>
Signed-off-by: Enrico Minack <github@enrico.minack.dev>
This reverts commit df5a49b.

Signed-off-by: Enrico Minack <github@enrico.minack.dev>
@EnricoMi EnricoMi marked this pull request as ready for review March 1, 2022 19:23
# The code after the yield will run as teardown code.
ray.shutdown()
finally:
if orig_devices:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

None and "" have different meanings for CUDA_VISIBLE_DEVICES. Probably safer to be explicit:

if orig_devices is not None:
    ...

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Alright, will better do that then.

Signed-off-by: Enrico Minack <github@enrico.minack.dev>
@EnricoMi EnricoMi merged commit b553974 into master Mar 1, 2022
@EnricoMi EnricoMi deleted the branch-fix-flaky-ray-tests branch March 1, 2022 20:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants