DDP strategy doesn't work for on_validation_epoch_end, always hang #19783
Labels
logging
Related to the `LoggerConnector` and `log()`
question
Further information is requested
ver: 2.1.x
Bug description
My code looks like the code below. I want to compute a validation metric over the entire validation dataset, so I append the results of every batch to a list and then compute the metric in `on_validation_epoch_end`.
It works fine on a single GPU, but when I use the DDP strategy to do the same thing, it always fails: validation appears to hang.
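Independent of the original reproduction code, the accumulate-then-compute pattern described above can be sketched without Lightning so that it runs standalone. This is a minimal illustration under stated assumptions, not the reporter's code: `ValAccumulator`, `accuracy`, and `fake_all_gather` are hypothetical names, the two "ranks" are simulated in one process, and accuracy stands in for whatever metric is actually computed. It shows that under DDP each process only holds the results for its own shard of the validation set, so a metric over the entire dataset needs the per-rank lists to be gathered first.

```python
# Framework-free stand-in: ValAccumulator, accuracy and fake_all_gather are
# hypothetical names, not Lightning APIs. Only the hook names mirror Lightning.

def accuracy(records):
    """Accuracy from a list of (num_correct, num_samples) pairs."""
    correct = sum(c for c, _ in records)
    total = sum(n for _, n in records)
    return correct / total


class ValAccumulator:
    def __init__(self):
        self.outputs = []  # per-process buffer, one entry per validation batch

    def validation_step(self, batch):
        preds, targets = batch
        correct = sum(int(p == t) for p, t in zip(preds, targets))
        self.outputs.append((correct, len(targets)))

    def on_validation_epoch_end(self, gather):
        # `gather` stands in for a collective that every rank must call.
        records = gather(self.outputs)
        self.outputs.clear()  # reset so the next epoch starts empty
        return accuracy(records)


batches = [
    ([1, 0], [1, 1]),  # 1 of 2 correct
    ([1, 1], [1, 1]),  # 2 of 2 correct
    ([0, 0], [0, 1]),  # 1 of 2 correct
    ([0, 1], [0, 1]),  # 2 of 2 correct
]

# Two simulated ranks; batches are dealt round-robin, similar in spirit to
# how a distributed sampler splits the validation set.
ranks = [ValAccumulator(), ValAccumulator()]
for i, batch in enumerate(batches):
    ranks[i % 2].validation_step(batch)

# Without gathering, each rank sees only its own shard.
local_metrics = [accuracy(r.outputs) for r in ranks]

# Simulated all-gather: every rank receives the records of all ranks.
snapshot = [list(r.outputs) for r in ranks]

def fake_all_gather(_local_records):
    return [rec for shard in snapshot for rec in shard]

global_metrics = [r.on_validation_epoch_end(fake_all_gather) for r in ranks]
print(local_metrics, global_metrics)
```

In a real `LightningModule`, the role of the simulated gather could be played by `self.all_gather(...)` or by a TorchMetrics metric object that synchronizes its own state. In general, a collective call has to be reached by every rank; if only some ranks call it, the others wait indefinitely, which is one possible way for validation to hang.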
What version are you seeing the problem on?
v2.1, v2.2
How to reproduce the bug
Error messages and logs
Environment
More info
No response
cc @carmocca