fix resume logger #3375
horovod/spark/lightning/remote.py
@@ -123,15 +123,15 @@ def train(serialized_model):
train_logger = TensorBoardLogger(logs_path)
print(f"Setup logger: Using TensorBoardLogger: {train_logger}")

elif isinstance(logger, CometLogger) and logger._experiment_key is None:
# Resume logger experiment key if passed correctly from CPU.
elif isinstance(logger, CometLogger) and logger_experiment_key is None:
Did you mean "is not None"? This looks like it creates a new CometLogger with an empty experiment.
Good catch. Yes, it should be "is not None".
Unit Test Results: 438 files (-364), 438 suites (-364), 6h 15m 21s (-2h 25m 27s). For more details on these failures, see this check. Results for commit 633c82d, compared against base commit a5edcd0. This pull request skips 95 tests.
Unit Test Results (with flaky tests): 480 files (-410), 480 suites (-410), 7h 22m 20s (-1h 58m 34s). For more details on these failures, see this check. Results for commit 633c82d, compared against base commit a5edcd0. This pull request skips 95 tests.
Signed-off-by: Peng Zhang <pengz@uber.com>
Description
The Comet logger fails locally if save_dir is changed without resuming the experiment.
The logger must always resume the experiment with the new save_dir path passed in its arguments.
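
A minimal sketch of the intended behavior, assuming the corrected "is not None" check from the review above. StubCometLogger and resume_logger are hypothetical names used only for illustration; they are not the real CometLogger class or the code in remote.py, and the paths and key are made up.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class StubCometLogger:
    """Hypothetical stand-in for a Comet logger; not the real class."""
    save_dir: str
    experiment_key: Optional[str] = None


def resume_logger(logger, logger_experiment_key, logs_path):
    """Return a logger bound to logs_path, resuming the experiment if a key exists."""
    if isinstance(logger, StubCometLogger) and logger_experiment_key is not None:
        # A key was passed in: rebuild the logger so it points at the new
        # save_dir while continuing the same experiment.
        return StubCometLogger(save_dir=logs_path,
                               experiment_key=logger_experiment_key)
    # No key to resume from: leave the logger unchanged.
    return logger


# Illustrative values only.
original = StubCometLogger(save_dir="/tmp/driver_logs", experiment_key="abc123")
resumed = resume_logger(original, original.experiment_key, "/tmp/worker_logs")
print(resumed)
```

With "is None" instead, the branch would only run when there is no key to resume, which matches the reviewer's concern about creating a logger with an empty experiment.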