tag nested runs with parent run ID #4197

Merged
jinzhang21 merged 1 commit into mlflow:master on Oct 24, 2021

Conversation

yitao-li
Contributor

@yitao-li yitao-li commented Mar 23, 2021

Signed-off-by: Yitao Li <yitao@rstudio.com>

What changes are proposed in this pull request?

The MLflow R client should tag a nested run with its parent run ID. The 'mlflow.parentRunId' tag is required for nested runs to be displayed properly in the MLflow UI.
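
The behavior described above can be sketched in base R, without calling the mlflow package. This is only an illustration under stated assumptions: the `run_state` environment and the `start_run()` / `end_run()` helpers are hypothetical and merely simulate a stack of active runs; they are not the R client's actual internals. The only name taken from this PR is the tag key `mlflow.parentRunId`.

```r
# Hypothetical in-memory model of active runs (not the mlflow R client's internals).
run_state <- new.env()
run_state$stack <- list()   # active runs, innermost last
run_state$counter <- 0L

start_run <- function(nested = FALSE) {
  if (length(run_state$stack) > 0 && !nested) {
    stop("A run is already active; pass nested = TRUE to start a nested run.")
  }
  run_state$counter <- run_state$counter + 1L
  tags <- list()
  if (length(run_state$stack) > 0) {
    parent <- run_state$stack[[length(run_state$stack)]]
    # Tag the child with the ID of the currently active (parent) run.
    tags[["mlflow.parentRunId"]] <- parent$run_id
  }
  run <- list(run_id = sprintf("run-%d", run_state$counter), tags = tags)
  run_state$stack[[length(run_state$stack) + 1L]] <- run
  run
}

end_run <- function() {
  n <- length(run_state$stack)
  if (n == 0) stop("No active run to end.")
  run <- run_state$stack[[n]]
  run_state$stack[[n]] <- NULL   # pop the innermost run
  run
}

parent <- start_run()
child <- start_run(nested = TRUE)
```

In this sketch the top-level run carries no `mlflow.parentRunId` tag, while the nested run carries the ID of the run that was active when it started, which is the relationship the UI needs in order to group runs.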

How is this patch tested?

Unit test

Release Notes

Is this a user-facing change?

  • No. You can skip the rest of this section.
  • Yes. Give a description of this change to be included in the release notes for MLflow users.

The MLflow R client now correctly tags a nested run with its parent run ID.

What component(s), interfaces, languages, and integrations does this PR affect?

Components

  • area/artifacts: Artifact stores and artifact logging
  • area/build: Build and test infrastructure for MLflow
  • area/docs: MLflow documentation pages
  • area/examples: Example code
  • area/model-registry: Model Registry service, APIs, and the fluent client calls for Model Registry
  • area/models: MLmodel format, model serialization/deserialization, flavors
  • area/projects: MLproject format, project running backends
  • area/scoring: Local serving, model deployment tools, spark UDFs
  • area/server-infra: MLflow server, JavaScript dev server
  • area/tracking: Tracking Service, tracking client APIs, autologging

Interface

  • area/uiux: Front-end, user experience, JavaScript, plotting
  • area/docker: Docker use across MLflow's components, such as MLflow Projects and MLflow Models
  • area/sqlalchemy: Use of SQLAlchemy in the Tracking Service or Model Registry
  • area/windows: Windows support

Language

  • language/r: R APIs and clients
  • language/java: Java APIs and clients
  • language/new: Proposals for new client languages

Integrations

  • integrations/azure: Azure and Azure ML integrations
  • integrations/sagemaker: SageMaker integrations
  • integrations/databricks: Databricks integrations

#closes #4172

How should the PR be classified in the release notes? Choose one:

  • rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
  • rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
  • rn/feature - A new user-facing feature worth mentioning in the release notes
  • rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
  • rn/documentation - A user-facing documentation change worth mentioning in the release notes

@github-actions github-actions bot added the area/tracking, language/r, and rn/feature labels Mar 23, 2021
Signed-off-by: Yitao Li <yitao@rstudio.com>
@yitao-li
Contributor Author

yitao-li commented Mar 23, 2021

@harupy @kevinykuo @tomasatdatabricks @jdlesage Could any of you take a look at this PR when you have time?
It's a small change that makes the MLflow R client consistent with the MLflow Python client in how it tags a nested run with its parent run ID.

Collaborator

@jinzhang21 jinzhang21 left a comment

Sorry for the late response. LGTM overall, with one minor comment on the test.

@@ -77,6 +77,13 @@ test_that("mlflow_start_run()/mlflow_end_run() works properly with nested runs",
expect_equal(mlflow:::mlflow_get_active_run_id(), runs[[i]]$run_uuid)
run <- mlflow_end_run(client = client, run_id = runs[[i]]$run_uuid)
expect_identical(run$run_uuid, runs[[i]]$run_uuid)
if (i > 1) {
Collaborator

Could you also test that parentRunId doesn't exist for the first (non-nested) run?
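
One way the check requested here might look, sketched in base R against mock data rather than a live tracking server. The assumptions are loud ones: `get_tag()` and `make_run()` are hypothetical helpers, and the tags are modeled as a data frame with `key` and `value` columns, which may not match what the R client actually returns. Only `run_uuid` and `mlflow.parentRunId` come from this PR.

```r
# Hypothetical helper: look up a tag value in a data frame with `key` and `value` columns.
get_tag <- function(tags, key) {
  value <- tags$value[tags$key == key]
  if (length(value) == 0) NULL else value[[1]]
}

# Hypothetical mock of a run; a real test would fetch runs from the tracking server.
make_run <- function(run_uuid, parent_id = NULL) {
  tags <- if (is.null(parent_id)) {
    data.frame(key = character(0), value = character(0), stringsAsFactors = FALSE)
  } else {
    data.frame(key = "mlflow.parentRunId", value = parent_id, stringsAsFactors = FALSE)
  }
  list(run_uuid = run_uuid, tags = tags)
}

# Outermost run first, each later run nested inside the previous one.
runs <- list(
  make_run("run-1"),
  make_run("run-2", parent_id = "run-1"),
  make_run("run-3", parent_id = "run-2")
)

for (i in seq_along(runs)) {
  parent_id <- get_tag(runs[[i]]$tags, "mlflow.parentRunId")
  if (i > 1) {
    # Nested runs must point at the run one level up.
    stopifnot(identical(parent_id, runs[[i - 1]]$run_uuid))
  } else {
    # The first (non-nested) run must not carry the tag at all.
    stopifnot(is.null(parent_id))
  }
}
```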

@jinzhang21
Collaborator

Haven't heard from the author for a week. Merging as is.

@jinzhang21 jinzhang21 merged commit db4c318 into mlflow:master Oct 24, 2021
Labels
  • area/tracking: Tracking service, tracking client APIs, autologging
  • language/r: R APIs and clients
  • rn/feature: Mention under Features in Changelogs.
Development

Successfully merging this pull request may close these issues.

[FR] Add support for tagging nested runs as such in R client
2 participants