Support specifying 'latest' in model URI to get the latest version of a model regardless of the stage #5027

lichenran1234 · 2021-11-09T01:29:43Z

Signed-off-by: Chenran Li <chenran.li@databricks.com>

What changes are proposed in this pull request?

Support specifying 'latest' in model URI like models:/<model_name>/latest to get the latest version of a model regardless of the stage.

Also fix a bug in sqlalchemy_store.get_latest_versions(): when no stage argument is specified, it should return the latest versions of all stages. Currently it only returns the latest versions for active stages.

According to the top-level comment in the proto file, it should return the latest version of all stages when no stage argument is specified.
Also fixed the contradictory comments in the docstring of the function.
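The intended behavior can be illustrated with a minimal in-memory sketch. The `Version` tuple and the `latest_versions` helper below are hypothetical stand-ins for illustration only, not the actual sqlalchemy_store code; the stage names follow MLflow's usual "None", "Staging", "Production", "Archived".

```python
from collections import namedtuple

# Hypothetical stand-in for a registry row; not MLflow's ModelVersion class.
Version = namedtuple("Version", ["version", "current_stage"])


def latest_versions(versions, stages=None):
    """Return the highest version for each stage, ordered by version.

    When `stages` is None or empty, every stage that appears in `versions`
    is considered (including "None" and "Archived"), rather than only a
    default subset of active stages.
    """
    wanted = {s.lower() for s in stages} if stages else None
    best = {}
    for v in versions:
        stage = v.current_stage
        if wanted is not None and stage.lower() not in wanted:
            continue  # the caller asked for specific stages only
        if stage not in best or v.version > best[stage].version:
            best[stage] = v
    return sorted(best.values(), key=lambda v: v.version)


registry = [
    Version(1, "Archived"),
    Version(2, "Production"),
    Version(3, "Staging"),
    Version(4, "None"),
    Version(5, "None"),
]
all_stages = [v.version for v in latest_versions(registry)]
only_prod = [v.version for v in latest_versions(registry, ["Production"])]
```

With no stage argument, the sketch returns one entry per stage present in the data; with an explicit stage list, it filters to those stages.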

In #4250, Anil also suggested supporting model URI formats like models:/<model_name>/latest-n to get the nth-to-last model version. That would require a much larger change (e.g. extending the RegisteredModel class and the corresponding proto message to store all versions of a model, not only the latest ones), which does not seem worth it for a small "latest-n" feature. So I'm not doing this in this PR.

How is this patch tested?

unit tests

Release Notes

Is this a user-facing change?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release notes for MLflow users.

Users can now specify 'latest' in a model URI like models:/<model_name>/latest to get the latest version of a model regardless of the stage. Previously, users could only specify models:/<model_name>/<Stage> to get the latest version of a model in a specific stage.

What component(s), interfaces, languages, and integrations does this PR affect?

Components

Interface

area/uiux: Front-end, user experience, plotting, JavaScript, JavaScript dev server
area/docker: Docker use across MLflow's components, such as MLflow Projects and MLflow Models
area/sqlalchemy: Use of SQLAlchemy in the Tracking Service or Model Registry
area/windows: Windows support

Language

language/r: R APIs and clients
language/java: Java APIs and clients
language/new: Proposals for new client languages

Integrations

integrations/azure: Azure and Azure ML integrations
integrations/sagemaker: SageMaker integrations
integrations/databricks: Databricks integrations

How should the PR be classified in the release notes? Choose one:

rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
rn/feature - A new user-facing feature worth mentioning in the release notes
rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
rn/documentation - A user-facing documentation change worth mentioning in the release notes

ankit-db

The change looks good to me, just a few questions! I do think it may qualify as a breaking change though, so we should make sure to change the label.

ankit-db · 2021-12-23T20:16:04Z

mlflow/store/artifact/utils/models.py

        )
-    return latest[0].version
+    return max(map(lambda x: int(x.version), latest))


Just curious: why do we need to call int() here? I'm not opposed to it for safety reasons, but x.version is already a number, right?

Actually, the model version is a str (link), so it's safer to convert it to int here.
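A small illustration of why the conversion matters when versions are stored as strings. The version values below are made up for the example; taking `max` over strings compares them lexicographically, so "9" ranks above "10".

```python
# Made-up version strings, standing in for the str-typed `version` field.
versions = ["2", "9", "10"]

as_strings = max(versions)          # lexicographic comparison
as_ints = max(map(int, versions))   # numeric comparison, as in the diff above

print(as_strings, as_ints)
```

The string comparison picks "9", while the numeric comparison picks 10, which is the behavior a "latest version" lookup needs.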

ankit-db · 2021-12-23T20:24:15Z

mlflow/store/artifact/utils/models.py

@@ -46,12 +59,16 @@ def _parse_model_uri(uri):

    if parts[1].isdigit():


Given the added complexity, it may be good to have an example of each URI type in each branch so that it's clear exactly which case maps to which tuple.

Done, thanks!
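For readers following the thread, here is a rough sketch of the kind of branch-by-branch mapping being discussed. It is not the code from this PR: the function name `parse_model_uri`, the use of `ValueError`, the `(name, version, stage)` tuple shape, and the case-insensitive match on "latest" are all assumptions made for illustration.

```python
from urllib.parse import urlparse


def parse_model_uri(uri):
    """Map a models:/ URI to a (name, version, stage) tuple (illustrative)."""
    parsed = urlparse(uri)
    if parsed.scheme != "models":
        raise ValueError(f"Not a models:/ URI: {uri!r}")

    parts = parsed.path.lstrip("/").split("/")
    if len(parts) != 2 or not parts[0].strip() or not parts[1].strip():
        raise ValueError(f"Expected models:/<model_name>/<suffix>, got {uri!r}")

    name, suffix = parts
    if suffix.isdigit():
        # e.g. "models:/my_model/3" -> ("my_model", 3, None)
        return name, int(suffix), None
    if suffix.lower() == "latest":
        # e.g. "models:/my_model/latest" -> ("my_model", None, None)
        return name, None, None
    # e.g. "models:/my_model/Production" -> ("my_model", None, "Production")
    return name, None, suffix
```

In this sketch a tuple with neither version nor stage set signals "latest across all stages", so the caller can tell the three URI forms apart by which slot is filled.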

Support specifying 'latest' in model URI to get the latest version of a model regardless of the stage

c75afe4

Signed-off-by: Chenran Li <chenran.li@databricks.com>

github-actions bot added area/model-registry rn/feature labels Nov 9, 2021

fix unit test in test_model_registry.py

3500f59

Signed-off-by: Chenran Li <chenran.li@databricks.com>

lichenran1234 requested review from mparkhe and arjundc-db November 9, 2021 02:17

fix Java unit test testGetLatestModelVersions

0bfbf1d

Signed-off-by: Chenran Li <chenran.li@databricks.com>

lichenran1234 requested review from wentinghu and removed request for arjundc-db November 9, 2021 19:08

lichenran1234 requested review from sueann and removed request for mparkhe November 17, 2021 19:46

lichenran1234 requested a review from ankit-db December 21, 2021 23:08

ankit-db reviewed Dec 27, 2021

address comments

d02fe3a

Signed-off-by: Chenran Li <chenran.li@databricks.com>

lichenran1234 requested a review from ankit-db January 7, 2022 22:42

lichenran1234 closed this Jan 7, 2022

lichenran1234 deleted the version branch January 7, 2022 22:43

lichenran1234 restored the version branch January 7, 2022 22:43

lichenran1234 reopened this Jan 7, 2022

ankit-db approved these changes Jan 10, 2022

github-actions bot added the area/sqlalchemy label Jan 10, 2022

lichenran1234 merged commit d3ddd59 into mlflow:master Jan 10, 2022

lichenran1234 mentioned this pull request Jan 10, 2022

[FR] Ability to query latest model from registry #4250

Closed

23 tasks

This was referenced Jan 26, 2022

[BUG] referencing models by stage using models URI suddenly became case sensitive #5311

Closed

Fix the bug of stages in models URI being case-sensitive #5312

Merged
