XGBoost support #4204

mikaelmv · 2021-03-26T07:01:42Z

Thank you for submitting an issue. Please refer to our issue policy for additional information about bug reports. For help with debugging your code, please refer to Stack Overflow.

Please fill in this bug report template to ensure a timely and thorough response.

Willingness to contribute

The MLflow Community encourages bug fix contributions. Would you or another member of your organization be willing to contribute a fix for this bug to the MLflow code base?

Yes. I can contribute a fix for this bug independently.
Yes. I would be willing to contribute a fix for this bug with guidance from the MLflow community.
No. I cannot contribute a bug fix at this time.

System information

Have I written custom code (as opposed to using a stock example script provided in MLflow): Custom
OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Windows 10
MLflow installed from (source or binary): source
MLflow version (run mlflow --version): 1.14.1
Python version: 3.7
npm version, if running the dev UI:
Exact command to reproduce:

Describe the problem

Hi,

I am using a Scikit-Learn wrapper for XGBoost (XGBRegressor).

Now when logging the model with mlflow (mlflow.xgboost.log_model), it crashes as anticipated by the official documentation:

https://www.mlflow.org/docs/latest/python_api/mlflow.xgboost.html

I noticed that if I use mlflow.sklearn.log_model instead, the artifact gets saved as expected.

What are the implications of using mlflow.sklearn.log_model with an XGBoost model? Not clear to me.
What alternatives do I have to log my XGBRegressor model correctly with mlflow?

Obviously if I instead use a RandomForest, then mlflow.sklearn.log_model works perfectly as expected, so again my issue is to log the XGBRegressor model.

Code to reproduce issue

Provide a reproducible test case that is the bare minimum necessary to generate the problem.

mlflow.xgboost.log_model(pipeline, "model") (where pipeline combines transformers and the XGBRegressor)

Other info / logs

    151
    152     # Save an XGBoost model
--> 153     xgb_model.save_model(model_data_path)
    154
    155     conda_env_subpath = "conda.yaml"

AttributeError: 'Pipeline' object has no attribute 'save_model'

What component(s), interfaces, languages, and integrations does this bug affect?

Components

area/artifacts: Artifact stores and artifact logging
area/model-registry: Model Registry service, APIs, and the fluent client calls for Model Registry


dmatrix · 2021-04-01T17:59:51Z

@mikaelmv Thanks for filing this. Can you give me a small example that reproduces it, which I can run myself? I know that MLflow autologging does not support the scikit-learn API.

chedikouki · 2022-04-12T12:25:07Z

Is there any solution? I hit the same error:
mlflow.xgboost.save_model(pipeline, 'Model')
AttributeError: 'Pipeline' object has no attribute 'save_model'

harupy · 2022-06-03T00:24:36Z

#4954 fixed this issue.

mikaelmv added the bug (Something isn't working) label Mar 26, 2021

github-actions bot added the area/artifacts (Artifact stores and artifact logging) and area/model-registry (Model registry, model registry APIs, and the fluent client calls for model registry) labels Mar 26, 2021

dmatrix added the needs author feedback (Issue is waiting for the author to respond) label Apr 1, 2021

stale bot removed the needs author feedback (Issue is waiting for the author to respond) label Apr 12, 2022

harupy closed this as completed Jun 3, 2022

harupy reopened this Jun 3, 2022
