
Memory usage jumps to 50G when trying to predict #6659

Closed
honzasterba opened this issue Jan 29, 2021 · 6 comments · Fixed by #7255

Comments

@honzasterba
Contributor

I have a fairly small booster and dataset, but when I try to run prediction on this dataset the memory usage jumps to 50 GB.
Here is the code to reproduce:

import xgboost as xgb
dtest = xgb.DMatrix("dmatrix.bin")      # attached test matrix
bst1 = xgb.Booster()
bst1.load_model('booster.bin')          # attached model
ypred_h1 = bst1.predict(dtest)          # memory usage spikes to ~50 GB here

Data used to reproduce attached.

data.zip

@trivialfis
Member

Just confirming that the data loading is correct: does your data really have 3,781,180 columns?

@honzasterba
Contributor Author

The original training dataset has fewer than 100 columns, but some high-cardinality categoricals lead, via one-hot encoding, to this many columns in the XGBoost training set (see the sketch below).
It should also be noted that this is a regression since 1.3.0; with 1.2.0 I did not see this memory spike.
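
To illustrate the column blow-up (a minimal sketch with made-up column names and cardinalities, using scikit-learn's OneHotEncoder rather than our actual pipeline):

import numpy as np
import scipy.sparse as sp
import xgboost as xgb
from sklearn.preprocessing import OneHotEncoder

# Hypothetical data, not the attached dmatrix.bin: two categorical columns
# drawn from ~50k possible values each, plus three numeric columns.
rng = np.random.default_rng(0)
cats = rng.integers(0, 50_000, size=(1_000, 2)).astype(str)
nums = rng.normal(size=(1_000, 3))

# One-hot encoding creates one column per distinct category value, so a
# handful of input columns expands into a very wide, mostly-zero matrix.
onehot = OneHotEncoder().fit_transform(cats)            # scipy CSR matrix
X = sp.hstack([sp.csr_matrix(nums), onehot], format="csr")
print(X.shape)   # (1000, 3 + distinct values seen in each categorical column)

dtrain = xgb.DMatrix(X)   # the DMatrix keeps the full encoded width
print(dtrain.num_col())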

@trivialfis
Member

trivialfis commented Jan 29, 2021

@ShvetsKS Would you like to help take a look? I think the threading optimization spikes the memory usage. A better way to handle this might be to put some thought into extremely sparse datasets.

Right now you can try setting nthread to 1 explicitly, or use the GPU predictor.

As a side note, #6503 should help remove the one-hot encoding.
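
For reference, a rough sketch of what the categorical path could look like once it applies here; enable_categorical was still experimental around this time and the data and parameters below are illustrative assumptions, not a confirmed replacement for the one-hot pipeline:

import pandas as pd
import xgboost as xgb

# Hypothetical frame: the categorical column stays a single pandas category
# column instead of being expanded into millions of dummy columns.
df = pd.DataFrame({
    "num_a": [0.1, 0.2, 0.3, 0.4],
    "cat_a": pd.Categorical(["a", "b", "a", "c"]),
})
y = [0, 1, 0, 1]

dtrain = xgb.DMatrix(df, label=y, enable_categorical=True)
# Experimental categorical splits were initially tied to the GPU tree method.
bst = xgb.train({"tree_method": "gpu_hist", "objective": "binary:logistic"},
                dtrain, num_boost_round=10)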

@trivialfis
Member

trivialfis commented Jan 29, 2021

bst.set_param({"nthread": 1})
# or if you have a gpu at hand
bst.set_param({"predictor": "gpu_predictor"})

@honzasterba
Contributor Author

Setting nthread to 1 helped work around the issue.

@ShvetsKS
Contributor

@trivialfis Memory usage increased because we now process kBlockOfRowsSize observations per tree to keep cache locality (previously 1 observation was processed at a time); see the rough estimate after the list below.
I think there are at least three possible options:

  1. add the possibility to change kBlockOfRowsSize via a user-provided parameter, so a value other than the default of 64 can be set
  2. implement automatic L1/L2 cache fitting (for the current example kBlockOfRowsSize would be equal to 1)
  3. as you proposed, handle extremely sparse datasets with a dedicated prediction implementation
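
For a rough sense of scale (my own back-of-the-envelope estimate, assuming each thread densifies a block of kBlockOfRowsSize rows at roughly one 4-byte slot per feature; the thread count is illustrative):

num_features = 3_781_180   # column count reported above
block_of_rows = 64         # current kBlockOfRowsSize default
bytes_per_value = 4        # assumed float-sized slot per feature
nthreads = 48              # illustrative; depends on the machine

per_thread = num_features * block_of_rows * bytes_per_value
total = per_thread * nthreads
print(f"{per_thread / 2**30:.1f} GiB per thread, ~{total / 2**30:.0f} GiB total")
# ~0.9 GiB per thread and tens of GiB across many threads, which is in the
# same ballpark as the reported 50 GB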
