Fixing incorrect model parse when XGB buf starts with 'binff' or 'binfn' #2162

TheZL · 2021-09-02T17:43:20Z

In XGBoost 1.1, the output of .save_raw() gained a 'binf' prefix at the start of the model buffer.

#1220 addressed this by using lstrip to remove 'binf'. However, lstrip(b'binf') removes any leading run of the bytes b, i, n and f, so it has the unintended consequence of also stripping into buffers that begin with 'binff', 'binfn', 'binfb' or 'binfi', which occasionally occur as valid starts to a buffer.

The fix here is to check for exactly 'binf', and move the buffer forward by 4 if the prefix is exactly 'binf'.
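A minimal sketch of the difference, using made-up byte strings rather than real model dumps (the payload bytes and the two function names below are hypothetical, chosen only for illustration):

```python
# Hypothetical buffers: a 'binf' prefix followed by made-up payload bytes.
PREFIX = b"binf"

def strip_with_lstrip(buf):
    # Old behaviour: lstrip treats b"binf" as a set of bytes to drop,
    # so it keeps removing any leading b, i, n or f.
    return buf.lstrip(PREFIX)

def strip_exact(buf):
    # Proposed behaviour: drop exactly the 4-byte prefix, nothing more.
    return buf[4:] if buf[:4] == PREFIX else buf

for raw in (b"binf\x00\x01payload", b"binff\x00payload", b"binfn\x00payload"):
    # The two results differ whenever the payload itself starts with b, i, n or f.
    print(raw, strip_with_lstrip(raw), strip_exact(raw))
```

With the second and third buffers, the lstrip variant also eats the first payload byte, which is the corruption described above.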

TheZL · 2021-09-07T16:45:42Z

Hi @slundberg, I believe this PR fixes #1864 and addresses its root cause, which stems from an unintended side effect of #1220. Please update me when you can, thank you!

lrjball

Well spotted @TheZL! This bug crept in because .lstrip treats its argument as a set of characters and keeps stripping leading characters until it reaches one that is not in that set. What we want here is closer to the behaviour of .removeprefix, but that was only added in Python 3.9, so it can't be used here because shap supports older Python versions.

Maybe this could be written slightly more succinctly as something like

self.buf = xgb_model.save_raw()
if self.buf.startswith(b'binf'):
    self.buf = self.buf[4:]
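
For comparison, a version-agnostic helper could use .removeprefix where it exists and fall back to the startswith check otherwise. This is only a sketch; remove_prefix is a hypothetical name, not something that exists in shap:

```python
def remove_prefix(buf, prefix):
    """Return buf without a leading prefix; leave it unchanged otherwise."""
    if hasattr(buf, "removeprefix"):
        # Python 3.9+: bytes and bytearray both provide removeprefix.
        return buf.removeprefix(prefix)
    # Older Pythons: same result via startswith and slicing.
    if buf.startswith(prefix):
        return buf[len(prefix):]
    return buf

# Only the first four bytes are removed, even though an 'f' follows them.
print(remove_prefix(bytearray(b"binffoo"), b"binf"))
```

The plain startswith version above is probably simpler for a single call site; the helper only matters if the same check is needed in several places.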

Have you been able to find an example of a model where the raw dump starts with 'binff' or something similar? If there is an easy example, it would be good to add it as a test case; if not, this should be merged either way, as it is clearly better than the current bug.

@slundberg , thoughts?

TheZL · 2021-12-30T00:05:23Z

Hi @lrjball, thanks for the reply! I agree with your opinion on this issue.
In our data, we found that the utf-8 error occurred when the base score of the XGBoost model was set to certain values, e.g., base_score = 29.709193548387095. To test the code, we can run a script like this:

from xgboost import XGBRegressor
import numpy as np
import shap
# Create input data
X = np.array([[1,2,3,4,5],[3,3,3,2,4]])
y = np.array([1,0])
# Fit an XGBoost model with base_score equal to 29.709193548387095
model = XGBRegressor(base_score=29.709193548387095)
model.fit(X,y,eval_metric="rmse")
explainer = shap.TreeExplainer(model)
shap_result = explainer.shap_values(X)
# Examine the first 10 bytes of the raw buffer
buf = model.get_booster().save_raw()[0:10]
print(buf)

The committed code on this branch works well with the example. I also tried your suggested code, but I got an error for it:
AttributeError: 'bytearray' object has no attribute 'startwith'
Maybe we need to convert bytearray to string first?
Thanks again for your thoughtful feedback!

lrjball · 2021-12-30T18:18:14Z

Looks like that is just a typo, it should be startswith, not startwith.
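
On the conversion question: no conversion to a string should be needed, since bytearray supports startswith with a bytes prefix. A quick check, with a made-up buffer standing in for the save_raw() output:

```python
# Made-up stand-in for the bytearray returned by save_raw().
buf = bytearray(b"binff\x00rest-of-model")

has_prefix = buf.startswith(b"binf")  # bytearray accepts a bytes prefix
if has_prefix:
    buf = buf[4:]  # slicing a bytearray yields another bytearray

# Show the resulting type and the first two remaining bytes.
print(type(buf).__name__, bytes(buf[:2]))
```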

Good one on the example though, I've tried a few others and e.g. base_score=1.3 seems to be another example. If you are comfortable with pytest, it would be worth adding a test function to tests/explainers/test_tree.py to check that the provided example model can be loaded properly when using XGBTreeModelLoader, and that the params are as expected. That way if any changes are made to the code in future, we can make sure this issue doesn't happen again! Alternatively I can add the test function in, let me know what you prefer.

TheZL · 2021-12-31T00:26:45Z

@lrjball Yeah, you are right. The code works well with "startswith". I have updated the code accordingly on this branch.
If you don't mind, could you please add the test function in? I understand that it is important to add the test example for future changes, but I am not quite familiar with pytest yet.
Thank you for your time and suggestions!

TheZL · 2022-02-07T17:32:42Z

Hi @lrjball, Just want to follow up with the progress on this issue. Will the bug be fixed in the next released version of shap? We are using the shap package for an application study. Currently we manually adjust the input data when the error occurs. If this bug could be fixed in the next release, that will be very helpful. Thanks!

lrjball · 2022-02-28T21:36:57Z

Hi @TheZL, thanks for your patience. I have added a test for your update and raised a PR to your branch (TheZL#1). If you could merge that PR then this should be ready to go in.

…ction Added test for buffer strip update

TheZL · 2022-03-01T22:07:26Z

@lrjball Thanks for the help! The change has been merged.

josis-silver · 2022-03-11T12:04:57Z

Just wanted to add: I have the same problem with several datasets and models, it even seems to happen more frequently recently and for us changing the input data manually is not an option. If there is any chance that this fix could be merged soon, that would be awesome, thanks!

slundberg · 2022-03-23T02:33:41Z

Hey! Sorry to be out of the loop here for a while (was out after paternity leave for a while). The changes and test look great thanks @lrjball and @TheZL !

I am running a new CI pipeline to make sure things work as expected, then we can merge and get this in 0.41.

daviskirk · 2022-05-13T16:36:28Z

I just checked the failing tests on 3.8 because I thought I might be able to help push this over the finish line.
But as far as I can tell almost all of them are request failures that might just be because of random connection issues... perhaps they would just work on a rerun?

codecov · 2022-06-15T17:21:03Z

Codecov Report

Merging #2162 (2cfa489) into master (854426a) will increase coverage by 0.00%.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##           master    #2162   +/-   ##
=======================================
  Coverage   51.51%   51.52%           
=======================================
  Files          90       90           
  Lines       13116    13118    +2     
=======================================
+ Hits         6757     6759    +2     
  Misses       6359     6359

Impacted Files              Coverage Δ
shap/explainers/_tree.py    69.32% <100.00%> (+0.05%) ⬆️

TheZL added 2 commits September 2, 2021 10:19

remove lstrip(b'binf') from XGBTreeModelLoader

81da34e

Update _tree.py

91e3b8c

TheZL changed the title ~~remove lstrip(b'binf') from XGBTreeModelLoader~~ Fixing incorrect model parse when XGB buf starts with 'binff' or 'binfn' Sep 7, 2021

stang2424 mentioned this pull request Sep 7, 2021

Fixes error with XGBoost 1.1 #1220

Merged

lrjball reviewed Dec 24, 2021

Update code to be more succinctly

c183e05

lrjball and others added 2 commits February 28, 2022 20:49

Merge branch 'master' into xgbmodel_buffer_lstrip_error_correction

6795f73

Added test for buffer strip update

f1318e3

Merge branch 'master' into xgbmodel_buffer_lstrip_error_correction

0fd465d

Merge branch 'master' into xgbmodel_buffer_lstrip_error_correction

2cfa489

slundberg merged commit 4921c50 into shap:master Jun 15, 2022
