Replace grid_resolution np.inf value with large number #940

mauicv · 2023-07-04T10:05:04Z

What is this

Fixes CI failing due to sklearn parameter validation in partial_dependence function

Sklearn has added parameter validation for partial_dependence here which we use in the tests for our partial dependence implementation. This means the ci is failing as we set grid_resolution to np.inf where now Sklearn expects that parameter to be less than infinity.

This PR just replaces np.inf in the relevant tests to a large number. This is to check if the test suite failed elsewhere. This is still a work in progress.

codecov · 2023-07-04T10:56:10Z

Codecov Report

Merging #940 (e88d4c8) into master (c41b6d8) will decrease coverage by 0.05%.
The diff coverage is n/a.

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #940      +/-   ##
==========================================
- Coverage   85.28%   85.23%   -0.05%     
==========================================
  Files          74       74              
  Lines        8832     8832              
==========================================
- Hits         7532     7528       -4     
- Misses       1300     1304       +4

see 4 files with indirect coverage changes

RobertSamoilescu · 2023-07-06T09:13:28Z

The inf value was used to avoid the division of the categorical feature axis into equidistant bins controlled by the grid_resolution parameter. In principle, any value bigger than the cardinal of all categorical feature values should work.

At that time, sklearn was supporting only numerical features, and using the inf value was a way around to support categorical features as well. Note that at the moment, sklearn included support for categorical features (see PR here). Thus, I would recommend to use this new feature instead of the large number, only if we don't run into any dependency problems.

mauicv · 2023-07-06T14:27:54Z

I've updated the sklearn partial_dependence call to use categorical_features instead of grid_resolution. This fixes the issue for python >= 3.8 but sklearn 1.3.0 is not supported for python == 3.7 so the categorical_features functionality has not been added and this introduces an error. I've added some logic so that the test checks for which sklearn version we're using.

Note: Python 3.7 is technically end-of-life so we should drop support for it (see here). If we do so we could remove the above changes. It's perhaps worth leaving them in case we're testing on supported python versions with sklearn<=-1.3.0.

ascillitoe · 2023-07-06T17:11:18Z

Ci failing due to #943

ascillitoe · 2023-07-07T08:57:31Z

I'm in favour of the second option i.e. keeping the conditional sklearn version behaviour in to support running tests with a new Python version but older sklearn version.

alibi/explainers/tests/test_partial_dependence.py

Replace grid_resolution np.inf value with large number

bb00413

mauicv added the WIP This PR is a Work in Progress label Jul 4, 2023

mauicv requested a review from RobertSamoilescu July 4, 2023 10:05

Pass categorical_names to sklearn partial_dependency

329132d

mauicv added 2 commits July 6, 2023 15:28

Make test beahvour depend on sklearn version

ddf3594

Fix linting error

a42f01e

ascillitoe self-requested a review July 6, 2023 16:49

Bump shap version

e88d4c8

ascillitoe reviewed Jul 7, 2023

View reviewed changes

alibi/explainers/tests/test_partial_dependence.py Show resolved Hide resolved

ascillitoe mentioned this pull request Jul 7, 2023

Update shap requirement from <0.42.0,>=0.40.0 to >=0.40.0,<0.43.0 #944

Merged

Merge branch 'master' into bugfix/pd-grid-res-fix

65f7c74

ascillitoe merged commit edd11aa into SeldonIO:master Jul 7, 2023
9 of 11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace grid_resolution np.inf value with large number #940

Replace grid_resolution np.inf value with large number #940

mauicv commented Jul 4, 2023

codecov bot commented Jul 4, 2023 •

edited

RobertSamoilescu commented Jul 6, 2023

mauicv commented Jul 6, 2023 •

edited

ascillitoe commented Jul 6, 2023

ascillitoe commented Jul 7, 2023

Replace grid_resolution np.inf value with large number #940

Replace grid_resolution np.inf value with large number #940

Conversation

mauicv commented Jul 4, 2023

What is this

codecov bot commented Jul 4, 2023 • edited

Codecov Report

RobertSamoilescu commented Jul 6, 2023

mauicv commented Jul 6, 2023 • edited

ascillitoe commented Jul 6, 2023

ascillitoe commented Jul 7, 2023

codecov bot commented Jul 4, 2023 •

edited

mauicv commented Jul 6, 2023 •

edited