You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Scikit-learn version 1.2.0 enforces two API changes that currently break tests.
LinearRegression no longer supports the normalize keyword argument, which some of the tests use.
Fix should theoretically be rather simple since it is just replacing LinearRegression with a Pipeline object with a StandardScaler preprocessing step.
Scikit-learn now enforces strict column name restrictions.
First, all columns must be of the same type, and second, column names should match between fit and predict.
This might require a solution of larger breadth.
The first part will require a "safe join" that is column-name-type aware and replace all the instances we join covariate X with treatment assignment a.
The second part require to validate column-names are consistent/preserved when new data is inputted. Which might be mostly in the time-pooled survival models where a time range is artificially created and placed as a predictor.
A slightly more minor exception was also raised with Numpy v1.24.0. Throwing a TypeError: ufunc 'isfinite' not supported for the input types exception when generating calibration plots calls matplotlib's fill_between call that fails.
Need to dig deeper into that and whether that's a causallib problem (providing bad fill values) or some external matplotlib-numpy mismatch.
In the meantime, PR #50 limited the allowed dependency versions.
The text was updated successfully, but these errors were encountered:
Scikit-learn version 1.2.0 enforces two API changes that currently break tests.
LinearRegression
no longer supports thenormalize
keyword argument, which some of the tests use.Fix should theoretically be rather simple since it is just replacing
LinearRegression
with aPipeline
object with aStandardScaler
preprocessing step.First, all columns must be of the same type, and second, column names should match between
fit
andpredict
.This might require a solution of larger breadth.
The first part will require a "safe join" that is column-name-type aware and replace all the instances we join covariate
X
with treatment assignmenta
.The second part require to validate column-names are consistent/preserved when new data is inputted. Which might be mostly in the time-pooled survival models where a time range is artificially created and placed as a predictor.
A slightly more minor exception was also raised with Numpy v1.24.0. Throwing a
TypeError: ufunc 'isfinite' not supported for the input types
exception when generating calibration plots calls matplotlib'sfill_between
call that fails.Need to dig deeper into that and whether that's a causallib problem (providing bad fill values) or some external matplotlib-numpy mismatch.
In the meantime, PR #50 limited the allowed dependency versions.
The text was updated successfully, but these errors were encountered: