sklearn
In Development
The following estimators and functions, when fit with the same data and parameters, may produce different models from the previous version. This often occurs due to changes in the modelling logic (bug fixes or enhancements), or in random sampling procedures.
- The predict and fit_predict methods of
cluster.OPTICS
now accept sparse data type for input data.14736
byHunt Zhan <huntzhan>
,20802
byBrandon Pokorny <Clickedbigfoot>
, and22965
byMeekail Zain <micky774>
. cluster.Birch
now preserves dtype for numpy.float32 inputs.22968
by Meekail Zain <micky774>.
- Introduce the new parameter parser in
datasets.fetch_openml
. parser="pandas" allows to use the very CPU and memory efficient pandas.read_csv parser to load dense ARFF formatted dataset files. It is possible to pass parser="liac-arff" to use the old LIAC parser. When parser="auto", dense datasets are loaded with "pandas" and sparse datasets are loaded with "liac-arff". Currently, parser="liac-arff" by default and will change to parser="auto" in version 1.421938
byGuillaume Lemaitre <glemaitre>
.
- Improve runtime performance of
ensemble.IsolationForest
by avoiding data copies.23252
byZhehao Liu <MaxwellLZH>
.
class_likelihood_ratios
is added to compute the positive and negative likelihood ratios derived from the confusion matrix of a binary classification problem.22518
byArturo Amor <ArturoAmorQ>
.
neighbors.KernelDensity
bandwidth parameter now accepts definition using Scott's and Silvermann's estimation methods.10468
byRuben <icfly2>
and22993
byJovan Stojanovic <jovan-stojanovic>
.- Adds new function
neighbors.sort_graph_by_row_values
to sort a CSR sparse graph such that each row is stored with increasing values. This is useful to improve efficiency when using precomputed sparse distance matrices in a variety of estimators and avoid an EfficiencyWarning.23139
by Tom Dupre la Tour.
- Fixed invalid memory access bug during fit in
tree.DecisionTreeRegressor
andtree.DecisionTreeClassifier
.23273
by Thomas Fan.
utils.extmath.randomized_svd
now accepts an argument, lapack_svd_driver, to specify the lapack driver used in the internal deterministic SVD used by the randomized SVD algorithm.20617
bySrinath Kailasa <skailasa>
Thanks to everyone who has contributed to the maintenance and improvement of the project since version 1.1, including:
TODO: update at the time of the release.