sklearn

Version 1.2.0

In Development

Changed models

The following estimators and functions, when fit with the same data and parameters, may produce different models from the previous version. This often occurs due to changes in the modelling logic (bug fixes or enhancements), or in random sampling procedures.

Changelog

`sklearn.cluster`

The predict and fit_predict methods of cluster.OPTICS now accept sparse data type for input data. 14736 by Hunt Zhan <huntzhan>, 20802 by Brandon Pokorny <Clickedbigfoot>, and 22965 by Meekail Zain <micky774>.
cluster.Birch now preserves dtype for numpy.float32 inputs. 22968 by Meekail Zain <micky774>.

`sklearn.datasets`

Introduce the new parameter parser in datasets.fetch_openml. parser="pandas" allows to use the very CPU and memory efficient pandas.read_csv parser to load dense ARFF formatted dataset files. It is possible to pass parser="liac-arff" to use the old LIAC parser. When parser="auto", dense datasets are loaded with "pandas" and sparse datasets are loaded with "liac-arff". Currently, parser="liac-arff" by default and will change to parser="auto" in version 1.4 21938 by Guillaume Lemaitre <glemaitre>.

`sklearn.ensemble`

Improve runtime performance of ensemble.IsolationForest by avoiding data copies. 23252 by Zhehao Liu <MaxwellLZH>.

`sklearn.metrics`

class_likelihood_ratios is added to compute the positive and negative likelihood ratios derived from the confusion matrix of a binary classification problem. 22518 by Arturo Amor <ArturoAmorQ>.

`sklearn.neighbors`

neighbors.KernelDensity bandwidth parameter now accepts definition using Scott's and Silvermann's estimation methods. 10468 by Ruben <icfly2> and 22993 by Jovan Stojanovic <jovan-stojanovic>.
Adds new function neighbors.sort_graph_by_row_values to sort a CSR sparse graph such that each row is stored with increasing values. This is useful to improve efficiency when using precomputed sparse distance matrices in a variety of estimators and avoid an EfficiencyWarning. 23139 by Tom Dupre la Tour.

`sklearn.tree`

Fixed invalid memory access bug during fit in tree.DecisionTreeRegressor and tree.DecisionTreeClassifier. 23273 by Thomas Fan.

`sklearn.utils`

utils.extmath.randomized_svd now accepts an argument, lapack_svd_driver, to specify the lapack driver used in the internal deterministic SVD used by the randomized SVD algorithm. 20617 by Srinath Kailasa <skailasa>

Code and Documentation Contributors

Thanks to everyone who has contributed to the maintenance and improvement of the project since version 1.1, including:

TODO: update at the time of the release.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v1.2.rst

v1.2.rst

Version 1.2.0

Changed models

Changelog

`sklearn.cluster`

`sklearn.datasets`

`sklearn.ensemble`

`sklearn.metrics`

`sklearn.neighbors`

`sklearn.tree`

`sklearn.utils`

Code and Documentation Contributors

Files

v1.2.rst

Latest commit

History

v1.2.rst

File metadata and controls

Version 1.2.0

Changed models

Changelog

sklearn.cluster

sklearn.datasets

sklearn.ensemble

sklearn.metrics

sklearn.neighbors

sklearn.tree

sklearn.utils

Code and Documentation Contributors

`sklearn.cluster`

`sklearn.datasets`

`sklearn.ensemble`

`sklearn.metrics`

`sklearn.neighbors`

`sklearn.tree`

`sklearn.utils`