Enhance docstring for LinearRegression.fit #28741

miguelcsilva · 2024-04-01T14:06:22Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Changes the docstring to include all types that can be used for the sample_weight parameter, as well as explaining the user what happens in each type.

github-actions · 2024-04-01T14:07:56Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: 54c32de. Link to the linter CI: here}

betatim · 2024-04-02T09:34:21Z

sklearn/linear_model/_base.py

@@ -560,8 +560,15 @@ def fit(self, X, y, sample_weight=None):
        y : array-like of shape (n_samples,) or (n_samples, n_targets)
            Target values. Will be cast to X's dtype if necessary.

-        sample_weight : array-like of shape (n_samples,), default=None
-            Individual weights for each sample.
+        sample_weight : array-like of shape (n_samples,), int, float or None (default)


This seems more consistent with how other docstrings are formatted

Suggested change

sample_weight : array-like of shape (n_samples,), int, float or None (default)

sample_weight : array-like of shape (n_samples,), int or float, default=None

I could only find None (default) in the actual text explaining a parameter, not in the "headline"

betatim · 2024-04-02T09:36:31Z

sklearn/linear_model/_base.py

+            - `int` or `float`: all samples have a weight equal to the value \
+provided. Since there is no difference in the relative weight between samples, \
+results in the same fitted model as when `sample_weight=None`.


I'd go for something concise like

Suggested change

- `int` or `float`: all samples have a weight equal to the value \

provided. Since there is no difference in the relative weight between samples, \

results in the same fitted model as when `sample_weight=None`.

- `int` or `float`: all samples have a weight equal to the value provided.

I wonder if we should explain to the user what exact effect (no effect) this has, we don't go into that for array-like either. So maybe we can skip it.

betatim · 2024-04-02T09:37:04Z

Thanks for the PR fixing this! I left two small comments about formatting and level of detail

lorentzenchr · 2024-04-02T21:34:17Z

While this PR add technically correct statements, I'm not so sure to include it. Also, if we start here, do we go through all estimators to adapt the description?

jeremiedbb · 2024-04-03T10:17:20Z

I would not add the detailed paragraph either. But I'd still fix the valid types. And I think we should in all places where it applies (floats are not always supported), in other PRs though.

jeremiedbb · 2024-04-03T10:19:00Z

sklearn/linear_model/_base.py

@@ -560,8 +560,15 @@ def fit(self, X, y, sample_weight=None):
        y : array-like of shape (n_samples,) or (n_samples, n_targets)
            Target values. Will be cast to X's dtype if necessary.

-        sample_weight : array-like of shape (n_samples,), default=None
-            Individual weights for each sample.
+        sample_weight : array-like of shape (n_samples,), int, float or None (default)


no need to have float and int. float covers both. In the few places where we already mention float as valid type, it comes first. Let's keep the same formating for consistency.

Suggested change

sample_weight : array-like of shape (n_samples,), int, float or None (default)

sample_weight : float or array-like of shape (n_samples,), default=None

betatim · 2024-04-03T12:16:55Z

The discussion on whether or not to do this is happening in #28732

miguelcsilva added 2 commits April 1, 2024 14:57

Enhance docstring LinearRegression.fit

bce2e59

Reword docstring

58b1bd7

github-actions bot added the module:linear_model label Apr 1, 2024

betatim reviewed Apr 2, 2024

View reviewed changes

jeremiedbb reviewed Apr 3, 2024

View reviewed changes

lorentzenchr added the Needs Decision Requires decision label Apr 5, 2024

Address PR comments

54c32de

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhance docstring for LinearRegression.fit #28741

Enhance docstring for LinearRegression.fit #28741

miguelcsilva commented Apr 1, 2024

github-actions bot commented Apr 1, 2024 •

edited

betatim Apr 2, 2024

betatim Apr 2, 2024

betatim commented Apr 2, 2024

lorentzenchr commented Apr 2, 2024

jeremiedbb commented Apr 3, 2024

jeremiedbb Apr 3, 2024

betatim commented Apr 3, 2024

	sample_weight : array-like of shape (n_samples,), int, float or None (default)
	sample_weight : array-like of shape (n_samples,), int or float, default=None

	sample_weight : array-like of shape (n_samples,), int, float or None (default)
	sample_weight : float or array-like of shape (n_samples,), default=None

Enhance docstring for LinearRegression.fit #28741

Are you sure you want to change the base?

Enhance docstring for LinearRegression.fit #28741

Conversation

miguelcsilva commented Apr 1, 2024

Reference Issues/PRs

What does this implement/fix? Explain your changes.

github-actions bot commented Apr 1, 2024 • edited

✔️ Linting Passed

betatim Apr 2, 2024

Choose a reason for hiding this comment

betatim Apr 2, 2024

Choose a reason for hiding this comment

betatim commented Apr 2, 2024

lorentzenchr commented Apr 2, 2024

jeremiedbb commented Apr 3, 2024

jeremiedbb Apr 3, 2024

Choose a reason for hiding this comment

betatim commented Apr 3, 2024

github-actions bot commented Apr 1, 2024 •

edited