
Adding unit test to cover ties/duplicate x values in Isotonic Regression... #4185

Conversation

@mjbommar (Contributor)

Unit test to highlight regression in issue #4184

@mjbommar (Contributor Author)

Small note: Travis failures are intended, as this is meant to cover the issue highlighted in #4184.

@amueller amueller added the Bug label Jan 30, 2015
@amueller amueller added this to the 0.16 milestone Jan 30, 2015
@amueller (Member)

I think we should maybe hard-code what the expected result is. In master, neither fit_transform nor fit followed by transform gives the correct one, right?

@mjbommar (Contributor Author)

Hi @amueller , yes, great suggestion. I used the examples from the R isotone package and the de Leeuw et al. JSS paper as a base, and have committed expanded unit tests based on them.

  1. fit_transform matches the results of gpava(ties="primary"); fit followed by transform does not (a small consistency sketch follows this list).
  2. gpava() lets the user choose a tie-handling method; should we dock to one of these options, e.g. "primary", until we decide whether to let the user specify it in sklearn as well? See p. 14 in the JSS paper linked above.
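A minimal sketch (with hypothetical data, not the JSS example) of the consistency property these tests exercise: on inputs with duplicate x values, fit_transform and fit followed by transform should agree, and on the versions discussed here the assertion below fails.

import numpy as np
from numpy.testing import assert_array_almost_equal
from sklearn.isotonic import IsotonicRegression

x = np.array([1, 2, 2, 2, 3, 4], dtype=float)   # ties at x == 2
y = np.array([1, 5, 3, 2, 4, 6], dtype=float)

ir = IsotonicRegression()
y_fit_transform = ir.fit_transform(x, y)
y_fit_then_transform = ir.fit(x, y).transform(x)

# The two code paths should produce the same fitted values on the training x.
assert_array_almost_equal(y_fit_transform, y_fit_then_transform)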


def test_isotonic_regression_ties_primary_fit_transform():
    """
    Test isotonic regression fit_transform against the "primary" ties method
Review comment (Member):

2 spaces after transform, but besides that LGTM if travis is happy :)

thanks @mjbommar

Reply (Contributor Author):

oops, fixed now, thanks.

travis was happy other than these 3 failures:

======================================================================
FAIL: sklearn.tests.test_isotonic.test_isotonic_regression_ties_min
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/nose/case.py", line 197, in runTest
    self.test(*self.arg)
  File "/data/workspace/scikit-learn/sklearn/tests/test_isotonic.py", line 92, in test_isotonic_regression_ties_min
    assert_array_equal(ir.fit(x, y).transform(x), ir.fit_transform(x, y))
  File "/usr/local/lib/python2.7/dist-packages/numpy/testing/utils.py", line 739, in assert_array_equal
    verbose=verbose, header='Arrays are not equal')
  File "/usr/local/lib/python2.7/dist-packages/numpy/testing/utils.py", line 665, in assert_array_compare
    raise AssertionError(msg)
AssertionError: 
Arrays are not equal

(mismatch 28.5714285714%)
 x: array([ 0.,  0.,  0.,  3.,  4.,  5.,  6.])
 y: array([ 0.,  1.,  2.,  3.,  4.,  5.,  6.])

======================================================================
FAIL: sklearn.tests.test_isotonic.test_isotonic_regression_ties_max
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/nose/case.py", line 197, in runTest
    self.test(*self.arg)
  File "/data/workspace/scikit-learn/sklearn/tests/test_isotonic.py", line 103, in test_isotonic_regression_ties_max
    assert_array_equal(ir.fit(x, y).transform(x), ir.fit_transform(x, y))
  File "/usr/local/lib/python2.7/dist-packages/numpy/testing/utils.py", line 739, in assert_array_equal
    verbose=verbose, header='Arrays are not equal')
  File "/usr/local/lib/python2.7/dist-packages/numpy/testing/utils.py", line 665, in assert_array_compare
    raise AssertionError(msg)
AssertionError: 
Arrays are not equal

(mismatch 33.3333333333%)
 x: array([ 1.,  2.,  3.,  4.,  0.,  0.])
 y: array([ 1.,  2.,  3.,  4.,  5.,  6.])

======================================================================
FAIL: Test isotonic regression fit, transform against the "primary" ties method
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/nose/case.py", line 197, in runTest
    self.test(*self.arg)
  File "/data/workspace/scikit-learn/sklearn/tests/test_isotonic.py", line 134, in test_isotonic_regression_ties_primary_fit
    assert_array_equal(ir.transform(x), y_true)
  File "/usr/local/lib/python2.7/dist-packages/numpy/testing/utils.py", line 739, in assert_array_equal
    verbose=verbose, header='Arrays are not equal')
  File "/usr/local/lib/python2.7/dist-packages/numpy/testing/utils.py", line 665, in assert_array_compare
    raise AssertionError(msg)
AssertionError: 
Arrays are not equal

(mismatch 100.0%)
 x: array([ 0.,  0.,  0.,  0.,  0.,  0.,  0.,  0.,  0.,  0.,  0.])
 y: array([ 21.   ,  22.375,  22.375,  22.375,  22.375,  22.375,  22.375,
        22.375,  22.375,  23.5  ,  25.   ])

----------------------------------------------------------------------
Ran 24 tests in 0.041s

FAILED (failures=3)

@agramfort (Member)

travis still complains :(

@mjbommar (Contributor Author) commented Feb 1, 2015

@agramfort , the purpose of these unit tests is to highlight a current issue that exists in 0.15.2 and 0.16-dev. The three failures that are occurring in travis are intended to fail :)

assert_array_equal(ir.transform(x), y_true)


def test_isotonic_regression_ties_primary_fit_transform():
Review comment (Member):

I would put the two in the same test, I think.

Reply (Contributor Author):

Done!

@amueller (Member) commented Feb 1, 2015

Maybe @NelleV has time to have a look. I'll have a look next week. Specifying the tie-breaking mechanism with as backward compatible a default as possible would be nice.
We might need to refactor a bit to fix #2507, too. (Currently multiple zero sample weights in a row give an infinite loop).

@agramfort (Member)

> @agramfort , the purpose of these unit tests is to highlight a current issue that exists in 0.15.2 and 0.16-dev. The three failures that are occurring in travis are intended to fail :)

arrfff ... sorry for the noise.

maybe @fabianp can have a look too.

@GaelVaroquaux (Member)

Our policy is not to commit failing tests to master without the fix. The reason is to always be able to keep master green. If master is not green, it has a detrimental psychological effect on quality.

@mjbommar (Contributor Author) commented Feb 1, 2015

@GaelVaroquaux , understood; just trying to help whoever picks up issue #4184.

@amueller (Member) commented Feb 3, 2015

I think we should add a ties= option and maybe just use a naive implementation of fit_predict, that is, just remove the method. That would also make masking zero-sample-weight points super easy.

@mjbommar (Contributor Author) commented Feb 3, 2015

@amueller, +1 from my perspective. In my experience, the "secondary" and "tertiary" options described in the JSS paper are more useful than "primary" given that "primary" does not necessarily produce bijective mappings.

Small chance I might be able to do this in the next week or two. Any opposition to reworking the cython source along these lines:
https://github.com/cran/isotone/blob/master/R/gpava.R#L36

@amueller (Member) commented Feb 3, 2015

I have no detailed knowledge and I think @NelleV or @agramfort might need to weigh in.

@agramfort (Member)

no opposition

@NelleV (Member) commented Feb 4, 2015

I think it is a good idea to have different ways to deal with ties. Go for it!

@mjbommar (Contributor Author) commented Feb 4, 2015

OK, here are some thoughts; want to make sure that my particular use cases are not leading me astray.

  1. From a syntax perspective, how do we distinguish between case A, the monotonic optimization of a specific sample (X, y), and case B, the use of a monotonic regression function constructed from that optimization result? For example, fit_transform need not concern itself with an interpolant, but transform on out-of-sample data does (and predict is simply a call to transform, but we have no fit_predict, so fit_transform and fit_predict may not return the same result; see point 2 below). A sketch after this list illustrates the distinction.
  2. For case B above (i.e., predict), the simplest thing to do is to construct a piece-wise constant interpolant from the optimization of the specific sample used in fit. However, we've been using "linear" or "slinear" calls to interp1d, resulting in a piece-wise linear interpolant; depending on what fit()'s X and y look like (i.e., what self.X_ and self.y_ are), these may give radically different or "broken" answers (as we are seeing for some tied values at the left or right endpoint).
  3. My gut instinct at this point is that we will "break" things further from a backward-compatibility perspective in order to "fix" these issues. How do we want to go about handling this? Another question might be: what is the regression? That fit_transform and fit -> transform are different, or that our syntax is not consistent with the concepts?
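A small sketch (hypothetical data, not a proposal for the final API) of the case A / case B distinction from point 1: case A is the in-sample optimization returned by fit_transform, with no interpolation involved; case B goes through the fitted interpolant, so the choice of piece-wise constant vs. linear interpolation only matters there.

import numpy as np
from sklearn.isotonic import IsotonicRegression

x = np.array([1, 2, 3, 4, 5], dtype=float)
y = np.array([1, 3, 2, 5, 4], dtype=float)

ir = IsotonicRegression()
y_insample = ir.fit_transform(x, y)          # case A: optimization of this sample only
x_new = np.array([1.5, 3.5], dtype=float)
y_out_of_sample = ir.predict(x_new)          # case B: evaluated via the interpolant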

@amueller amueller mentioned this pull request Feb 4, 2015
@mjbommar (Contributor Author)

@NelleV or @agramfort, if you have any thoughts about the question above, I would have some time to put towards a fix this week.

@agramfort (Member)

no bandwidth to look into it :(

@ogrisel (Member) commented Feb 26, 2015

I just ran those 3 tests against 0.15.2 and they also fail there: the first two tests (test_isotonic_regression_ties_min / max) yield the same failures, while the last one used to output NaNs, which has been fixed in scikit-learn master:

https://gist.github.com/ogrisel/676f7d582600036efa60

So it seems that the test_isotonic_regression_ties_min / max tests do not properly qualify the regression you initially reported, @mjbommar. Can you please confirm that you reproduce the same test failures on sklearn 0.15.2?

The last test, namely test_isotonic_regression_ties_primary_, highlights a bug in our code in my opinion: the output is all zeros on master instead of being in the range of 22-something... This is really weird.

@ogrisel (Member) commented Feb 26, 2015

Note that the fit_transform method returns the expected y_true. There is something wrong in our implementation of transform.

@amueller (Member)

Wait, when I last looked at it it was the other way around ... ?!

@amueller (Member)

Oh no, you are right, transform was the problem.

@amueller (Member)

I also want to highlight again issue #2507, which can cause infinite loops:
#2507 (comment)

@ogrisel (Member) commented Feb 26, 2015

> I also want to highlight again issue #2507, which can cause infinite loops:
> #2507 (comment)

I created a dedicated issue for this (#4297) as it does not seem to be the same bug as the original report in #2507.

@mjbommar (Contributor Author)

@ogrisel , the discussion here was unfortunately split between issue #4184 and this PR. You can see that we had confirmed the failures in 0.15.2 as well in #4184; in other words, this is a regression that goes back some ways.

While it's been a few weeks since I spent time thinking about it, I believe my comment here is the best synopsis of the ways forward: #4185 (comment)

@ogrisel (Member) commented Feb 26, 2015

Thanks.

> While it's been a few weeks since I spent time thinking about it, I believe my comment here is the best synopsis of the ways forward: #4185 (comment)

This does not explain the all-zeros output I get when running test_isotonic_regression_ties_primary_ with slinear, right? I still need to investigate further to understand.

@mjbommar (Contributor Author)

@ogrisel , agreed on 0's.

My line of inquiry led me to question what we meant by "expected" results in general. Our implementation is only a very narrow way of looking at the problem and may be too naive. These unit tests were meant to dock us to the corresponding R package released by the publication authors.

For example, why is "slinear" the default and not a piece-wise constant interpolant? Also, should transform and predict necessarily return the same result? I think we can agree that fit_transform and fit, transform should.
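For illustration only (hypothetical knot values, not a claim about what scikit-learn should default to), here is the difference between the two interpolant families being discussed, built directly with interp1d:

import numpy as np
from scipy.interpolate import interp1d

x_knots = np.array([8., 10., 12., 14.])
y_knots = np.array([21., 22.375, 22.375, 23.5])

f_step = interp1d(x_knots, y_knots, kind="zero")        # piece-wise constant (zeroth-order spline)
f_linear = interp1d(x_knots, y_knots, kind="slinear")   # piece-wise linear (first-order spline)

x_new = np.array([9., 11., 13.])
print(f_step(x_new))     # values held constant between knots
print(f_linear(x_new))   # values interpolated linearly between knots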

@amueller (Member)

Here is a notebook that illustrates everything that goes horribly wrong in interp1d:
http://nbviewer.ipython.org/gist/amueller/df9a8a7da67c1c4556cf

  1. when called at the exact input data, slinear gives zeros (looks like a scipy bug)
  2. on the lower tie, linear gives a NaN

I would be ok with having any valid tie-breaking strategy, but that doesn't seem to be reasonable.
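A minimal probe (independent of the notebook above, with hypothetical values) of the interp1d behaviour described in points 1 and 2; the exact outcome depends on the scipy version, and newer releases may reject non-unique x outright instead of returning odd values.

import numpy as np
from scipy.interpolate import interp1d

x = np.array([8., 8., 8., 10., 12., 14.])     # ties at the lower end
y = np.array([21., 22., 23., 22.375, 22.375, 23.5])

for kind in ("slinear", "linear"):
    try:
        f = interp1d(x, y, kind=kind, bounds_error=False)
        print(kind, f(x))                      # evaluate at the exact input data
    except ValueError as exc:                  # newer scipy may refuse duplicate x
        print(kind, "raised:", exc)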

@amueller (Member)

With just

x = [  8., 10.,  12.,  14.]
y = [ 21.   ,  22.375,  22.375,  23.5]

as input, everything looks good fyi.

@amueller (Member)

Proposed solution: report / ask scipy people, implement our own duplicate removal strategy.

@mjbommar I agree there is much room for different strategies. I think the sklearn developers don't have that many uses outside of calibration, where duplicate values are basically non-existent.
I also don't know about nearest vs linear interpolation. If you have insights, I think we are pretty open to improvements.
Actually, I am quite confused by the presence of both transform and predict.
I think this API is a bit weird, and I think the reason is that this is usually applied to the labels, not the data.

@amueller (Member)

@mjbommar did you already put any thought into how to implement the tie-breaking?

@mjbommar (Contributor Author)

@amueller , my perspective was that we should try to dock our approach to the publication authors'. While they implement three approaches in their CRAN package, we could simply pick one; some have a means of breaking ties in the input sample.

That said, my point above about the difference between transform and predict will remain. The sample optimization problem (i.e., fit_transform) is much simpler, as it doesn't involve a choice of interpolant, i.e., no need to use interp1d.

@amueller (Member)

I agree about using one of their approaches, I was more asking if you looked into implementing any of them. I'll have a closer look at the paper now...

@amueller (Member)

I think we need to implement the secondary approach. With the primary approach, we cannot get fit_transform to be the same as transform. The same is true for the tertiary approach, if I understand it correctly.

@amueller (Member)

Ugh, it looks like there is also an(other) issue with sample_weight: it does not seem to be reordered in https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/isotonic.py#L253 ...

@amueller (Member)

@mjbommar what do you think about secondary tie-breaking as the default?

@amueller (Member)

Another question: What does it mean to predict on new data using the primary method? What would you predict for a point with ties?

@mjbommar (Contributor Author)

@amueller , I think the choice of tie-breaking is tied to whether we support predict or not. As you are seeing, if the resulting sequence is not one-to-one, then interpolants won't make sense.

Since the client uncovered this issue, my first approach had been to replace the interp1d interpolant with a left-hand piece-wise constant interpolant. This handles the ties OK. Even better from an application perspective was to take the mean/median of the tied values and substitute that into the original input (optionally updating the sample weights).

It sounds like we are seeing the same issues now :)
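A rough sketch of the "average the tied values" idea described above (collapse_ties is a hypothetical helper for illustration, not scikit-learn API and not the eventual implementation): collapse duplicate x values into a single point carrying the weighted mean of their y values and the sum of their sample weights.

import numpy as np

def collapse_ties(x, y, sample_weight=None):
    # Collapse duplicate x values: weighted mean of y, summed sample weights.
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    w = np.ones_like(y) if sample_weight is None else np.asarray(sample_weight, dtype=float)
    order = np.argsort(x, kind="mergesort")
    x, y, w = x[order], y[order], w[order]
    x_unique, inverse = np.unique(x, return_inverse=True)
    w_sum = np.bincount(inverse, weights=w)
    y_mean = np.bincount(inverse, weights=w * y) / w_sum
    return x_unique, y_mean, w_sum

x_u, y_u, w_u = collapse_ties([1., 2., 2., 3.], [1., 4., 2., 5.])
# x_u -> [1., 2., 3.], y_u -> [1., 3., 5.], w_u -> [1., 2., 1.]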

@amueller (Member)

Ok. The secondary strategy actually takes the mean and supports predict. I think that is a reasonable choice.

@mjbommar (Contributor Author)

@amueller , yes, perfect.

I would still be happy to expand the methods to support other interpolants and primary/tertiary from gpava, but perhaps we can push that discussion off until the regressions are resolved.

@amueller (Member)

I agree, it would be nice to have, and you are welcome to work on it.

@ogrisel (Member) commented Feb 28, 2015

@mjbommar please avoid merging master into the PR branch; instead, squash old uninformative commits to clean up the history and rebase on top of the current master.

@mjbommar (Contributor Author)

@ogrisel , sorry, but I did this since @amueller cherry-picked my commits for his fix PR #4302. I will close this PR for review and we can continue the conversation in the active PR, #4302.

@mjbommar mjbommar closed this Feb 28, 2015
@ogrisel (Member) commented Feb 28, 2015

Ok no pbm.
