
Attempt to deflake TestFnCacheSanity #10250

Merged 1 commit into master on Feb 15, 2022
Conversation

@zmb3 (Collaborator) commented Feb 9, 2022

Convert approxReads to an integer (by truncating) before comparing
to actual reads.

This should prevent failures where, due to our approximation, we estimate
a fractional number of reads that exceeds our tolerance of 1.

Sample error: Max difference between 10.461059975 and 9 allowed is 1, but difference was 1.4610599749999995

Updates #9492

@espadolini (Contributor)

Isn't this effectively just bumping the tolerance to 2?

@zmb3 (Collaborator, Author) commented Feb 9, 2022

> Isn't this effectively just bumping the tolerance to 2?

In a way, yes. I've never seen it fail with anything larger than 1.4-1.5, so I think of it not so much as increasing the tolerance as making sure our "approximation" is a whole number.

We could even round instead of truncate to be a bit more accurate, but this isn't an exact science; it's an approximation that is bound to be flaky.

@espadolini (Contributor)

We could just increase the delta in InDelta; that way we'd still keep the full-precision info when the difference exceeds the threshold, and we could bump the tolerance to 1.5 or 1.75 without being limited to integer steps.
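The alternative suggested here, sketched with the sample numbers from the failure above (the inDelta helper is a hypothetical stand-in for testify's require.InDelta check, which takes the tolerance as its final argument):

```go
package main

import (
	"fmt"
	"math"
)

// inDelta mirrors the comparison behind testify's require.InDelta:
// it reports whether |expected - actual| <= delta.
func inDelta(expected, actual, delta float64) bool {
	return math.Abs(expected-actual) <= delta
}

func main() {
	approxReads := 10.461059975 // full-precision estimate from the failing run
	actualReads := 9.0

	// Keep the estimate as-is and widen the tolerance instead:
	// the 1.461... difference fails at delta=1 but passes at 1.5 and 1.75.
	for _, delta := range []float64{1, 1.5, 1.75} {
		fmt.Printf("delta=%.2f: within=%v\n", delta, inDelta(approxReads, actualReads, delta))
	}
}
```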

@rosstimothy (Contributor)

One thing to note about the sudden flakiness of this test: I changed the FnCache to use a clockwork.Clock in 5279edf instead of calling time.Now directly, as part of #9958.

@jakule (Contributor) commented Feb 10, 2022

I agree with @espadolini on that. I think that increasing the tolerance is more explicit than casting to int.

> We could even round instead of truncate to be a bit more accurate, but this isn't an exact science; it's an approximation that is bound to be flaky.

Maybe I misunderstood your comment, but casting or rounding doesn't really change the value. Casting "removes" the fractional part of the number, but that doesn't change the nature of the problem: a value that is too large will still fail the test, and that value depends on execution time.

If we want to make that test more reliable, I think we should leverage the fact that we can now use a clockwork.Clock to control time and check whether the cache behaves as expected, instead of stress-testing the code.

@zmb3 (Collaborator, Author) commented Feb 11, 2022

My thought here was that the test is not flaky because it is sometimes too slow, but because a rounding error in our calculation of the "expected number of reads" sometimes pushes us slightly over the tolerance of 1, and there's no such thing as a fractional read.

In any case, I've updated the tolerance to 2.

@espadolini (Contributor) left a review comment

I've been running 6 instances of the lib/cache tests for about 15 minutes, together with adding t.Parallel() to every test, and I've had a single failure (because of a 2.2); there were plenty of failures when the threshold was 1.

@zmb3 (Collaborator, Author) commented Feb 15, 2022

@rosstimothy can you have a look?

Increase tolerance on expected reads.

This should prevent failures where, due to our approximation, we estimate
a fractional number of reads that exceeds our tolerance of 1.

Sample error: Max difference between 10.461059975 and 9 allowed is 1, but difference was 1.4610599749999995

Updates #9492
@zmb3 zmb3 merged commit 5f6cc76 into master Feb 15, 2022
@zmb3 zmb3 deleted the zmb3/deflake-fn-cache branch February 15, 2022 19:54