Fix PERT distribution for when `mode` is very close to `(max - min) / 2` #1311

LukasKalbertodt · 2023-05-01T16:45:53Z

There is already a special condition for this case, as the general formula breaks for floats. However, the check uses == which is fishy for floats. This commit instead checks if the difference is smaller than the machine epsilon.

Without this commit, this returns an error (despite being totally valid parameters for PERT):

Pert::new(0.0, 0.48258883, 0.24129441)

dhardy · 2023-05-18T13:13:35Z

rand_distr/src/pert.rs

@@ -99,7 +99,7 @@ where

        let range = max - min;
        let mu = (min + max + shape * mode) / (shape + F::from(2.).unwrap());
-        let v = if mu == mode {
+        let v = if (mu - mode).abs() < F::epsilon() {


Epsilon is not the appropriate value to use here: e.g. if min and max are very small this test will be too permissive. I'd suggest something like ~~2 * ε / mu~~ 2εμ.

Good point. I've read up on floating point comparisons (in particular this) and force pushed some changes. I'm now using 2 * F::EPSILON * max(mu.abs(), mode.abs()). This is something commonly suggested, i.e. scaling the machine epsilon by the maximum of the two inputs.

This might still go wrong if one of the values is 0. The linked article talks a lot about how that's quite difficult to deal with. And actually, the popular approx crate treats numbers as equal if they differ less than F::epsilon, at least by default. (To the best of my understanding.) This is to deal with numbers close to 0. So even for ULP or relative epsilon comparison, this absolute epsilon comparison is done first. So actually, thinking about it, my initial commit might be fine for small numbers, but it's main problem was probably large numbers (where F::epsilon is smaller than the gap between two neighboring floats).
Uff, this is tricky :/

I pushed a new version again. This time, the epsilon is scaled by the largest input. I think with this, everything should be fine. In particular, this deals with "catastrophic cancellation": where the inputs (min, max, mode, shape) are quite large, but due to subtracting them from one another, we keep the quite large rounding error even for mu close to 0. This is also recommended in the linked article:

You’ll need to use an absolute epsilon, whose value might be some small multiple of FLT_EPSILON and the inputs to your calculation.

dhardy

Your previous suggestion makes the most sense to me:

let thresh = 2 * F::EPSILON * max(mu.abs(), mode.abs());

dhardy · 2023-05-20T13:32:47Z

rand_distr/src/pert.rs

+        // to the whole calculation.
+        //
+        // https://randomascii.wordpress.com/2012/02/25/comparing-floating-point-numbers-2012-edition/
+        let epsilon = shape.max(min.abs()).max(max.abs()) * F::epsilon() * (F::one() + F::one());


Must you call this complicated expression epsilon? And why does it involve shape anyway?

There is already a special condition for this case, as the general formula breaks for floats. However, the check uses `==` which is fishy for floats. This commit instead checks if the difference is smaller than the machine epsilon. Without this commit, this returns an error (despite being totally valid parameters for PERT): Pert::new(0.0, 0.48258883, 0.24129441)

LukasKalbertodt · 2023-05-20T15:08:27Z

Your previous suggestion makes the most sense to me:
let thresh = 2 * F::EPSILON * max(mu.abs(), mode.abs());

I think the problem with this is that mu can have an error that is not proportional to its magnitude. If the inputs of that calculation (e.g. min and max) are considerably larger than what mu ends up being, mu has an error that's proportional to those large inputs.

But... I've looked at this quite a while now and I'm not 100% sure about anything. It doesn't help that I don't know the semantic meaning of mu, shape or many other parts. So, I'm fine with merging the threshold calculation you mentioned. I pushed that now.

LukasKalbertodt force-pushed the fix-pert-distribution branch from f6161bf to b76ed8a Compare May 1, 2023 16:53

dhardy reviewed May 18, 2023

View reviewed changes

LukasKalbertodt force-pushed the fix-pert-distribution branch 3 times, most recently from 880e1ac to ebd1d98 Compare May 20, 2023 11:59

dhardy reviewed May 20, 2023

View reviewed changes

LukasKalbertodt force-pushed the fix-pert-distribution branch from ebd1d98 to 6f8437e Compare May 20, 2023 15:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix PERT distribution for when `mode` is very close to `(max - min) / 2` #1311

Fix PERT distribution for when `mode` is very close to `(max - min) / 2` #1311

LukasKalbertodt commented May 1, 2023

dhardy May 18, 2023 •

edited

LukasKalbertodt May 20, 2023 •

edited

LukasKalbertodt May 20, 2023

LukasKalbertodt May 20, 2023

dhardy left a comment

dhardy May 20, 2023

LukasKalbertodt commented May 20, 2023

Fix PERT distribution for when mode is very close to (max - min) / 2 #1311

Are you sure you want to change the base?

Fix PERT distribution for when mode is very close to (max - min) / 2 #1311

Conversation

LukasKalbertodt commented May 1, 2023

dhardy May 18, 2023 • edited

Choose a reason for hiding this comment

LukasKalbertodt May 20, 2023 • edited

Choose a reason for hiding this comment

LukasKalbertodt May 20, 2023

Choose a reason for hiding this comment

LukasKalbertodt May 20, 2023

Choose a reason for hiding this comment

dhardy left a comment

Choose a reason for hiding this comment

dhardy May 20, 2023

Choose a reason for hiding this comment

LukasKalbertodt commented May 20, 2023

Fix PERT distribution for when `mode` is very close to `(max - min) / 2` #1311

Fix PERT distribution for when `mode` is very close to `(max - min) / 2` #1311

dhardy May 18, 2023 •

edited

LukasKalbertodt May 20, 2023 •

edited