
Implement HighPrecision01 distribution #372

Closed
wants to merge 2 commits

Conversation

pitdicker
Contributor

Re-opening #320.

pitdicker and others added 2 commits April 4, 2018 20:07
@dhardy
Member

dhardy commented Apr 16, 2018

So if I recall correctly we want to try implementing HighPrecisionUniform (or some shorter name) based on this idea plus @pitdicker's previous split-around-zero trick to produce high-precision floats over arbitrary ranges.

This is thus a good addition but doesn't have to be done before 0.5.
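A minimal sketch of the idea, for reference (my illustration, not the PR's exact code; it rounds the subnormal range to zero for brevity): draw random words until a set bit fixes the binade, then fill the mantissa with fresh random bits.

```rust
use rand::Rng;

// Sketch only: a full-precision sample in [0, 1) for f64.
fn high_precision_01<R: Rng + ?Sized>(rng: &mut R) -> f64 {
    // Biased exponent of the [0.5, 1) binade in IEEE 754 binary64.
    let mut exponent: i32 = 1022;
    // Each leading zero bit halves the binade: [2^-(k+1), 2^-k) is
    // selected with probability 2^-(k+1), exactly its width, as a
    // uniform sample requires.
    loop {
        let bits = rng.gen::<u64>();
        if bits != 0 {
            exponent -= bits.leading_zeros() as i32;
            break;
        }
        exponent -= 64;
    }
    if exponent < 1 {
        // Reaching this needs ~1022 zero bits in a row; a real
        // implementation would continue into the subnormal range.
        return 0.0;
    }
    // 52 fresh bits pick uniformly among the values in the binade.
    let mantissa = rng.gen::<u64>() >> 12;
    f64::from_bits(((exponent as u64) << 52) | mantissa)
}
```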

@dhardy added labels X-enhancement (Type: proposed enhancement), F-new-int (Functionality: new, within Rand), T-distributions (Topic: distributions), P-medium (Priority: Medium) on Apr 16, 2018
@vks
Collaborator

vks commented Apr 23, 2018

Instead of testing the average, I think it makes more sense to fill a histogram. With a sufficient number of samples, it should be possible to distinguish the biased Standard from the unbiased HighPrecision01 by looking at the lowest bins. However, it might take too many samples to be feasible as a test. In any case, even with fewer samples a histogram would be preferable to just calculating the average.
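A rough sketch of such a test (hypothetical; the helper below is not part of this PR) could bucket samples by binade and inspect the lowest bins:

```rust
use rand::distributions::Distribution;

// Hypothetical test helper: bins[0] counts samples in [0.5, 1),
// bins[1] counts [0.25, 0.5), and so on; zero and everything below
// 2^-63 collects in the last bin.
fn binade_histogram<D: Distribution<f64>>(dist: &D, n: usize) -> [u64; 64] {
    let mut rng = rand::thread_rng();
    let mut bins = [0u64; 64];
    for _ in 0..n {
        let x = dist.sample(&mut rng);
        if x <= 0.0 {
            bins[63] += 1;
            continue;
        }
        let idx = (-x.log2().floor()) as usize - 1;
        bins[idx.min(63)] += 1;
    }
    bins
}
```

An unbiased sampler should put about n·2^-(k+1) samples in bin k, while Standard (which only emits multiples of a fixed step around 2^-52) leaves the deepest binades empty, so telling the two apart this way needs on the order of 2^52 samples, matching the feasibility concern above.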

I think it might make sense to expose LowPrecision01 in case we ever want to change Standard. Maybe Biased01 and Unbiased01 are better names?

@vks
Collaborator

vks commented Apr 23, 2018

One caveat (from http://xoroshiro.di.unimi.it/) that should probably be addressed in the documentation:

Interestingly, these are not the only notions of “uniformity” you can come up with. Another possibility is that of generating 1074-bit integers, normalize and return the nearest value representable as a 64-bit double (this is the theory—in practice, you will almost never use more than two integers per double as the remaining bits would not be representable). This approach guarantees that all representable doubles could be in principle generated, albeit not every returned double will appear with the same probability. A reference implementation can be found here. Note that unless your generator has at least 1074 bits of state and suitable equidistribution properties, the code above will not do what you expect (e.g., it might never return zero).

@dhardy mentioned this pull request May 24, 2018
@sicking
Contributor

sicking commented Jun 4, 2018

Is there still interest in this?

I wrote a generator a while back for values in the [0, 1) range using maximum precision here.

What the algorithm effectively does is pick a random, but perfectly unbiased, point on the continuous line between 0 and 1, and then round it down to the nearest point that can be represented as an f32/f64.

This means that all values in the [0, 1) range that can be represented by an f32/f64 can be returned by the algorithm. However, not all values are equally likely: an f32/f64 has many more representable values close to 0 than close to 1, so the likelihood of rounding to any particular value close to 0 is smaller than that of rounding to a particular value close to 1.
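A standalone snippet (not from the linked code) makes the spacing difference concrete:

```rust
fn main() {
    // Gap between 1.0 and the next representable f64 below it: 2^-53.
    let below_one = f64::from_bits(1.0f64.to_bits() - 1);
    println!("gap just below 1.0: {:e}", 1.0 - below_one);
    // Gap between 0.0 and the smallest positive f64 (subnormal): 2^-1074.
    println!("gap just above 0.0: {:e}", f64::from_bits(1));
}
```

Rounding down an unbiased real point therefore hits each value near 0 with probability proportional to a gap on the 2^-1074 scale, versus 2^-53 near 1.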

Happy to adapt this to rand if there's interest?

There's also code in the same file which uses the same approach to generate values between two arbitrary, finite f32/f64 values. I.e. it picks a random, unbiased point on the continuous line between the start and end and then rounds down to the closest f32/f64 below the picked point. However, it relies on the num_bigint crate, so it would require more work to port.

@sicking
Contributor

sicking commented Jun 4, 2018

I should also mention that the code for high-precision sampling for an arbitrary range is quite slow. I mainly wrote it for funsies to see what it'd look like.

However the code for high-precision sampling in [0, 1) has more reasonable performance. Though obviously slower than the Standard implementation.

@dhardy
Member

dhardy commented Jun 5, 2018

IIRC this implementation works fine and has reasonable performance, but we were considering going with arbitrary ranges. On the other hand, if that's not easy to do, it might not be a good option.

We were planning on adding distributions::HighPrecision, which is just the full-precision equivalent of Uniform.

@sicking
Contributor

sicking commented Jun 5, 2018

Cool, the existing implementation here does seem faster than the one I wrote, so I think we should go with this one.

Happy to port the arbitrary-range full-precision implementation that I wrote if there's interest? The current code is here.

Are there any perf goals in mind?

@sicking
Contributor

sicking commented Jun 12, 2018

FWIW, I benchmarked my full-precision arbitrary-range implementation and it's about 50-100x slower than the low-precision version (which is similar to what's in Uniform for f32/f64).

It hasn't been perf-optimized much, so I'm sure that can be lowered some. But it's pretty darn slow.

@sicking
Contributor

sicking commented Jun 21, 2018

I'm working on a more performant implementation here. It still doesn't work, so I can't get perf numbers yet.

@sicking
Contributor

sicking commented Jun 22, 2018

The implementation over here is now working and benchmarked. It looks to be about 6-9x slower than the current Uniform implementation, which seems viable? I'm sure some more performance can be squeezed out as well.

@dhardy
Member

dhardy commented Jun 23, 2018

Good work. I think with that performance it is probably worth including this somehow, though obviously not as the default option. I guess we may also want a different implementation for high-precision Standard?

I would say open a PR, but it probably makes sense to resolve #494 first.

@sicking mentioned this pull request Jun 27, 2018
@pitdicker
Contributor Author

Closing in favor of #531

@pitdicker closed this Jul 18, 2018
@dhardy
Member

dhardy commented Jul 18, 2018

Why? As I understand it, this has better performance but only works over [0, 1), so the two don't seem to be mutually exclusive.

@dhardy reopened this Jul 18, 2018
@sicking
Contributor

sicking commented Jul 19, 2018

The PR in #531 includes the commits here, but updated to compile on master. #531 contains both HighPrecision01 and HighPrecision distributions.
