
Block RNGs: remove unaligned memory cast #783

Merged: 2 commits into rust-random:master from the block branch, Apr 23, 2019

Conversation

@dhardy (Member) commented Apr 22, 2019

Fixes #779. Review please, @RalfJung.

The point of the removed specialisations was to avoid one copy. Since the destination slice may not be aligned for the RNG's word type, and we wish to copy bytes in the same order, we have no choice but to use a buffer anyway. (We could complicate things by checking the alignment at run-time, but I don't think it's worthwhile given the very small performance cost of the extra copy.)
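
For illustration, the buffered path amounts to something like the sketch below (simplified from rand_core's fill_via_u32_chunks helper, which additionally reports how many words it consumed):

```rust
// Simplified sketch of the buffered approach (not the exact rand_core
// code). `results` holds the RNG's native u32 output; `dest` is the
// caller's byte slice, which may have any alignment.
fn fill_via_u32_chunks(results: &[u32], dest: &mut [u8]) -> usize {
    let mut filled = 0;
    for (word, out) in results.iter().zip(dest.chunks_mut(4)) {
        // Go through a 4-byte stack buffer instead of casting `dest` to
        // `&mut [u32]`; copy_from_slice makes no alignment assumption.
        let buf = word.to_le_bytes();
        out.copy_from_slice(&buf[..out.len()]);
        filled += out.len();
    }
    filled // number of bytes written
}
```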

Benchmarking with a 10*1024-byte buffer does show a small impact:

```
# before
test gen_bytes_chacha20             ... bench:  17,775,692 ns/iter (+/- 290,563) = 576 MB/s
test gen_bytes_hc128                ... bench:   4,243,890 ns/iter (+/- 69,032) = 2412 MB/s
test gen_bytes_isaac                ... bench:   7,238,757 ns/iter (+/- 123,797) = 1414 MB/s
test gen_bytes_isaac64              ... bench:   3,900,653 ns/iter (+/- 47,162) = 2625 MB/s
# after
test gen_bytes_chacha20             ... bench:  18,007,036 ns/iter (+/- 384,358) = 568 MB/s
test gen_bytes_hc128                ... bench:   4,634,505 ns/iter (+/- 55,119) = 2209 MB/s
test gen_bytes_isaac                ... bench:   7,338,592 ns/iter (+/- 90,396) = 1395 MB/s
test gen_bytes_isaac64              ... bench:   4,066,367 ns/iter (+/- 55,543) = 2518 MB/s
```

The second commit cleans up some warnings in the benches.

These specialisations relied on casting a u8 byte slice to a u32 or u64 slice, which is UB due to alignment requirements.
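
For reference, the removed specialisations did, in spirit, the following (an illustrative sketch, not the literal deleted code):

```rust
// Sketch of the removed pattern (do NOT do this): reinterpreting a byte
// slice as a u32 slice asserts 4-byte alignment that the caller's `dest`
// may not actually have.
fn cast_bytes_to_u32(dest: &mut [u8]) -> &mut [u32] {
    unsafe {
        core::slice::from_raw_parts_mut(
            dest.as_mut_ptr() as *mut u32, // UB if this pointer is misaligned
            dest.len() / 4,
        )
    }
}
```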
@burdges (Contributor) commented Apr 22, 2019

Is it not worth using read_unaligned, as you mentioned previously?

@dhardy (Member, Author) commented Apr 22, 2019

This isn't about unaligned reads; it's about unaligned writes masquerading as aligned ones (because of the cast to &[u32] or &[u64]). I guess we could change BlockRngCore::generate to take a *mut u32 argument or some such, but I really don't think these small performance losses are worth pushing unsafe code into all block-RNG implementations.
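
For concreteness, that rejected alternative would look roughly like the sketch below; write_words_unaligned is a hypothetical name, not an actual rand API:

```rust
use core::ptr;

// Hypothetical sketch of the rejected alternative: hand implementations a
// raw pointer and rely on unaligned writes. Correct, but it forces unsafe
// code into every BlockRngCore implementation.
//
// Safety: `dest` must be valid for writes of `4 * words.len()` bytes.
unsafe fn write_words_unaligned(words: &[u32], dest: *mut u8) {
    for (i, &w) in words.iter().enumerate() {
        // write_unaligned makes no alignment assumption about `dest`;
        // to_le() keeps the byte order reproducible across platforms.
        ptr::write_unaligned(dest.add(i * 4) as *mut u32, w.to_le());
    }
}
```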

@RalfJung (Contributor) commented:

Is there a way that align_to could be useful here to find the "aligned" part of the buffer? That would work on all platforms then, not just x86.

The change itself seems fine; this is what Miri is testing in #781.
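
For reference, the align_to route would look roughly like this sketch; as explained in the next comment, the alignment-dependent head/tail split is what makes it unsuitable here:

```rust
// Sketch of the align_to idea: split `dest` into an unaligned head, an
// aligned u32 body, and an unaligned tail. Viewing u8 memory as u32 is
// sound (every bit pattern is a valid u32), but where the split falls
// depends on the run-time alignment of `dest`.
fn split_for_aligned_fill(dest: &mut [u8]) -> (&mut [u8], &mut [u32], &mut [u8]) {
    unsafe { dest.align_to_mut::<u32>() }
}
```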

@dhardy (Member, Author) commented Apr 22, 2019

Finding the aligned subset of dest doesn't help unless we allow bytes of the RNG to be skipped. We don't wish to do this arbitrarily (or based on something as tenuous as input alignment) because in some cases reproducibility is important, and this would be hard to document and a breaking change from past behaviour.

I'll leave this until tomorrow for any further review, then we can merge the PRs.

@vks (Collaborator) commented Apr 23, 2019

> Finding the aligned subset of dest doesn't help unless we allow bytes of the RNG to be skipped.

Maybe we should add an API that requires aligned slices?

@dhardy (Member, Author) commented Apr 23, 2019

As in fill_aligned_bytes? Possible, I guess (a sketch follows below).

What I don't understand is why everyone keeps suggesting complicated ways to keep this (very) small, highly specific optimisation.
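
For reference, such an API could look something like the hypothetical sketch below; nothing like it was actually added:

```rust
use rand_core::RngCore;

// Hypothetical extension trait only -- never part of rand. It moves the
// alignment requirement onto the caller, so a word-at-a-time fast path
// could be offered without any unsafe casts inside the RNG.
pub trait FillAlignedBytes: RngCore {
    /// Like `fill_bytes`, but `dest` must be 4-byte aligned.
    fn fill_aligned_bytes(&mut self, dest: &mut [u8]) {
        assert_eq!(dest.as_ptr() as usize % 4, 0, "dest must be 4-byte aligned");
        self.fill_bytes(dest); // default impl: no fast path, just delegate
    }
}

impl<R: RngCore + ?Sized> FillAlignedBytes for R {}
```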

@dhardy merged commit 40b8eb9 into rust-random:master on Apr 23, 2019
@burdges (Contributor) commented Apr 23, 2019

It's not worth complicating the trait over this. lol

@dhardy deleted the block branch on May 16, 2019
@LukasKalbertodt commented:
Would it be possible to backport this bugfix to rand_core 0.4? Currently it's not possible to run quickcheck tests through Miri because this bug always makes the tests fail. And I think quite a few crates have not switched to the new rand version yet.

But I can also understand if this is too much work. I just wanted to ask ^_^

@dhardy (Member, Author) commented Jul 27, 2019

Sure, I guess it's possible.

Linked issue (closed by this PR): rand performs unaligned memory access (and invalid raw ptr usage) (#779)