
Check for buggy preadv2 #3821

Closed
wants to merge 4 commits

Conversation

Darksonn
Contributor

@Darksonn Darksonn added A-tokio Area: The main tokio crate M-fs Module: tokio/fs labels May 28, 2021
Comment on lines 865 to 867
let release = unsafe { CStr::from_ptr((*uts.as_ptr()).release.as_ptr()) };
let release = release.to_bytes();
release[..4] == b"5.9."[..] || release[..5] == b"5.10."[..]
Contributor


You beat me to it :)

FWIW, in the impl I was brewing I made this comparison with [i8], avoiding the conversion via CStr, like this:

        const _5_9: [i8; 4] = [53, 46, 57, 46]; // [i8] repr of b"5.9."
        const _5_10: [i8; 5] = [53, 46, 49, 48, 46]; // [i8] repr of b"5.10."
        uts.release[..4] == _5_9 || uts.release[..5] == _5_10

Not that it matters, this function will only be called ~once per process.
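For illustration, the check being discussed can be made self-contained. This is a sketch only; the `is_buggy_release` helper name is mine, not tokio's actual API, and in the real code the bytes come from the `release` field returned by libc's `uname()`:

```rust
/// Returns true if the kernel release string belongs to one of the
/// series affected by the preadv2 bug (5.9.x or 5.10.x).
/// Hypothetical helper, not tokio's actual code.
fn is_buggy_release(release: &[u8]) -> bool {
    // `starts_with` also handles release strings shorter than the
    // prefix, which direct slicing like `release[..5]` would panic on.
    release.starts_with(b"5.9.") || release.starts_with(b"5.10.")
}
```

Note the trailing dot in each prefix: it prevents, say, a hypothetical "5.90.x" release from matching "5.9".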

@wmanley
Contributor

wmanley commented May 28, 2021

I'll run my tests with this impl.

@Darksonn Darksonn mentioned this pull request May 28, 2021
@wmanley
Contributor

wmanley commented May 28, 2021

I've run my test on 5.10 and it succeeds with this PR, and fails on master.

@Darksonn
Contributor Author

Darksonn commented Jun 2, 2021

@wmanley I don't really understand the Linux kernel version numbers here. The Linux git repository only has tags for v5.10 and v5.11, and not for things like v5.10.17 like the version from the original issue.

@wmanley
Contributor

wmanley commented Jun 2, 2021

> @wmanley I don't really understand the Linux kernel version numbers here. The Linux git repository only has tags for v5.10 and v5.11, and not for things like v5.10.17 like the version from the original issue.

Here's the git repo with the stable kernel versions: https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git . This is where the point releases are tagged. I've tested both the mainline and the stable kernel tags.

@Darksonn
Contributor Author

Darksonn commented Jun 2, 2021

> I've tested both the mainline and the stable kernel tags.

I got the impression that you just did a bisection? Which tags did you try?

@wmanley
Contributor

wmanley commented Jun 2, 2021

> I've tested both the mainline and the stable kernel tags.
>
> I got the impression that you just did a bisection? Which tags did you try?

I did both.

I did a bisection on mainline to identify the specific commits where the bug was introduced and fixed. Having discovered that it was fixed on mainline, I separately ran the test on the latest stable releases to see whether the fix had been cherry-picked to any stable branch. The results of the stable-branch testing are as follows:

5.6.19: OK
5.7.19: OK
5.8.18: OK
5.9.16: BAD
5.10.38: BAD
5.11.22: OK

My conclusion is that the kernel developers weren't even aware that the bug existed, so the fix has not been backported.

Bisecting to find first bad commit: #3803 (comment)
Bisecting to find first fixed commit: #3803 (comment)
Testing stable branches: #3803 (comment)

@carllerche
Member

Before merging this, I think we need a plan to include testing different kernel versions as part of CI. We don't want to accidentally introduce regressions if we change this code.

Granted, testing different kernel versions is not trivial.

@blasrodri
Contributor

> Before merging this, I think we need a plan to include testing different kernel versions as part of CI. We don't want to accidentally introduce regressions if we change this code.
>
> Granted, testing different kernel versions is not trivial.

I have not had the time to check in detail. But this project seems like an alternative https://github.com/linuxkit/linuxkit

@wmanley
Contributor

wmanley commented Jul 12, 2021

> Before merging this, I think we need a plan to include testing different kernel versions as part of CI. We don't want to accidentally introduce regressions if we change this code.

I'd be happy to have a crack at this, adapting the kernel testing I did in #3803 to CI. We'd have a few pre-built kernels and run qemu in CI to check whether the bug is present on each. Ideally we'd also have a way of running against the latest RC kernel to catch future regressions before they happen.

But first I'd like to know - is it worthwhile? Does anyone have a real-world benchmark that this helps with, or is there some other reason to want this?
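The qemu approach described above could be driven from a Rust test harness along these lines. Everything here is hypothetical: the binary paths, the kernel image names, and the convention of baking the test into the initramfs init script are illustrative assumptions, not tokio's actual CI code.

```rust
use std::process::Command;

/// Builds a QEMU invocation that boots one prebuilt kernel and runs
/// the preadv2 regression test as the guest's init process.
/// Sketch only; image layout and init-script path are hypothetical.
fn qemu_test_command(kernel_image: &str, initrd: &str) -> Command {
    let mut cmd = Command::new("qemu-system-x86_64");
    cmd.args([
        "-nographic",            // headless, suitable for CI runners
        "-no-reboot",            // exit instead of rebooting when the guest halts
        "-kernel", kernel_image, // one of the prebuilt kernels under test
        "-initrd", initrd,       // initramfs containing the test binary
        "-append", "console=ttyS0 panic=-1 quiet init=/run-preadv2-test",
    ]);
    cmd
}
```

A CI matrix would invoke this once per prebuilt kernel (e.g. 5.8.18, 5.9.16, 5.10.38, 5.11.22) and parse the serial console output for the test verdict.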

@Darksonn
Contributor Author

A benchmark? It's mostly to make sure we catch regressions if we break something on some kernel versions only.

@wmanley
Contributor

wmanley commented Jul 12, 2021

> A benchmark? It's mostly to make sure we catch regressions if we break something on some kernel versions only.

Not a benchmark for the testing, but for #3518. The value of the kernel testing is that it would let us un-revert #3518, so the question is whether #3518 is of sufficient value to bother. I wrote #3518 for fun rather than to satisfy a particular need that I have; I'm not actually a tokio user. I'm happy to have a go at test infra, but not for its own sake, only if it actually provides someone with value. So it comes down to: is this something that will actually help people, for performance reasons or otherwise?

@Darksonn
Contributor Author

Generally, file reading involves a whole bunch of cross-thread communication with the current implementation, so it would surprise me if it didn't help.

I tried running the benchmarks in benches/fs.rs with and without the change.

Without the change:

    test async_read_buf      ... bench:   6,714,074 ns/iter (+/- 1,063,411)
    test async_read_codec    ... bench:   6,727,703 ns/iter (+/- 893,984)

With the change:

    test async_read_buf      ... bench:   6,363,540 ns/iter (+/- 807,035)
    test async_read_codec    ... bench:   6,454,059 ns/iter (+/- 857,237)

Though it's just reading from /dev/zero. I don't know how representative that is.

@the8472

the8472 commented Apr 24, 2023

Instead of probing the kernel version, wouldn't it be better to do a normal read when reaching end of file, to verify correct behavior? Since the file should be at its end, the kernel should be able to serve the request without blocking, unless something is appending to the file between the preadv2 and the read call. But if something else is concurrently writing to the file, I would expect some internal locks to cause the threadpool fallback anyway.
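The verification being proposed can be sketched without issuing real syscalls. The enum and function names below are mine and purely illustrative of the fallback logic, not tokio's implementation; in real code the first arm would come from preadv2 with RWF_NOWAIT and the fallback closure would be an ordinary blocking read on the threadpool.

```rust
use std::io;

/// Abstract outcome of a preadv2(..., RWF_NOWAIT)-style attempt.
/// Hypothetical names, for illustration only.
enum NowaitRead {
    Data(usize), // read n bytes without blocking
    Eof,         // the nonblocking path claims end of file
    WouldBlock,  // kernel could not serve the read without blocking
}

/// Trust nonblocking data, but double-check an apparent EOF with an
/// ordinary read, since a buggy kernel may report EOF spuriously.
fn handle_nowait_result(
    res: NowaitRead,
    blocking_read: impl FnOnce() -> io::Result<usize>,
) -> io::Result<usize> {
    match res {
        NowaitRead::Data(n) => Ok(n),
        // Apparent EOF: if the file really is at its end, the plain read
        // returns 0 immediately; if it returns data instead, the
        // nonblocking report was wrong and the fallback result is used.
        NowaitRead::Eof => blocking_read(),
        // Page not cached (or contention): punt to the blocking path.
        NowaitRead::WouldBlock => blocking_read(),
    }
}
```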

@Darksonn Darksonn closed this Nov 25, 2023
@Darksonn Darksonn deleted the preadv2-bug branch November 25, 2023 12:40
Labels
A-tokio Area: The main tokio crate M-fs Module: tokio/fs

Successfully merging this pull request may close these issues.

fs::File reads first 4096 bytes only on Linux v5.9 and v5.10
6 participants