Optimize name parsing #1388

saethlin · 2021-02-24T16:02:16Z

This is deliberately a draft. Looking to investigate an approach like flatvec too, but my instinct is that this approach is better (at the expense of making Name larger) because it avoid allocations entirely for a lot of common domain names; 4 labels or less all 16 bytes or less. We're going with the flatvec-style approach to make better use of the expanded stack footprint.

Before:

test bench_parse_real_message          ... bench:       4,219 ns/iter (+/- 38)

After:

test bench_parse_real_message          ... bench:       1,780 ns/iter (+/- 16)

codecov · 2021-02-24T16:19:29Z

Codecov Report

Merging #1388 (c6ac502) into main (a6eb537) will decrease coverage by 0.01%.
The diff coverage is 96.47%.

@@            Coverage Diff             @@
##             main    #1388      +/-   ##
==========================================
- Coverage   85.21%   85.20%   -0.01%     
==========================================
  Files         153      152       -1     
  Lines       15016    15020       +4     
==========================================
+ Hits        12795    12797       +2     
- Misses       2221     2223       +2

saethlin · 2021-03-01T17:44:45Z

At this point, I'm stuck.

This implementation grows the size of Name significantly, because it adds stack buffers to avoid heap allocations. Avoiding allocations is the primary cause of improved performance here.

However, this size increase has caused ResolveErrorKind to grow significantly (the size was already too large, this PR makes it much worse). We could paper over the impact on the size of ResolveError by giving it the same #[cold] + Box treatment, but that doesn't address the lint, which is firing on an enum whose representation is public, so boxing its components would be a breaking change.

I also cannot change the representation of Name to use the flatvec approach, which would significantly improve the stack footprint. Name needs to contain a collection of Label somewhere because of its Index impl (Name::iter is fine because it yields slices of bytes). If I had my way I'd toss the Index impls.

@bluejekyll Do you have any suggestions?

crates/proto/src/error.rs

.github/workflows/test.yml

djc · 2021-03-02T08:58:44Z

Ditching the Index impls sounds good to me provided we don't rely on them too much in tree?

djc · 2021-03-04T10:22:27Z

BTW, I know smallvec is currently used but maybe we should take the chance to migrate to tinyvec. smallvec has a pretty bad security record, with 5 RustSec-reported vulnerabilities reported over the past three years.

saethlin · 2021-03-04T20:39:52Z

I agree, but if we're looking to decrease the memory footprint of Name, going to tinyvec takes us in the wrong direction. If we're not worried overmuch about that, I'm all in. Perhaps I'm just reacting to a lint that we should add an allow for in this PR then fix soon after when we actually address the situation with errors. TinyVec can accept an inline buffer up to 24 bytes without growing, but itself has a minimum size of 32, where SmallVec has a minimum size of 24, the same as Vec.

djc · 2021-03-04T22:18:47Z

I'm personally fine with prioritizing security over memory usage.

saethlin · 2021-03-05T02:04:41Z

As a nice side effect, this is also ~5% faster in a microbenchmark.

bluejekyll · 2021-03-06T04:59:37Z

Where are we on this? I reviewed. I like the changes better than the one I had been working on. Is there still a concern on additional stack space?

The previous implementation used Rc to represent Label, then composed those in an array to represent Name. That produced a large number of small allocations in the parsing code path. This new implementation avoids allocations entirely for small names, and unless the name has a very large number of labels, it is stored entirely in one allocation. This also removes the Index impl for Name. Since we no longer contain any Labels, we cannot implement that (a common problem with Index leaking implementation details).

saethlin · 2021-03-06T07:28:23Z

@bluejekyll I think this is good to go. I've rebased this into two commits to erase the previous implementation strategy and preserve the benchmark on its own so people can check it out and run against the old code. I also bumped one of the inline buffers up to 24 bytes because it's free.

I do not think there are any outstanding concerns about stack footprint. I think the current implementation is a good compromise, and the consequence it has on the size of the error types that was tripping lints is a bigger problem that I'm tackling in another PR.

djc

This is looking great! Commented on some minor nits, but I definitely think we should merge something like this.

crates/proto/src/rr/domain/name.rs

djc

Sorry, forgot to submit this earlier.

crates/resolver/src/error.rs

djc · 2021-03-07T12:43:36Z

@bluejekyll I think this can just be merged, but it doesn't look like I have privileges to do so because CI failed (from what I can tell, spuriously). (Maybe remove the requirement that CI passes necessarily?)

saethlin · 2021-03-07T20:13:09Z

CI failed (from what I can tell, spuriously). (Maybe remove the requirement that CI passes necessarily?)

I'm not convinced this is totally spurious? It's failed in the same way the last two commits in this PR 🤷

bluejekyll · 2021-03-08T04:47:50Z

I kicked the CI jobs again, we'll see what happens.

bluejekyll · 2021-03-08T19:08:23Z

FYI, I reran the tests twice, I think we're good. Merging in. Thanks for this PR and all the perf improvements!

saethlin force-pushed the optimize-name-parsing branch from 9ee9091 to bd4bf60 Compare February 26, 2021 22:26

djc reviewed Mar 2, 2021

View reviewed changes

crates/proto/src/error.rs Outdated Show resolved Hide resolved

.github/workflows/test.yml Outdated Show resolved Hide resolved

saethlin force-pushed the optimize-name-parsing branch from 6a53d0b to 4d29b04 Compare March 3, 2021 20:49

saethlin marked this pull request as ready for review March 5, 2021 04:34

bluejekyll mentioned this pull request Mar 6, 2021

Flat name #1190

Closed

saethlin added 2 commits March 6, 2021 02:22

Add a non-artificial Message parsing benchmark

116a4a8

saethlin force-pushed the optimize-name-parsing branch from b5026d1 to 7b6b79a Compare March 6, 2021 07:23

djc reviewed Mar 6, 2021

View reviewed changes

crates/proto/src/rr/domain/name.rs Outdated Show resolved Hide resolved

crates/proto/src/rr/domain/name.rs Show resolved Hide resolved

crates/proto/src/rr/domain/name.rs Outdated Show resolved Hide resolved

Cleanliness feedback from @djc

31cba74

djc reviewed Mar 6, 2021

View reviewed changes

crates/resolver/src/error.rs Show resolved Hide resolved

Remove duplicate initialization of is_fqdn

d5d3db5

djc approved these changes Mar 7, 2021

View reviewed changes

bluejekyll merged commit 61122bc into hickory-dns:main Mar 8, 2021

This was referenced Mar 16, 2021

chore(deps): bump trust-dns-server from 0.20.0 to 0.20.1 conblem/acme-dns-rust#117

Merged

Bump trust-dns-resolver from 0.20.0 to 0.20.1 lukaspustina/mhost#629

Closed

build(deps): bump trust-dns-proto from 0.20.0 to 0.20.1 compassd/dcompass#48

Closed

djc mentioned this pull request Nov 24, 2021

Evaluate all Vec usage and replace with SmallVec where apropriate #365

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize name parsing #1388

Optimize name parsing #1388

saethlin commented Feb 24, 2021 •

edited

codecov bot commented Feb 24, 2021 •

edited

saethlin commented Mar 1, 2021

djc commented Mar 2, 2021 •

edited

djc commented Mar 4, 2021

saethlin commented Mar 4, 2021

djc commented Mar 4, 2021

saethlin commented Mar 5, 2021

bluejekyll commented Mar 6, 2021

saethlin commented Mar 6, 2021

djc left a comment •

edited

djc left a comment

djc commented Mar 7, 2021 •

edited

saethlin commented Mar 7, 2021

bluejekyll commented Mar 8, 2021

bluejekyll commented Mar 8, 2021

Optimize name parsing #1388

Optimize name parsing #1388

Conversation

saethlin commented Feb 24, 2021 • edited

codecov bot commented Feb 24, 2021 • edited

Codecov Report

saethlin commented Mar 1, 2021

djc commented Mar 2, 2021 • edited

djc commented Mar 4, 2021

saethlin commented Mar 4, 2021

djc commented Mar 4, 2021

saethlin commented Mar 5, 2021

bluejekyll commented Mar 6, 2021

saethlin commented Mar 6, 2021

djc left a comment • edited

Choose a reason for hiding this comment

djc left a comment

Choose a reason for hiding this comment

djc commented Mar 7, 2021 • edited

saethlin commented Mar 7, 2021

bluejekyll commented Mar 8, 2021

bluejekyll commented Mar 8, 2021

saethlin commented Feb 24, 2021 •

edited

codecov bot commented Feb 24, 2021 •

edited

djc commented Mar 2, 2021 •

edited

djc left a comment •

edited

djc commented Mar 7, 2021 •

edited