optimise rfc3339 (and rfc2822) #844

conradludgate · 2022-10-15T09:05:30Z

We use to_rfc3339() a lot in our observability libraries at TrueLayer. Upon profiling, I noticed that it took up a large portion of time, so I looked into optimising it.

I was able to do more optimisations but the requirement of the year formatting causes the code to be pretty tricky. I can attempt it in a later PR if this is accepted.

In our observability code I know that the year is 0..=9999 and the timezone is Utc so I had a bit more gains from cheating 😅

djc · 2022-10-15T10:23:11Z

Nice results!

Is the datetime_from_str regression stable? Why did you move all that code out of the Debug impls into an inherent method?

conradludgate · 2022-10-15T10:33:12Z

Why did you move all that code out of the Debug impls into an inherent method

I was just adding some comments to the code 😅
This allows for better code-reuse while allowing more inlining to avoid dynamic dispatch overhead

Is the datetime_from_str regression stable

~~Yeah I'm not sure on that one. I'm still looking into it~~

The regression is a false positive. It's a little flaky on my hardware but I can't reproduce any conclusive evidence that it's slower on either branch

It dances around between 122ns to 127ns for both 0.4.x and this branch.

conradludgate · 2022-10-15T11:07:09Z

Ok, I'm done tinkering 🙏

esheppa

Looks great, it's nice to specialize the to_rfc* implementations and that is an impressive performance uplift

src/format/mod.rs

djc · 2022-10-16T18:56:02Z

So can you explain what changes caused the majority of the performance improvements?

In particular I'm still unclear why it's good for performance to move the Debug code into write_into() methods; is that actually necessary for performance or is there a way of calling Debug::fmt without a performance penalty relative to calling write_into()? All other things being equal, I think leaving that code in the Debug impl would be more idiomatic.

conradludgate · 2022-10-16T20:26:52Z

Using the fmt methods needs a Formatter type. We don't always have access to one in our implementations. Currently in std there's no way to use fmt into a String without using dynamic dispatch. This is what the generic write_into methods provide by making it generic over fmt::Write. This then supports Formatter and String directly, and it inlines very well.

I'm no longer sure if the byte arrays provide a huge benefit of speed, but it is the only way to avoid the dyn overhead of integer writing, so it remains for now. Initially the idea was that writing into a static buffer would reduce the number of allocation checks and potential reallocs, although considering we specify the capacity upfront I believe this is negligible

conradludgate · 2022-10-17T06:54:53Z

I've just stumbled upon ufmt, which is a dyn-free formatting machinery library and it shows similar gains https://docs.rs/ufmt/latest/ufmt/#benchmarks

Their Debug trait is basically what the write_into is trying to be, since its generic over the writer

djc

Okay, I reviewed this in more detail and I think all the changes here are good, but I would like to have it split up in smaller commits in order to review the changes more carefully. Probably one commit for moving things into write_into(), also added a bunch of suggestions for things I think should be separate commits (separate PRs is also fine if you prefer).

I also think the docstrings you added need some work. Please write docstrings that describe the invariants/interface the function provides, instead of adding comments-as-docstrings. Preferably the first line of a docstring can stand alone and is no longer than a single line.

I would also like some benchmarks that compare writing into a byte array, then copying into a String with directly writing into the String (should probably reserve capacity up front). I'm definitely not a fan of the write_utf8_bytes() strategy...

Thank you for working on this!

src/format/mod.rs

src/naive/time/mod.rs

src/format/mod.rs

conradludgate · 2022-10-17T10:50:20Z

Latest net improvements from the refactor

esheppa · 2022-10-17T12:20:38Z

So can you explain what changes caused the majority of the performance improvements?

I did some experimentation around this, and found:

Pre allocating in the format function, gave about 15% of speedup
The new implementations of RFC3339 and RFC2822 alone, but with the original code in to_rfc* (eg with the Item::Fixed(Fixed::Rfc3339), gives about half of the speedup
My assumption is that the rest comes from the specialized impl avoiding the format function

conradludgate · 2022-10-18T06:55:48Z

@djc the commits are all there now, in small logical chunks with the benchmark results in the commit message. Hope that helps

djc

This is looking great, thanks!

esheppa

Thanks for this @conradludgate - I'm happy with this as is but I've left a few comments to discuss prior to merging

src/format/mod.rs

src/naive/date.rs

greyblake · 2022-10-21T11:06:21Z

@djc @conradludgate Any updates? (looking forward for the release #850 👀 )

…respectively)

djc · 2022-10-26T13:59:11Z

Thanks for sticking with it!

conradludgate force-pushed the optimise-rfc3339 branch 2 times, most recently from a2e61fc to 431307c Compare October 15, 2022 09:46

conradludgate changed the title ~~optimise rfc3339~~ optimise rfc3339 (and rfc2822) Oct 15, 2022

esheppa reviewed Oct 16, 2022

View reviewed changes

src/format/mod.rs Outdated Show resolved Hide resolved

esheppa reviewed Oct 16, 2022

View reviewed changes

src/format/mod.rs Outdated Show resolved Hide resolved

djc reviewed Oct 17, 2022

View reviewed changes

conradludgate force-pushed the optimise-rfc3339 branch from c23831b to 8cc9249 Compare October 17, 2022 10:39

conradludgate commented Oct 17, 2022

View reviewed changes

src/format/mod.rs Show resolved Hide resolved

conradludgate force-pushed the optimise-rfc3339 branch from 8cc9249 to c820718 Compare October 17, 2022 10:49

djc approved these changes Oct 18, 2022

View reviewed changes

djc mentioned this pull request Oct 18, 2022

0.4.23 release planning #850

Closed

esheppa approved these changes Oct 19, 2022

View reviewed changes

src/format/mod.rs Show resolved Hide resolved

src/format/mod.rs Outdated Show resolved Hide resolved

src/naive/date.rs Outdated Show resolved Hide resolved

danielhenrymantilla approved these changes Oct 22, 2022

View reviewed changes

conradludgate added 4 commits October 26, 2022 14:22

use less intermediate formatting (-5% improvement)

14e2272

skip DelayedFormat for rfc3339 (net -58% improvement)

a03c3a2

extract out locales for a later change

c5c63c3

skip DelayedFormat for rfc2822 (net -55% improvement for 2822)

30c8b9e

conradludgate force-pushed the optimise-rfc3339 branch from 5145475 to e42fc89 Compare October 26, 2022 13:29

conradludgate added 2 commits October 26, 2022 14:33

avoid int formatting as much as possible (net -70%/-68% on 2822/3339 …

040125d

…respectively)

remove dyn formatting from timezone (-74/78% on 2822/3339 respectively)

c6bb0d1

conradludgate force-pushed the optimise-rfc3339 branch from e42fc89 to c6bb0d1 Compare October 26, 2022 13:33

djc merged commit 3e2f151 into chronotope:0.4.x Oct 26, 2022

pitdicker mentioned this pull request Sep 8, 2023

Format day in RFC 2822 without padding #1272

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

optimise rfc3339 (and rfc2822) #844

optimise rfc3339 (and rfc2822) #844

conradludgate commented Oct 15, 2022 •

edited

Loading

djc commented Oct 15, 2022

conradludgate commented Oct 15, 2022 •

edited

Loading

conradludgate commented Oct 15, 2022

esheppa left a comment

djc commented Oct 16, 2022

conradludgate commented Oct 16, 2022 •

edited

Loading

conradludgate commented Oct 17, 2022

djc left a comment

conradludgate commented Oct 17, 2022

esheppa commented Oct 17, 2022

conradludgate commented Oct 18, 2022

djc left a comment

esheppa left a comment

greyblake commented Oct 21, 2022 •

edited

Loading

djc commented Oct 26, 2022

optimise rfc3339 (and rfc2822) #844

optimise rfc3339 (and rfc2822) #844

Conversation

conradludgate commented Oct 15, 2022 • edited Loading

djc commented Oct 15, 2022

conradludgate commented Oct 15, 2022 • edited Loading

conradludgate commented Oct 15, 2022

esheppa left a comment

Choose a reason for hiding this comment

djc commented Oct 16, 2022

conradludgate commented Oct 16, 2022 • edited Loading

conradludgate commented Oct 17, 2022

djc left a comment

Choose a reason for hiding this comment

conradludgate commented Oct 17, 2022

esheppa commented Oct 17, 2022

conradludgate commented Oct 18, 2022

djc left a comment

Choose a reason for hiding this comment

esheppa left a comment

Choose a reason for hiding this comment

greyblake commented Oct 21, 2022 • edited Loading

djc commented Oct 26, 2022

conradludgate commented Oct 15, 2022 •

edited

Loading

conradludgate commented Oct 15, 2022 •

edited

Loading

conradludgate commented Oct 16, 2022 •

edited

Loading

greyblake commented Oct 21, 2022 •

edited

Loading