This repository has been archived by the owner on Sep 24, 2022. It is now read-only.

Large TOML document performance #342

Closed
yuvadm opened this issue Oct 6, 2019 · 11 comments · Fixed by #349


yuvadm commented Oct 6, 2019

I'm attempting to read a large-ish TOML document like so:

println!("Reading toml size {}", s.len());
let t: toml::Value = toml::from_str(s).unwrap();

where the string s is around 4 MB (this is auto-generated TOML, obviously). The file is essentially a large number of very small tables in the following format:

[table-id-1]
name = "Foo"
url = "http://example.com"
tags = ["tag-name"]

The loading time is unreasonable: the CPU spins up to 100% and parsing takes far too long; I killed the process after more than a minute.

Am I wrong to expect this library to be able to handle files that big?
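
For readers trying to reproduce without downloading the linked file: a document of the same shape can be generated with a few lines of stdlib-only Rust. This is a hypothetical sketch; the table names and field values are invented to match the pattern above, not taken from the real file.

```rust
// Hypothetical reproduction sketch: generate a document of n tiny tables
// in the shape described in the issue.
fn generate_toml(n: usize) -> String {
    let mut s = String::with_capacity(n * 80);
    for i in 0..n {
        s.push_str(&format!(
            "[table-id-{}]\nname = \"Foo\"\nurl = \"http://example.com\"\ntags = [\"tag-name\"]\n\n",
            i
        ));
    }
    s
}

fn main() {
    // ~30k tables gives a document in the low-megabyte range,
    // similar in size to the one reported.
    let doc = generate_toml(30_000);
    println!("generated {} bytes", doc.len());
}
```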

alexcrichton (Collaborator) commented

Thanks for the report! Can you gist the file here that's being parsed so some profiling can be done locally to figure out where the time is being spent? Also, to confirm, are you compiling with --release?

yuvadm (Author) commented Oct 9, 2019

@alexcrichton sure, you can pull the file from here https://github.com/streamlib/library/blob/radiodb/library/radiodb.toml

As for --release, thanks for pointing that out: it improves performance significantly, but it still takes a few seconds (about 5 on my machine) to parse the entire file, which could still be improved.
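
As an aside, wall-clock figures like the ~5 s above are easy to capture with std::time::Instant. A minimal sketch, using a stand-in workload in place of toml::from_str so the snippet stays dependency-free:

```rust
use std::time::Instant;

// Stand-in workload; in a real measurement this would be toml::from_str.
fn parse_stand_in(s: &str) -> usize {
    s.bytes().filter(|&b| b == b'[').count()
}

fn main() {
    let doc = "[t]\nname = \"Foo\"\n".repeat(100_000);
    let start = Instant::now();
    let tables = parse_stand_in(&doc);
    // elapsed() reports the wall-clock time spent in the call above.
    println!("counted {} tables in {:?}", tables, start.elapsed());
}
```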

alexcrichton (Collaborator) commented

Thanks! Looks like there's some low-hanging fruit for us to optimize here, and agreed that we can improve this!

Some local profiling shows:

+   66.12%  toml     toml               [.] toml::de::headers_equal                                                            
+   16.28%  toml     toml               [.] <toml::de::MapVisitor as serde::de::MapAccess>::next_key_seed                      
+    8.52%  toml     libc-2.27.so       [.] __memcmp_avx2_movbe                                                                
+    6.11%  toml     toml               [.] <toml::de::MapVisitor as serde::de::MapAccess>::next_key_seed                      
     0.33%  toml     toml               [.] toml::tokens::Tokenizer::next                                                      
     0.30%  toml     toml               [.] <toml::tokens::CrlfFold as core::iter::traits::iterator::Iterator>::next           

Apparently the headers_equal function is super hot and a good candidate to optimize!

yuvadm (Author) commented Oct 9, 2019

IIUC, this function is part of some attempt to match an existing header name? Is there an easy fix here that I might be able to test?

gsquire commented Oct 14, 2019

I took a peek at the file posted above and it seems like there are a lot of headers, which aligns with the profile. Would something like rayon's all make sense here?

We could feature-flag the crate addition, but even that seems non-optimal for a one-line update.

est31 (Contributor) commented Oct 25, 2019

Apparently the headers_equal function is super hot and a good candidate to optimize!

As the person who added that function: note that it only exists on master and was added by commit 7c9b0a3. @yuvadm most likely used a crates.io release. It's slow both before and after that commit, as far as I can tell. The bad performance is caused by these two snippets:

https://github.com/alexcrichton/toml-rs/blob/9ed2903517fe1e63e70fb2138a22296aa434da9e/src/de.rs#L366-L381

https://github.com/alexcrichton/toml-rs/blob/9ed2903517fe1e63e70fb2138a22296aa434da9e/src/de.rs#L500-L503

As the find function is called for every single table, it creates an O(n^2) buildup when you have n tables (before 7c9b0a3 there was a == instead of a headers_equal invocation, but the behaviour was the same).

A speedup can most likely be attained by creating a lookup structure to get sublinear lookup times.
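
The lookup-structure idea can be sketched in stdlib-only Rust. This is an illustration of the approach, not the actual patch in #349: instead of comparing each new header against every previously seen header, keep a HashSet of seen headers and test membership in expected O(1).

```rust
use std::collections::HashSet;

// O(n^2): compare each new header against all previously seen ones,
// as the find loop described above effectively does.
fn check_duplicates_linear(headers: &[Vec<String>]) -> bool {
    for (i, h) in headers.iter().enumerate() {
        if headers[..i].iter().any(|seen| seen == h) {
            return true;
        }
    }
    false
}

// O(n) expected: a lookup structure gives sublinear per-header checks.
fn check_duplicates_hashed(headers: &[Vec<String>]) -> bool {
    let mut seen = HashSet::new();
    headers.iter().any(|h| !seen.insert(h.clone()))
}

fn main() {
    // Same header count as the linked file.
    let headers: Vec<Vec<String>> =
        (0..31_315).map(|i| vec![format!("table-id-{}", i)]).collect();
    // The hashed version handles the full header count comfortably;
    // the linear version is only fed a slice to keep the run short.
    assert!(!check_duplicates_hashed(&headers));
    assert!(!check_duplicates_linear(&headers[..1_000]));
    println!("no duplicates found");
}
```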

est31 (Contributor) commented Oct 25, 2019

Yeah the file linked has 31315 headers, and squaring that gives a very large number...
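
Concretely, squaring 31315 gives just under a billion worst-case header comparisons, which helps explain the slowdown:

```rust
fn main() {
    let headers: u64 = 31_315;
    // Worst-case pairwise comparisons on the O(n^2) path.
    let worst_case = headers * headers;
    assert_eq!(worst_case, 980_629_225);
    println!("{} headers -> {} comparisons", headers, worst_case);
}
```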

est31 (Contributor) commented Oct 25, 2019

@yuvadm could you try my PR #349? It should fix the speed issue.

yuvadm (Author) commented Oct 26, 2019

@est31 your fix looks awesome on my side! Brings performance back to where it should be.

alexcrichton (Collaborator) commented

Thanks for looking into this and fixing it @est31!

yuvadm (Author) commented Oct 28, 2019

Thanks again @est31 this is amazing and @alexcrichton thanks for the quick version bump!
