[EXPERIMENT] Branch mispredictions in twitter.json #2061

jkeiser · 2023-09-05T21:23:21Z

This is an accounting of what causes our various branch misses in stage 2. I took twitter.json and transformed it via a series of steps into a single file with a single array and only empty strings where each scalar was in the original. I did this such that the size was always identical, values in roughly the same position, and (with the exception of array nesting removal) the number of structurals and size of the output from stage 2 identical.

The raw results from icelake are here, but here's the rundown. These numbers assume that branch misses from all these sources are completely independent, which probably isn't the case, but probably isn't completely wrong, either. Fascinatingly to me, this does seem to be a complete list: removing all of these sources of misprediction brings branch misses down from 773 to 3.

Integers with more than 1 digit (*) - 30% of branch misses in the file
Container type switching - 28%
String length - 18%
String/number/bool/null type switching - 17%
Backslashes - 6%
UTF-8 - 1%

(*) Of note is that a file where all numbers are replaced with 18-digit integers (or the same for 8-digit) seems to be just about as unpredictable as a file where the numbers have all sorts of different sizes. 1-digit numbers do not have this issue. I'm not sure why this is.

numbers.

Daniel Lemire and others added 6 commits September 5, 2023 11:05

Redesigning visit_primitive so that it is optimized for strings and

588c067

numbers.

Branch tests with twitter.json.

47e3e95

All branch tests for twitter.json

02b488b

Moar jsonexamples

90aa198

Add jsonexamples generator and miss reduction script

5d2107b

Show more transitions

c3414a1

jkeiser force-pushed the jkeiser/branch-test branch 2 times, most recently from 02b0d46 to 65c243a Compare September 12, 2023 19:01

Add results

6f196d0

jkeiser force-pushed the jkeiser/branch-test branch from 65c243a to 6f196d0 Compare September 12, 2023 19:02

jkeiser added 3 commits September 13, 2023 11:10

Right-justify int columns in markdown tables

6a0d7bc

Updates

0edf100

Updates

d1de135

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[EXPERIMENT] Branch mispredictions in twitter.json #2061

[EXPERIMENT] Branch mispredictions in twitter.json #2061

jkeiser commented Sep 5, 2023 •

edited

[EXPERIMENT] Branch mispredictions in twitter.json #2061

Are you sure you want to change the base?

[EXPERIMENT] Branch mispredictions in twitter.json #2061

Conversation

jkeiser commented Sep 5, 2023 • edited

jkeiser commented Sep 5, 2023 •

edited