internal/arenaskl: batch allocate nodes #3097

jbowens · 2023-11-22T16:36:55Z

While applying a batch to the memtable, we iterate over the batch and allocate memory one by one for each KV pair's node in the memtable. Each allocation performs an atomic load and an atomic increment. Allocating a contiguous swath of memory from the arena all at once may reduce the overhead of batch application. If KVs within a batch are more likely to be proximate to each other than to KVs committed by other batches (almost certainly true), this would also improve read-time CPU cache locality by avoiding interleaving KVs from separate, concurrently applied batches.
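To make the comparison concrete, here is a minimal sketch of per-node versus batch reservation on a toy bump allocator. The `arena` type and the `alloc`/`allocBatch` names are hypothetical and are not the actual `internal/arenaskl` API; alignment is ignored, and the toy `alloc` does only the atomic add (not the extra atomic load mentioned above).

```go
package main

import (
	"fmt"
	"sync/atomic"
)

// arena is a toy bump allocator (hypothetical; not Pebble's arenaskl.Arena).
type arena struct {
	n   atomic.Uint64 // offset of the next free byte
	buf []byte
}

func newArena(size int) *arena { return &arena{buf: make([]byte, size)} }

// alloc reserves size bytes with one atomic add and returns the start offset.
// On failure the offset stays advanced, so the arena is treated as full.
func (a *arena) alloc(size uint32) (uint32, bool) {
	end := a.n.Add(uint64(size))
	if end > uint64(len(a.buf)) {
		return 0, false
	}
	return uint32(end) - size, true
}

// allocBatch reserves one contiguous region covering every node in the batch
// with a single atomic add, then carves out per-node offsets locally.
func (a *arena) allocBatch(sizes []uint32) ([]uint32, bool) {
	var total uint64
	for _, s := range sizes {
		total += uint64(s)
	}
	end := a.n.Add(total)
	if end > uint64(len(a.buf)) {
		return nil, false
	}
	offs := make([]uint32, len(sizes))
	off := uint32(end - total)
	for i, s := range sizes {
		offs[i] = off // nodes of one batch sit next to each other
		off += s
	}
	return offs, true
}

func main() {
	a := newArena(1 << 10)
	offs, ok := a.allocBatch([]uint32{40, 24, 64})
	fmt.Println(offs, ok)
}
```

With `allocBatch`, a batch of N nodes costs one atomic operation instead of N, and concurrent batches cannot interleave their nodes inside the reserved region. The catch is that every node's size must be known before the reservation is made.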

The amount of memory required for a node is a function of the size of the key, the size of the value, and the height of the skiplist node. The height of the skiplist node is random and not a function of the existing skiplist at all. It could be decided ahead of time, before even entering the commit pipeline, but then we'd need a place to remember it. We could conceivably steal 5 bits from the trailer somewhere (20 is the max node height; ⌈log₂(20)⌉ = 5), but that sounds delicate. Or we could avoid deciding each KV's node height before application and instead accumulate an aggregate height across all nodes in the batch. Then, at batch application time, we'd need to randomly distribute the aggregate height among the individual KVs. (This could use some thought.)
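As a rough illustration of the first option (deciding heights up front and remembering them in 5 spare bits), here is a sketch under stated assumptions. The header and link sizes, the 1/2 promotion probability, and the choice of the top 5 bits of a 64-bit word are all hypothetical; none of them is taken from the real node or trailer layout.

```go
package main

import (
	"fmt"
	"math/rand"
)

const (
	maxHeight   = 20 // max node height, per the issue text
	heightBits  = 5  // ceil(log2(20)) = 5
	heightShift = 64 - heightBits

	// Hypothetical sizes, for illustration only.
	nodeHeaderSize = 16
	linkSize       = 8
)

// randomHeight draws a height in [1, maxHeight]. Each extra level is kept
// with probability 1/2 (an assumed value, independent of the skiplist).
func randomHeight(rng *rand.Rand) uint32 {
	h := uint32(1)
	for h < maxHeight && rng.Intn(2) == 0 {
		h++
	}
	return h
}

// nodeSize is the bytes needed for one node: a function of key length,
// value length, and tower height.
func nodeSize(keyLen, valLen, height uint32) uint32 {
	return nodeHeaderSize + height*linkSize + keyLen + valLen
}

// packHeight stores height-1 in the top 5 bits of word, overwriting whatever
// was there. It assumes the caller does not otherwise use those bits.
func packHeight(word uint64, height uint32) uint64 {
	mask := uint64(1)<<heightShift - 1
	return word&mask | uint64(height-1)<<heightShift
}

// unpackHeight recovers the height stored by packHeight.
func unpackHeight(word uint64) uint32 {
	return uint32(word>>heightShift) + 1
}

func main() {
	rng := rand.New(rand.NewSource(1))
	type kv struct{ key, val string }
	batch := []kv{{"a", "1"}, {"bb", "22"}}

	var total uint32
	for _, e := range batch {
		word := packHeight(0, randomHeight(rng)) // decided before commit
		h := unpackHeight(word)                  // recovered at apply time
		total += nodeSize(uint32(len(e.key)), uint32(len(e.val)), h)
	}
	fmt.Println("bytes to reserve for the batch:", total)
}
```

Storing `height-1` lets heights 1 through 20 fit in 5 bits. The aggregate-height alternative is not sketched here, since how to redistribute the total while preserving the height distribution is left open above.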

jbowens added T-storage A-storage performance labels Nov 22, 2023

jbowens added this to Incoming in Storage via automation Nov 22, 2023

jbowens mentioned this issue Nov 22, 2023

storage: improve single delete safety cockroachdb/cockroach#114492

Open

nicktrav moved this from Incoming to Backlog in Storage Nov 28, 2023
