
[Persistence] Don't persist ALL channel_monitors on every bitcoin block connection. #2647

Open
G8XSU opened this issue Oct 4, 2023 · 4 comments · May be fixed by #2966
@G8XSU
Contributor

G8XSU commented Oct 4, 2023

Currently, on every Bitcoin block update we persist all channel_monitors with the updated best_block.

This can be troublesome for large node operators with thousands of channels.

It also causes a thundering herd problem (ref), hammering the storage with many requests all at once.

@G8XSU
Contributor Author

G8XSU commented Oct 4, 2023

Assigning this to myself; will see if it is doable.

@G8XSU
Contributor Author

G8XSU commented Oct 4, 2023

Adding more detail:
Currently, on every Bitcoin block update we persist all channel_monitors with the updated best_block.

This can be troublesome for large node operators with thousands of channels.

It also causes a thundering herd problem (ref), hammering the storage with many requests all at once.

@TheBlueMatt TheBlueMatt added this to the 0.1.1 milestone Oct 15, 2023
@benthecarman
Contributor

Probably the easiest way to do this would be to add a config option and do the writes in batches.
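A minimal sketch of what batched writes could look like. The `persist_in_batches` function and the `(id, data)` shape are illustrative assumptions for this sketch, not LDK's actual persistence API:

```rust
// Hypothetical sketch of batching monitor writes instead of issuing one
// storage call per monitor all at once; names are illustrative, not LDK API.
fn persist_in_batches(monitors: &[(u32, Vec<u8>)], batch_size: usize) -> usize {
    let mut batches = 0;
    for chunk in monitors.chunks(batch_size) {
        // In a real node, each chunk would be handed to the storage backend
        // as one grouped write, or rate-limited between chunks.
        for (_id, _data) in chunk {
            // storage.persist_monitor(id, data) would go here.
        }
        batches += 1;
    }
    batches
}

fn main() {
    let monitors: Vec<(u32, Vec<u8>)> = (0..10).map(|i| (i, vec![0u8; 4])).collect();
    // 10 monitors in batches of 4 -> 3 storage round-trips instead of 10.
    assert_eq!(persist_in_batches(&monitors, 4), 3);
}
```

A `batch_size` exposed as a config option, as suggested above, would let operators trade write latency against storage request volume.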

@G8XSU
Contributor Author

G8XSU commented Dec 12, 2023

Approach:
We want to persist monitors at some cadence. The easiest thing would be to stop persisting on every block and instead persist on every 10th/50th block.

This cuts down IO by a factor of 10/50, but it doesn't solve the thundering herd problem: all monitors would still rush to be persisted after the same block.

So the idea is to introduce a somewhat random yet deterministic distribution scheme for monitor persists.
The partition key will be a function of (monitor, block_height).

This partitioning strategy alleviates the thundering herd issue and the hot-partition problem for monitor persists, and lets us spread the IO load roughly evenly.

For a node with 500 channels, this should cut IO from 250k monitor persist calls to ~5-6k persists in an 8-hour interval.

Note this also means that after a node restart, a monitor can be at most 50 blocks out of date, and we will need to sync it.
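The distribution scheme above could be sketched as follows. The `should_persist_on_block` function and the simple byte hash are illustrative assumptions for this sketch, not the actual implementation:

```rust
// Hypothetical sketch of the deterministic partitioning idea described above:
// each monitor is persisted once per `interval` blocks, at an offset derived
// from its own id, so persists spread across blocks instead of all landing on
// the same one. The hash and names here are illustrative only.
fn should_persist_on_block(monitor_id: &[u8], block_height: u32, interval: u32) -> bool {
    // Cheap deterministic hash of the monitor id (stand-in for a real hash).
    let hash = monitor_id
        .iter()
        .fold(0u32, |acc, &b| acc.wrapping_mul(31).wrapping_add(b as u32));
    hash.wrapping_add(block_height) % interval == 0
}

fn main() {
    let interval: u32 = 50;
    // Each monitor is persisted exactly once in any window of `interval`
    // blocks, and different ids generally land on different offsets.
    for id in [b"monitor_a".as_slice(), b"monitor_b".as_slice(), b"channel_42".as_slice()] {
        let persists = (0..interval)
            .filter(|h| should_persist_on_block(id, *h, interval))
            .count();
        assert_eq!(persists, 1);
    }
}
```

Because the offset depends only on the monitor id and the block height, the schedule is reproducible after a restart, which is what bounds how far out of date any monitor can be (at most `interval` blocks).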
