4.x: Additional CQv2 message store optimisations #11112

lhoguin · 2024-04-29T08:38:33Z

Planning to include these for 4.0.

lhoguin · 2024-04-30T12:54:31Z

After testing, the commit "CQ: Don't scan shared store files before deleting them" does help avoid a backlog of compaction/delete operations in the store GC process. With 24 million messages and 4 queues (2 normal, 2 fan-out), without the patch I get about a minute and a half of backlog that is still being processed after the queues have consumed all messages, and no backlog with the patch.

I will see if the other commit helps at all next.

lhoguin · 2024-05-06T16:27:01Z

The second commit greatly improves dirty recovery times. On my machine, with data that's 24 million messages spread over many files and two queues, node recovery goes from 4min30 to less than 2min. The problem was that the old code was gathering messages from queues one by one (meaning 3 or 4 Erlang messages per AMQP message!!). Now it does so per segment file.
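To illustrate why batching per segment file reduces the message count, here is a minimal Python sketch. It is not the RabbitMQ code (which is Erlang); the function names and the `(segment, msg_id)` data layout are hypothetical and only model the idea of one request per stored message versus one request per segment file.

```python
from collections import defaultdict


def per_message_requests(entries):
    """Old shape, as described above: one request per stored message."""
    return len(entries)


def per_segment_batches(entries):
    """New shape: group message ids by segment file, one request per segment."""
    batches = defaultdict(list)
    for segment, msg_id in entries:
        batches[segment].append(msg_id)
    return dict(batches)


# Hypothetical data: 10 messages spread over 2 segment files.
entries = [(i // 5, f"msg-{i}") for i in range(10)]
batches = per_segment_batches(entries)

print(per_message_requests(entries))  # number of per-message requests
print(len(batches))                   # number of per-segment requests
```

In this toy model the number of requests drops from the number of messages to the number of segment files, which is where the saving in inter-process messages would come from.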

There are still parts that could be improved for making dirty recovery blazingly fast but they require storing additional state on disk and so will not be investigated fully for now.

lhoguin · 2024-05-07T12:09:51Z

I will work on merging my 4.x PRs next week.

lhoguin · 2024-05-14T12:43:19Z

Making one more addition to this PR so it is not ready to be merged yet.

This only applies to v2 because modifying this part of the v1 code is seen as too risky considering v1 will soon get removed.

lhoguin · 2024-05-16T09:22:02Z

I have dropped the first commit. It turns out that the function doing the scanning also removes entries from the ETS index, and we need to keep that. So unfortunately we cannot avoid scanning for the time being. To handle that better, we have little choice other than reworking the file format on disk.
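A toy Python model of the problem, under loud assumptions: the dictionaries below stand in for the store files and the ETS index, and `scan_file` and `delete_file` are hypothetical names, not functions from the RabbitMQ code base. The only point is that the scan is what tells the delete step which index entries to remove.

```python
def scan_file(files, name):
    """Return the message ids found in a store file (hypothetical helper)."""
    return [msg_id for msg_id, _payload in files[name]]


def delete_file(files, index, name, scan=True):
    """Delete a store file; with scan=True, also drop its index entries."""
    if scan:
        for msg_id in scan_file(files, name):
            index.pop(msg_id, None)
    del files[name]


def fresh_state():
    # Hypothetical state: one file holding two messages, both indexed.
    files = {"0.rdq": [("a", b"x"), ("b", b"y")]}
    index = {"a": "0.rdq", "b": "0.rdq"}
    return files, index


files, index = fresh_state()
delete_file(files, index, "0.rdq", scan=True)
with_scan = dict(index)

files, index = fresh_state()
delete_file(files, index, "0.rdq", scan=False)
without_scan = dict(index)

print(with_scan)     # index after deleting with a scan
print(without_scan)  # index after deleting without a scan
```

In this model, skipping the scan leaves index entries pointing at a file that no longer exists, which is the kind of stale state the dropped commit would have risked.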

lhoguin · 2024-05-24T12:34:56Z

I will do some refactoring that should help detect problems like https://github.com/rabbitmq/rabbitmq-server/pull/11288/files better, as well as clearly separate the parts of the code that relate to each other.

essen force-pushed the loic-faster-cq-shared-store-gc branch from b2f11a2 to 66ad60e Compare April 29, 2024 08:48

michaelklishin changed the title ~~CQ: Additional message store GC optimisations DO NOT MERGE~~ 4.x: DO NOT MERGE Additional CQv2 message store GC optimisations Apr 29, 2024

essen force-pushed the loic-faster-cq-shared-store-gc branch from 66ad60e to 7427cfe Compare May 2, 2024 09:36

essen force-pushed the loic-faster-cq-shared-store-gc branch from 4351976 to 84695ff Compare May 6, 2024 15:49

essen force-pushed the loic-faster-cq-shared-store-gc branch 2 times, most recently from 07be784 to fbf11f5 Compare May 7, 2024 10:46

lhoguin changed the title ~~4.x: DO NOT MERGE Additional CQv2 message store GC optimisations~~ 4.x: Additional CQv2 message store optimisations May 7, 2024

lhoguin marked this pull request as ready for review May 7, 2024 12:09

michaelklishin added this to the 4.0.0 milestone May 7, 2024

essen force-pushed the loic-faster-cq-shared-store-gc branch 2 times, most recently from fd3a118 to 817c59e Compare May 14, 2024 14:57

gomoripeti mentioned this pull request May 14, 2024

Optimise how rdq files are scanned at CQ shared message store recovery startup #11072

Closed

essen force-pushed the loic-faster-cq-shared-store-gc branch 2 times, most recently from 5a7610a to d8a5536 Compare May 15, 2024 11:30

lhoguin added 2 commits May 16, 2024 11:16

CQ: Make dirty recovery of shared store more efficient

6019e75

This only applies to v2 because modifying this part of the v1 code is seen as too risky considering v1 will soon get removed.

CQ: Write large messages into their own files

0575002

essen force-pushed the loic-faster-cq-shared-store-gc branch from d8a5536 to 0575002 Compare May 16, 2024 09:20
