
perf: significantly improve the memory usage of histogram #610

Open · wants to merge 6 commits into master
Conversation

shappir (Contributor) commented Feb 6, 2024

Significantly reduce the amount of memory used by histograms, especially for high cardinality and/or many buckets:

  1. Create the bucketValues object with a prototype holding the zero values, instead of copying them. This way a counter per bucket is only allocated when its value becomes greater than zero, i.e. the first time it's incremented (see the sketch after this list).
  2. Allocate bucketExemplars empty (when exemplars are used) instead of pre-filling it with nulls.
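
A minimal sketch of the prototype trick (illustrative; `zeroes` and `createBucketValues` are placeholder names, not necessarily the PR's code):

```js
// Default prom-client buckets, as listed later in this thread.
const upperBounds = [0.005, 0.01, 0.025, 0.05, 0.1, 0.25, 0.5, 1, 2.5, 5, 10];

// One shared object holds a zero for every bucket...
const zeroes = {};
for (const bound of upperBounds) {
  zeroes[bound] = 0;
}

// ...and serves as the prototype of each label combination's bucketValues,
// so reads fall through to the shared zeroes until a bucket is actually hit.
function createBucketValues() {
  return Object.create(zeroes);
}

const bucketValues = createBucketValues();
bucketValues[0.5] += 1; // allocates an own property for this bucket only
console.log(bucketValues[0.005]); // 0 (inherited from the shared prototype)
console.log(Object.keys(bucketValues)); // ['0.5']: only touched buckets use memory
```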

Additional optimizations:

  1. Insert valueFromMap into the hash map only when it's first allocated (don't reinsert it on every update).
  2. Don't perform a second lookup for bucketExemplars; reuse the previous lookup instead (see the sketch below).
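
A sketch of the lazy-insert pattern (hypothetical names; prom-client's real internals differ in detail):

```js
// Reuses the `zeroes` prototype idea from the sketch above.
const zeroes = { 0.005: 0, 0.01: 0, /* ...remaining buckets... */ 10: 0 };

// Look up the per-label-combination record once; insert it into the map only
// when first created, and hand the same reference back so the caller can
// update bucket counters and exemplars without a second lookup.
function getOrCreateValue(hashMap, hash, labels) {
  let valueFromMap = hashMap[hash];
  if (valueFromMap === undefined) {
    valueFromMap = {
      labels,
      bucketValues: Object.create(zeroes),
      sum: 0,
      count: 0,
    };
    hashMap[hash] = valueFromMap; // single insertion, not one per observe()
  }
  return valueFromMap;
}
```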

Notes:

  • Removed Object.freeze(this.bucketValues) because it causes += 1 to fail, even though the value in the prototype isn't actually changed. (This feels like a JS or V8 bug.)
  • Because the initial counter values live on a prototype, hasOwnProperty can't be used to check for a bucket's existence (see the sketch below).
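
For illustration, one way around the hasOwnProperty limitation is the `in` operator, which walks the prototype chain (a sketch, not necessarily the PR's approach):

```js
const zeroes = { 0.5: 0 };
const bucketValues = Object.create(zeroes);

console.log(Object.prototype.hasOwnProperty.call(bucketValues, 0.5)); // false: the zero is inherited
console.log(0.5 in bucketValues); // true: `in` checks the prototype chain too

bucketValues[0.5] += 1;
console.log(Object.prototype.hasOwnProperty.call(bucketValues, 0.5)); // true: now an own property
```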

shappir (Contributor, Author) commented Feb 11, 2024

This is the reason Object.freeze needs to be removed: https://github.com/tc39/how-we-work/blob/main/terminology.md#override-mistake
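
A minimal reproduction of the override mistake (illustrative, not code from the PR):

```js
'use strict';

const proto = Object.freeze({ count: 0 });
const obj = Object.create(proto);

// The write would only create an own property on `obj`, never touching the
// frozen prototype, yet the spec rejects it: TypeError in strict mode,
// a silent no-op in sloppy mode.
try {
  obj.count += 1;
} catch (err) {
  console.log(err instanceof TypeError); // true
}
console.log(obj.count); // still 0
```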

SimenB (Collaborator) left a comment

No immediate issues jump out when looking through this 👍 Do you have any numbers or graphs to confirm this helps things?

lib/histogram.js (review comment: outdated, resolved)
Co-authored-by: Simen Bekkhus <sbekkhus91@gmail.com>
shappir (Contributor, Author) commented Feb 12, 2024

Do you have any numbers or graphs to confirm this helps things?

No systematic results yet. I will try to get some.

shappir (Contributor, Author) commented Feb 14, 2024

My tests show a memory saving of only 5% 😢
Guess I shouldn't have called it significant...
It's borderline worth it; your call.
(I will add the test example to the repo if you want.)

zbjornson (Collaborator) commented

How many buckets in total, and how many with values, did you test to get the 5% savings?

shappir (Contributor, Author) commented Feb 15, 2024

@zbjornson @SimenB I tested one histogram with the default buckets: [0.005, 0.01, 0.025, 0.05, 0.1, 0.25, 0.5, 1, 2.5, 5, 10]
Tests:

  1. 10 labels with a random distribution, inserting 1000 random values
  2. 10 labels with a random distribution, inserting 10000 random values
  3. 5 labels with a random distribution, inserting 100000 random values

I got roughly the same results in all cases, peaking at a saving of 5%.
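
A hedged reconstruction of what such a test could look like against prom-client's public API (the actual harness is not in the PR; the heap reading here is only indicative):

```js
const client = require('prom-client');

const h = new client.Histogram({
  name: 'memory_test',
  help: 'histogram memory benchmark',
  labelNames: ['label'],
  // buckets default to [0.005, 0.01, 0.025, 0.05, 0.1, 0.25, 0.5, 1, 2.5, 5, 10]
});

// e.g. 10 label values with a random distribution, 10000 random observations
for (let i = 0; i < 10000; i++) {
  const label = `l${Math.floor(Math.random() * 10)}`;
  h.observe({ label }, Math.random() * 10);
}

if (global.gc) global.gc(); // run with node --expose-gc for stabler numbers
console.log(`heapUsed: ${process.memoryUsage().heapUsed} bytes`);
```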

Generally speaking (in percentages):

  1. The lower the cardinality, the less benefit you'll get.
  2. There's a fixed overhead for the histogram itself, so the fewer values you have, the less benefit you'll get.
