
src/histogram: Make Histogram::observe atomic across collects #314

Merged
merged 14 commits on Jul 14, 2020

Conversation

mxinden
Contributor

@mxinden mxinden commented Apr 5, 2020

Motivation

A histogram supports two main execution paths:

  1. observe which increases the overall observation counter, updates the observation sum and increases a single bucket counter.

  2. collect which snapshots the state of the histogram and exposes it as a Protobuf struct.

If an observe and a collect operation interleave, the latter could expose a snapshot of the histogram that does not uphold all histogram invariants. Take, for example, the invariant that the overall observation counter should equal the sum of all bucket counters: say that an observe operation increases the overall counter, but before it updates the corresponding bucket counter, a collect operation snapshots the histogram.

The above race condition has been solved in the Golang Prometheus client with prometheus/client_golang#457 by introducing the notion of shards, one hot shard for observe operations to record their observation and one cold shard for collect operations to collect a consistent snapshot of the histogram.

observe operations hit the hot shard and record their observation. Collect operations switch hot and cold, wait for all observe calls on the previously hot, now cold, shard to finish, and then expose the consistent snapshot.
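
For illustration, here is a minimal Rust sketch of the hot/cold shard idea, modeled on the client_golang approach; the type and field names (`Shard`, `Core`, `shard_and_count`) are hypothetical and not the actual code in this pull request:

```rust
use std::sync::atomic::{AtomicU64, Ordering};
use std::thread;

// One shard holds a complete set of histogram state.
struct Shard {
    finished: AtomicU64,     // observations fully recorded in this shard
    buckets: Vec<AtomicU64>, // (sum elided for brevity)
}

struct Core {
    // Most significant bit: index of the hot shard.
    // Lower 63 bits: number of `observe` calls started so far.
    shard_and_count: AtomicU64,
    shards: [Shard; 2],
}

impl Core {
    fn observe(&self, bucket: usize) {
        // Register the observation and learn which shard is currently hot.
        let state = self.shard_and_count.fetch_add(1, Ordering::Acquire);
        let hot = &self.shards[(state >> 63) as usize];

        hot.buckets[bucket].fetch_add(1, Ordering::Relaxed);
        // Signal that this observation is now fully recorded.
        hot.finished.fetch_add(1, Ordering::Release);
    }

    // Called with an external Mutex held so collects run sequentially.
    fn collect(&self) {
        // Flip the hot bit; the previous value tells us which shard just
        // became cold and how many observations were started overall.
        let state = self
            .shard_and_count
            .fetch_xor(1u64 << 63, Ordering::AcqRel);
        let cold = &self.shards[(state >> 63) as usize];
        let started = state & ((1u64 << 63) - 1);

        // Wait for in-flight `observe` calls on the cold shard to finish.
        while cold.finished.load(Ordering::Acquire) < started {
            thread::yield_now();
        }

        // ... snapshot the cold shard, then add its counts into the hot
        // shard and reset the cold shard to zero for the next cycle.
    }
}
```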

This pull request ports prometheus/client_golang#457 to the Rust Prometheus client.

Content of the pull request

The pull request contains three commits:

  • src/histogram: Add test ensuring Histogram::observe is atomic

    Showcasing the above race condition in the current implementation.

  • src/{histogram,atomic64}: Make Histogram::observe atomic across collects

    Porting "Lock-free atomic observations in Histograms!" (prometheus/client_golang#457) to fix the race condition.

  • benches/histogram: Add benchmark for concurrent observe and collect

    Adding a benchmark to show the impact of the patch. While the benchmark does not show a performance impact from the patch on my laptop, I am happy to test this more thoroughly on a larger machine (128 cores) in case there is general interest in accepting this patch.

Greater picture

Fixing this race condition is especially attractive now that, with Prometheus v2.17.0, the isolation level has been increased (see the changelog entry below and prometheus/prometheus#6841).

This release implements isolation in TSDB. API queries and recording rules are
guaranteed to only see full scrapes and full recording rules.

Trade-off

While this pull request fixes the race condition described above, it does increase complexity:

  • Introduction of the notion of shards.

  • The observe code path is mostly untouched, apart from one additional atomic operation and stricter Ordering levels, and thus stays lock-free.

  • collect operations need to happen sequentially (enforced through a single Mutex). A single collect and multiple observe operations can still operate concurrently. Given that collect operations should happen rarely (at intervals greater than 1 s), this should not introduce a performance impact.

  • In order to coordinate hot and cold shards, the 64-bit histogram counter is split into a 1-bit shard index and a 63-bit counter. Thus the number of observations a histogram can record is halved. While this might sound like an issue, one could still record one observation per millisecond for 292_277_266 years (see the sketch below).
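
For illustration, a tiny sketch of that split and the remaining headroom; the constant and function names are made up for this example:

```rust
/// Most significant bit selects the hot shard; the rest counts observations.
const SHARD_BIT: u64 = 1 << 63;
const COUNT_MASK: u64 = SHARD_BIT - 1;

/// Splits a packed state word into (shard index, observation count).
fn unpack(state: u64) -> (usize, u64) {
    ((state >> 63) as usize, state & COUNT_MASK)
}

fn main() {
    assert_eq!(unpack(SHARD_BIT | 42), (1, 42));

    // Headroom: 2^63 - 1 observations at one observation per millisecond.
    let years = COUNT_MASK as f64 / 1000.0 / 60.0 / 60.0 / 24.0 / 365.25;
    println!("roughly {:.0} million years", years / 1e6); // ~292 million
}
```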


I hope the above description makes sense. Let me know if this is something you are willing to accept into master.

Thanks a bunch for maintaining this library!

@mxinden mxinden changed the title src/{histogram,atomic64}: Make Histogram::observe atomic across collects src/histogram: Make Histogram::observe atomic across collects Apr 5, 2020
If an observe and a collect operation interleave, the latter should not
expose a snapshot of the histogram that does not uphold all histogram
invariants. For example for the invariant that the overall observation
counter should equal the sum of all bucket counters: Say that an
`observe` increases the overall counter but before updating a specific
bucket counter a collect operation snapshots the histogram.

This commit adds a basic unit test ensuring that the above does not
happen.

Signed-off-by: Max Inden <mail@max-inden.de>
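
A hedged sketch of what such a test could look like against the public API; the test name, thread count, and iteration counts are illustrative and not the exact test added by this commit:

```rust
use std::thread;

use prometheus::core::Metric;
use prometheus::{Histogram, HistogramOpts};

#[test]
fn observe_is_atomic_across_collects() {
    let histogram =
        Histogram::with_opts(HistogramOpts::new("test_histogram", "Test help.")).unwrap();

    // Background observers recording values that fall into a finite bucket.
    let observers: Vec<_> = (0..4)
        .map(|_| {
            let h = histogram.clone();
            thread::spawn(move || {
                for _ in 0..100_000 {
                    h.observe(1.0);
                }
            })
        })
        .collect();

    // Concurrent collects: every snapshot must uphold the invariant that
    // the overall sample count equals the cumulative count of the largest
    // bucket (all observed values fit into a finite bucket here).
    for _ in 0..1_000 {
        let metric = histogram.metric();
        let proto = metric.get_histogram();
        let last_bucket = proto
            .get_bucket()
            .last()
            .map(|b| b.get_cumulative_count())
            .unwrap_or(0);
        assert_eq!(proto.get_sample_count(), last_bucket);
    }

    for observer in observers {
        observer.join().unwrap();
    }
}
```
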
A histogram supports two main execution paths:

1. `observe` which increases the overall observation counter, updates
the observation sum and increases a single bucket counter.

2. `proto` (i.e. collecting the metric, from now on referred to as the
collect operation) which snapshots the state of the histogram and
exposes it as a Protobuf struct.

If an observe and a collect operation interleave, the latter could be
exposing a snapshot of the histogram that does not uphold all histogram
invariants. For example for the invariant that the overall observation
counter should equal the sum of all bucket counters: Say that an
`observe` increases the overall counter but before updating a specific
bucket counter a collect operation snapshots the histogram.

This commit adjusts the `HistogramCore` implementation to make such
race conditions impossible. It introduces the notion of shards, one hot
shard for `observe` operations to record their observation and one cold
shard for collect operations to collect a consistent snapshot of the
histogram.

`observe` operations hit the hot shard and record their observation.
Collect operations switch hot and cold, wait for all `observe` calls to
finish on the previously hot now cold shard and then expose the
consistent snapshot.

Signed-off-by: Max Inden <mail@max-inden.de>
Add a basic benchmark test which spawns 4 threads in the background
continuously calling `observe` 1_000 times and then `collect`. At the
same time call `observe` within the `Bencher::iter` closure to measure
the impact of the background threads on `observe` calls.

Signed-off-by: Max Inden <mail@max-inden.de>
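
A rough sketch of such a benchmark using the nightly-only libtest bench harness; the function name and counts are illustrative, not the exact benchmark added here:

```rust
#![feature(test)]
extern crate test;

use std::sync::atomic::{AtomicBool, Ordering};
use std::sync::Arc;
use std::thread;

use prometheus::core::Metric;
use prometheus::{Histogram, HistogramOpts};
use test::Bencher;

#[bench]
fn observe_with_concurrent_observes_and_collects(b: &mut Bencher) {
    let histogram =
        Histogram::with_opts(HistogramOpts::new("test_histogram", "Test help.")).unwrap();
    let stop = Arc::new(AtomicBool::new(false));

    // Background threads: observe in bursts of 1_000, then collect once.
    let handles: Vec<_> = (0..4)
        .map(|_| {
            let h = histogram.clone();
            let stop = stop.clone();
            thread::spawn(move || {
                while !stop.load(Ordering::Relaxed) {
                    for _ in 0..1_000 {
                        h.observe(1.0);
                    }
                    h.metric();
                }
            })
        })
        .collect();

    // Foreground: measure the latency of a single `observe` call.
    b.iter(|| histogram.observe(1.0));

    stop.store(true, Ordering::Relaxed);
    for handle in handles {
        handle.join().unwrap();
    }
}
```
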
Signed-off-by: Max Inden <mail@max-inden.de>
Signed-off-by: Max Inden <mail@max-inden.de>
@lucab
Member

lucab commented Apr 22, 2020

@BusyJay @breeswish can you have a look at this PR?

I was chatting with @mxinden trying to figure out a better approach instead of Mutex<()>, but we ran out of smarter alternatives. Do you have any feedback on this? Or perhaps is this fine?

@breezewish
Member

Sorry I missed this PR .. 🧐 @BusyJay Do you have suggestions about the Mutex in this PR?

@BusyJay
Member

BusyJay commented May 11, 2020

No, the mutex seems simple and reasonable.

Rust's drop semantics can be confusing sometimes. E.g. `let _ = l.lock()`
would drop the lock guard immediately, whereas `let _guard = l.lock()`
would drop the guard in LIFO order at the end of the current scope.

Instead of relying on the above guarantee with `let _guard`, drop the
mutex guard explicitly, hopefully making this less error-prone in the
future.

Signed-off-by: Max Inden <mail@max-inden.de>
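
For illustration, a small self-contained example of that difference in drop behavior (not code from this pull request):

```rust
use std::sync::Mutex;

fn main() {
    let l = Mutex::new(());

    {
        // `_` is not a binding: the guard is dropped immediately, so the
        // mutex is already unlocked again on the next line.
        let _ = l.lock().unwrap();
        assert!(l.try_lock().is_ok());
    }

    {
        // `_guard` is a real binding: the mutex stays locked until the
        // guard is dropped at the end of this block.
        let _guard = l.lock().unwrap();
        assert!(l.try_lock().is_err());
    }

    // Dropping the guard explicitly makes the unlock point obvious.
    let guard = l.lock().unwrap();
    drop(guard);
    assert!(l.try_lock().is_ok());
}
```
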
Signed-off-by: Max Inden <mail@max-inden.de>
@mxinden
Contributor Author

mxinden commented Jun 22, 2020

@lucab @breeswish @BusyJay any objections regarding this pull request? If not, would one of you mind approving it?

@mxinden
Contributor Author

mxinden commented Jul 14, 2020

Thanks for the review @lucab. Would you mind taking another look?

@lucab lucab merged commit 4fdff69 into tikv:master Jul 14, 2020