Use a worker-specific random source to remove lock contention. #178

Merged
2 commits merged into DataDog:master on Jan 18, 2021

Conversation

matthewdale (Contributor)

What does this PR do?

Use a separate random number source per worker. Currently, the worker method shouldSample() calls rand.Float64(), which takes a lock to serialize all calls to the package-global pseudorandom number generator, creating lock contention among all worker instances.

Changes:

  • Create and seed a separate pseudorandom number generator for every worker, removing the lock contention described in "Use of rand.Float64() reduces parallelism" #80 (see the sketch after this list)
  • Add a test for the shouldSample() function to assert that it returns true the expected percentage of the time
  • Add a benchmark for the shouldSample() function to measure improvement from removing lock contention
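
The heart of the change can be sketched as below. This is a minimal illustration rather than the exact code in statsd/worker.go: the newWorker constructor and the time-based seed are assumptions for this sketch, while the random field and the shouldSample logic mirror the diff shown later in this thread.

package statsd

import (
	"math/rand"
	"time"
)

// worker is a trimmed-down stand-in for the real worker struct in
// statsd/worker.go; only the field relevant to sampling is shown.
type worker struct {
	random *rand.Rand
}

// newWorker seeds a dedicated PRNG for this worker so that shouldSample
// never touches the lock-protected package-global source in math/rand.
// (Seeding with the current time is an assumption for this sketch.)
func newWorker() *worker {
	return &worker{
		random: rand.New(rand.NewSource(time.Now().UnixNano())),
	}
}

// shouldSample reports whether a metric with the given sample rate
// should be processed rather than dropped.
func (w *worker) shouldSample(rate float64) bool {
	if rate < 1 && w.random.Float64() > rate {
		return false
	}
	return true
}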

Benchmarks against master

name            old time/op  new time/op  delta
ShouldSample     444ns ± 0%   190ns ± 1%  -57.26%  (p=0.000 n=8+9)
ShouldSample-2   876ns ± 2%    94ns ± 0%  -89.27%  (p=0.000 n=10+9)
ShouldSample-4  1.17µs ± 1%  0.09µs ± 2%  -92.59%  (p=0.000 n=8+8)
ShouldSample-8  1.25µs ± 3%  0.09µs ± 4%  -92.77%  (p=0.000 n=9+10)
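
The ShouldSample-2/-4/-8 rows correspond to running the benchmark with different -cpu values. The real benchmark lives in statsd/worker_test.go and may be organized differently; a minimal parallel benchmark in this spirit, reusing the hypothetical newWorker helper from the sketch above, could look like:

package statsd

import "testing"

// Run with: go test -bench=ShouldSample -cpu=1,2,4,8
func BenchmarkShouldSample(b *testing.B) {
	b.RunParallel(func(pb *testing.PB) {
		// One worker per benchmark goroutine, mirroring the
		// one-random-source-per-worker design, so the unsynchronized
		// *rand.Rand is never shared between goroutines.
		w := newWorker()
		for pb.Next() {
			w.shouldSample(0.5)
		}
	})
}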

hush-hush (Member) left a comment

Thanks for the PR, this looks great. I added a few nit-picks, but otherwise it looks ready to go.

(Resolved review threads on statsd/worker_test.go and statsd/worker.go.)
(A follow-up commit runs the TestShouldSample subtests in parallel.)
hush-hush (Member) left a comment

Thanks for the PR!

hush-hush merged commit f6b3e86 into DataDog:master on Jan 18, 2021
@@ -59,7 +71,7 @@ func (w *worker) processMetric(m metric) error {
 }

 func (w *worker) shouldSample(rate float64) bool {
-	if rate < 1 && rand.Float64() > rate {
+	if rate < 1 && w.random.Float64() > rate {

@matthewdale @hush-hush Shouldn't we have a local lock here? From the docs:

The default Source is safe for concurrent use by multiple goroutines, but Sources created by NewSource are not.

I think this is causing race conditions.

matthewdale (Contributor, Author) replied:

You're correct. When MutexMode is enabled, the call to worker.processMetric() is made from the same goroutine as the call to the top-level metric emit function (e.g. Count). I was previously under the impression that all calls to worker.processMetric() would be made from a single goroutine per worker, but that's not the case.

I'm working on a fix for the data race bug and will submit it when it's ready.
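
For context, one possible shape for such a fix is sketched below; the randomLock field and the lock-per-worker approach are assumptions here, not necessarily the follow-up change that was actually submitted.

package statsd

import (
	"math/rand"
	"sync"
)

type worker struct {
	random     *rand.Rand
	randomLock sync.Mutex // hypothetical: guards random against concurrent callers
	// ... other fields omitted
}

// shouldSample keeps a per-worker source, so workers still don't contend
// with each other, but serializes access within a single worker because
// sources created by rand.NewSource are not safe for concurrent use.
func (w *worker) shouldSample(rate float64) bool {
	if rate >= 1 {
		return true
	}
	w.randomLock.Lock()
	roll := w.random.Float64()
	w.randomLock.Unlock()
	return roll <= rate
}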
