Cleanup tenant data in ingester #3782

pstibrany · 2021-02-03T10:54:53Z

What this PR does: This PR does additional cleanup of in-memory data when tenant's TSDB is closed in ingester. Specifically:

We now remove tenant's metadata still in memory, and update metadata-related metrics
We remove all validation metrics for removed tenant.

Checklist

Tests updated
Documentation added
CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

CHANGELOG.md

pkg/ingester/metrics.go

pkg/util/validation/validate.go

- Remove metadata - Metadata metrics - Validation metrics Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>

Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>

pracucci

You did great! Good job 👏

pkg/util/metrics_helper.go

pkg/util/validation/validate.go

Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>

nicolai86 · 2021-02-22T20:41:33Z

pkg/util/metrics_helper.go

+			continue
+		}
+
+		lbls := labels.NewBuilder(nil)


what about re-using the same *labels.Builder here and calling lbls.Reset(nil) at the end of the loop, like this:

lbls := labels.NewBuilder(nil) nextMetric: for m := range ch { // ... result = append(result, lbls.Labels()) lbls.Reset(nil) }

This improves the overall runtime by GetLabels by ~24% for small label sets and ~50% for large label sets because it avoids allocating in a loop.

Compare these two benchmarks:

A

func BenchmarkGetLabels_SmallSet(b *testing.B) { m := prometheus.NewCounterVec(prometheus.CounterOpts{ Name: "test", ConstLabels: map[string]string{ "cluster": "abc", }, }, []string{"reason", "user"}) m.WithLabelValues("bad", "user1").Inc() m.WithLabelValues("worse", "user1").Inc() m.WithLabelValues("worst", "user1").Inc() m.WithLabelValues("bad", "user2").Inc() m.WithLabelValues("worst", "user2").Inc() m.WithLabelValues("worst", "user3").Inc() for i := 0; i < b.N; i++ { GetLabels(m, map[string]string{"user": "user1", "reason": "worse"}) } }

BEFORE pkg: github.com/cortexproject/cortex/pkg/util BenchmarkGetLabels_SmallSet BenchmarkGetLabels_SmallSet-8 269169 3818 ns/op PASS AFTER pkg: github.com/cortexproject/cortex/pkg/util BenchmarkGetLabels_SmallSet BenchmarkGetLabels_SmallSet-8 445689 2639 ns/op PASS

B

func BenchmarkGetLabels_LargeSet(b *testing.B) { m := prometheus.NewCounterVec(prometheus.CounterOpts{ Name: "test", ConstLabels: map[string]string{ "cluster": "abc", }, }, []string{"reason", "user"}) for i := 1; i <= 1000; i++ { m.WithLabelValues("bad", fmt.Sprintf("user%d", i)).Inc() m.WithLabelValues("worse", fmt.Sprintf("user%d", i)).Inc() m.WithLabelValues("worst", fmt.Sprintf("user%d", i)).Inc() if i % 2 == 0 { m.WithLabelValues("bad", fmt.Sprintf("user%d", i)).Inc() m.WithLabelValues("worst", fmt.Sprintf("user%d", i)).Inc() } else { m.WithLabelValues("worst", fmt.Sprintf("user%d", i)).Inc() } } for i := 0; i < b.N; i++ { GetLabels(m, map[string]string{"user": "user1", "reason": "worse"}) } }

BEFORE pkg: github.com/cortexproject/cortex/pkg/util BenchmarkGetLabels_LargeSet BenchmarkGetLabels_LargeSet-8 710 1698894 ns/op PASS AFTER pkg: github.com/cortexproject/cortex/pkg/util BenchmarkGetLabels_LargeSet BenchmarkGetLabels_LargeSet-8 1431 876509 ns/op PASS

This is not performance-critical code, since it's called pretty rarely. But low-hanging optimization like this looks good. Would you like to send PR? (I think we need to call Reset at the beginning of the loop, because not all iterations get to the end. Interestingly unit tests don't catch this problem.)

I'll post a PR for this today. Thanks!

posted #3863 which just contains the 2 line change I mentioned, plus the benchmarks to show the diff.

pull-request-size bot added the size/L label Feb 3, 2021

pstibrany requested a review from pracucci February 3, 2021 10:56

pracucci reviewed Feb 3, 2021

View reviewed changes

beorn7 mentioned this pull request Feb 4, 2021

Enable deletion of metrics from a vector based on partially specified labels prometheus/client_golang#834

Closed

pstibrany added 7 commits February 5, 2021 10:12

Cleanup tenant data in ingester

997b3e3

- Remove metadata - Metadata metrics - Validation metrics Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>

CHANGELOG.md

8b861ec

Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>

Improve CHANGELOG entry.

5c5f9f1

Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>

Make lint happy.

066b876

Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>

Added function to gete all available labels from collector.

ea336ea

Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>

Clean all discarded samples and metadata for user.

2d3fa64

Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>

Review feedback.

b9f052d

Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>

pracucci approved these changes Feb 5, 2021

View reviewed changes

pkg/util/metrics_helper.go Show resolved Hide resolved

pkg/util/validation/validate.go Outdated Show resolved Hide resolved

pstibrany added 2 commits February 5, 2021 11:32

Fix check.

35ce54b

Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>

Review feedback.

026db8e

Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>

pstibrany merged commit 3aa0dba into cortexproject:master Feb 5, 2021

nicolai86 reviewed Feb 22, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cleanup tenant data in ingester #3782

Cleanup tenant data in ingester #3782

pstibrany commented Feb 3, 2021 •

edited

pracucci left a comment

nicolai86 Feb 22, 2021

pstibrany Feb 23, 2021 •

edited

nicolai86 Feb 23, 2021

nicolai86 Feb 23, 2021

Cleanup tenant data in ingester #3782

Cleanup tenant data in ingester #3782

Conversation

pstibrany commented Feb 3, 2021 • edited

pracucci left a comment

Choose a reason for hiding this comment

nicolai86 Feb 22, 2021

Choose a reason for hiding this comment

pstibrany Feb 23, 2021 • edited

Choose a reason for hiding this comment

nicolai86 Feb 23, 2021

Choose a reason for hiding this comment

nicolai86 Feb 23, 2021

Choose a reason for hiding this comment

pstibrany commented Feb 3, 2021 •

edited

pstibrany Feb 23, 2021 •

edited