About the performance overhead of attribute deduplication in recordingSpan#snapshot #5130

moonspirit · 2024-03-31T17:17:53Z

Problem Statement

I updated from 1.21 to 1.24, just found i can get noticeable performance improvement for setAttributes memory alloc, trace context inject/extract and other aspects.

But after profiling my rpc framework, i have some thoughts about deduplicating attributes in recordingSpan#snapshot.

To support Tail Sampling（sampling errors） , we have to sample all spans with RecordAndSample or RecordOnly, that means we need to store attributes for all spans, that makes recordingSpan#snapshot being a critical path.

here is a profiling frame graph which enables tail sampling and set sample fraction to 1/1024 (expect to be less overhead)

the profile show that The current cost of this part(attribute deduplication even all my attributes are unique, no duplications) is about the same as that of propagation.compositeTextMapPropagator.Extract.

I would expect this processing of snapshots can be optimized.

Proposed Solution

I would prefer to delay attributes deduplication when we decided to record and sample that span, that means we could delay the operation to SpanProcessor

Alternatives

Or provide an option not to deduplication attributes

dmathieu · 2024-04-02T09:28:17Z

Do you have the same flame graph running on 1.21 that would show the difference between both versions? (maybe with 1.22 too?)

moonspirit · 2024-04-03T12:46:37Z

Do you have the same flame graph running on 1.21 that would show the difference between both versions? (maybe with 1.22 too?)

Hi, dmathieu, Here is the different graph for the same benchmark

otel v1.21

otel v1.24

the code is here https://github.com/moonspirit/grpc-tracing-bench
(grpc has bad performance for metadata.FromIncomingContext or peer.Peer.Addr.String())

moonspirit added the enhancement New feature or request label Mar 31, 2024

This was referenced Apr 2, 2024

sdk/log: Drop duplicated KeyValues #5086

Closed

sdk/log: Add WithoutDedup option #5133

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About the performance overhead of attribute deduplication in recordingSpan#snapshot #5130

About the performance overhead of attribute deduplication in recordingSpan#snapshot #5130

moonspirit commented Mar 31, 2024 •

edited

dmathieu commented Apr 2, 2024

moonspirit commented Apr 3, 2024 •

edited

About the performance overhead of attribute deduplication in recordingSpan#snapshot #5130

About the performance overhead of attribute deduplication in recordingSpan#snapshot #5130

Comments

moonspirit commented Mar 31, 2024 • edited

Problem Statement

Proposed Solution

Alternatives

dmathieu commented Apr 2, 2024

moonspirit commented Apr 3, 2024 • edited

moonspirit commented Mar 31, 2024 •

edited

moonspirit commented Apr 3, 2024 •

edited