Fix Trace Cache Iteration Crash #692
Conversation
🦙 MegaLinter status: ❌ ERROR
See error details in the MegaLinter reports artifact on the CI job page.
I apologize that I didn't see this earlier. I had this one on my to-do list, but I didn't realize you'd already opened a PR for it until just now.

I don't believe this fix will work, since the crash occurred during the list() cast prior to the loop running. If the list casting didn't fix it, I don't believe a copy will fix it either. My comment on the issue was a bit rushed and not thought through: this type of fix would only work if the code inside the loop were modifying the _cache (and the list casting would also have fixed that kind of issue). Since the crash is still happening, I don't think that's what's going on here, nor do I see any code inside the loop that modifies the size of the _cache. I think the _cache is being modified in a separate thread, so the two threads are accessing it simultaneously.

I believe we need to add a lock here. I've opened a draft PR that does that: https://github.com/newrelic/newrelic-python-agent/pull/702/files. We can either open that draft PR, or modify this fix to match what's in that draft.
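To illustrate the failure mode being described, here is a minimal sketch (not the agent's actual code; `cache` and `locked_snapshot` are hypothetical names) showing that growing a dict while a live iterator exists raises `RuntimeError`, and the shape of the lock-based approach from the draft PR:

```python
import threading

cache = {i: f"trace-{i}" for i in range(3)}
lock = threading.Lock()

def unsafe_iter():
    # Simulate the race in a single thread: insert a new key while a
    # live iterator over the same dict is being advanced.
    for key in cache:
        cache[99] = "inserted"  # grows the dict mid-iteration

try:
    unsafe_iter()
    raised = False
except RuntimeError:  # "dictionary changed size during iteration"
    raised = True

def locked_snapshot():
    # Sketch of the lock-based approach: if writers also take this
    # lock, the dict cannot change size while we iterate it here.
    with lock:
        return list(cache.items())
```

With the lock held by both readers and writers, the snapshot is taken while any concurrent mutation is blocked, at the cost of serializing access to the cache.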
The issue with list() is that it isn't a cast: it iterates dict.items() into a list, then iterates that list. The crash has a smaller window in which to happen, but it can still occur if the change lands during the short time the list constructor is iterating over dict.items() (rather than during the longer-running for loop). That being said, I'm not opposed to using locking, but I wonder whether it causes performance issues, and it's certainly harder to maintain. I think the simpler solution may be more appropriate, considering this only affects event loop wait time and debug logic, as opposed to potentially impacting every access to the trace cache.
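A sketch of the narrowed-window argument above (`snapshot_then_iterate` and its arguments are illustrative, not code from the agent): once the list snapshot exists, the long-running loop iterates private data, so only the single list() call can still race with a writer.

```python
def snapshot_then_iterate(cache):
    # list() still iterates the live dict, so a concurrent resize can
    # only raise during this one call -- a much shorter window than a
    # long-running for loop over the dict itself.
    items = list(cache.items())
    seen = []
    for key, _trace in items:
        # Once the snapshot exists, mutating the original dict is
        # harmless to this loop.
        cache["concurrent_key"] = object()
        seen.append(key)
    return seen
```

For example, `snapshot_then_iterate({1: "a", 2: "b"})` returns `[1, 2]` even though the dict is mutated on every loop pass.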
Superseded by #704
Before contributing, please read our contributing guidelines and code of conduct.
Overview
Describe the changes present in the pull request
Related GitHub Issue
Include a link to the related GitHub issue, if applicable
Testing
The agent includes a suite of tests which should be used to
verify your changes don't break existing functionality. These tests will run with
GitHub Actions when a pull request is made. More details on running the tests locally can be found
here.
For most contributions it is strongly recommended to add additional tests which
exercise your changes.