Collect stack frames immediately in Ruby 3.0 #150

jhawthorn · 2021-04-28T20:56:35Z

As of ruby/ruby@0e276dc, which shipped in Ruby 3.0, it seems to be safe to collect stack frames inside the signal handler. This should allow more accurate results than waiting for the postponed job to run since that can only measure when interrupts are checked.

This new behaviour is wrapped inside a preprocessor check for Ruby 3+

Additionally, this moves the "in signal handler" checks up a level, and uses pthread_mutex_trylock, which should be more reliable and I believe will fix the issue described in #123 and #124

@tenderlove and I had some discussion about whether this could have issues due to the writes in ruby/ruby@0e276dc being reordered.

One concern is whether there could be issues with hardware memory reordering (particularly on arm). I believe this is safe only because we are only considering the stack from our current thread. These writes will appear consistent in our interrupt handler because of this.

Another concern is that the compiler could reorder the writes in ruby/ruby@0e276dc. It doesn't seem to be, but that could absolutely happen. I think we should investigate making a change to Ruby to ensure the writes in vm_push_frame/vm_pop_frame aren't reordered.

Because this is in the postponed job handler, we should never be able to reenter this code. I also believe this has a race condition if another signal arrives between the if statement and the increment. Signals can also arrive in any thread so this would not be a safe way to avoid reentrancy.

This entures we won't re-enter the signal handler from another signal whether or not it happens in our own thread.

Co-authored-by: Aaron Patterson <aaron.patterson@gmail.com>

eregon · 2022-08-22T11:03:53Z

One concern is whether there could be issues with hardware memory reordering (particularly on arm). I believe this is safe only because we are only considering the stack from our current thread. These writes will appear consistent in our interrupt handler because of this.

I think being in a signal handler gives up on the the "golden rule of multithreading": "a single thread should see everything it does as if it was done in that order".
My understanding is a signal handler is basically considered concurrency/like another thread in C.

AFAIK the signal could trigger anywhere, e.g. between two writes on vm_push_frame (even if they are in order, a big if) and that could cause issues, isn't it?

ivoanjo · 2022-08-22T11:16:46Z

AFAIK the signal could trigger anywhere, e.g. between two writes on vm_push_frame (even if they are in order, a big if) and that could cause issues, isn't it?

+1 I planned experimenting with this and raising it at some point -- the change in ruby/ruby@0e276dc did reorder things as far as the C source goes, but as far as I see it there really doesn't seem to be anything guaranteeing that the compiler won't reorder the write to ec->cfp with the actual initialization of the structure.

So... yeah this doesn't seem particularly safe at this moment.

(But it would be great if rb_profile_frames could indeed be made async-safe!)

mame · 2022-11-16T10:18:13Z

As @eregon and @ivoanjo have said, I think there is a theoretical concern with this change.

However, it may not lead to a visible error on major architectures and C compilers. Even in a potentially reproducible environment, it will be very unlikely to be visible unless the signal handler is called at an extremely unlucky timing.

I think it is a possible option to ignore such a theoretical concern and take the practical benefits, but it's a difficult decision. I don't interfere with your decisions in stackprof, but if it were the interpreter itself, I would like to be very cautious.

jhawthorn and others added 3 commits April 28, 2021 13:23

Use a mutex to avoid reentering signal handler

cf67aff

This entures we won't re-enter the signal handler from another signal whether or not it happens in our own thread.

Collect stack frames immediately on interrupt

6c3f4d7

Co-authored-by: Aaron Patterson <aaron.patterson@gmail.com>

tenderlove merged commit 2a23159 into tmm1:master Apr 29, 2021

casperisfine mentioned this pull request Jul 8, 2021

Stop checking for Process.pid and trust the fork decorators rails/rails#41850

Closed

isobit mentioned this pull request Aug 23, 2021

Program hangs if sample-collection time is > interval #123

Open

3 tasks

jhawthorn mentioned this pull request Feb 16, 2022

Use more accurate profiling on Ruby 3.1 and fix async-signal-safety #172

Merged

This was referenced Jul 12, 2022

Incompatible with YJIT 3.2.0 #179

Closed

Use postponed jobs if YJIT is enabled. #180

Merged

eregon mentioned this pull request Aug 22, 2022

Problem with Ruby 2.7.6 and version 0.2.20 #182

Closed

byroot mentioned this pull request Feb 2, 2023

Automatic Fork Safety excon/excon#814

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Collect stack frames immediately in Ruby 3.0 #150

Collect stack frames immediately in Ruby 3.0 #150

jhawthorn commented Apr 28, 2021 •

edited

eregon commented Aug 22, 2022 •

edited

ivoanjo commented Aug 22, 2022

mame commented Nov 16, 2022

Collect stack frames immediately in Ruby 3.0 #150

Collect stack frames immediately in Ruby 3.0 #150

Conversation

jhawthorn commented Apr 28, 2021 • edited

eregon commented Aug 22, 2022 • edited

ivoanjo commented Aug 22, 2022

mame commented Nov 16, 2022

jhawthorn commented Apr 28, 2021 •

edited

eregon commented Aug 22, 2022 •

edited