Even accept #1920
Conversation
Perform non-blocking socket poll to determine if work is immediately available, to evenly balance work across multiple processes before balancing across multiple threads within a single process, which is less efficient due to Ruby's Global VM Lock.
This makes the accept wait time dynamic under load, so that less loaded servers wait less. That should even out the delay experienced by a new client connecting to a highly loaded cluster. This commit also applies the wait only when running in worker mode with more than one worker.
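The non-blocking poll described above can be sketched with `IO.select` and a zero timeout. This is an illustrative sketch, not Puma's actual implementation; `work_available?` and `accept_wait` are hypothetical helper names, and `base_delay` is an assumed placeholder constant:

```ruby
require "socket"

# Non-blocking poll: IO.select with a 0 timeout returns immediately,
# telling us whether a connection is already waiting to be accepted.
def work_available?(socket)
  !!IO.select([socket], nil, nil, 0)
end

# Load-proportional wait: busier workers pause longer before competing
# for the next connection, so lightly loaded workers accept first.
def accept_wait(busy_threads, max_threads, base_delay = 0.005)
  (busy_threads.to_f / max_threads) * base_delay
end
```

A worker would call `work_available?` before blocking on `accept`; when nothing is pending, it sleeps for `accept_wait` before re-checking, letting less loaded processes win the race for new connections.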
So, next steps here will be to find a benchmark and a benchmarking app that reproduces the original case described in #1646. I tried (#1646 (comment)) but couldn't find anything yet.
I think that the way a typical load-generating tool (such as wrk) generates load means that benchmarking this PR is a little more difficult than usual. I think the benchmarking app I laid out previously will work, but I need to work on the load generation to make it behave in a more production-like way.
I’ve been trying this with Siege as well (which emits requests after a set delay rather than after the last request has been responded to) and am still not seeing a result. The next step will be for me to reproduce using TechEmpower, which is the only benchmark we can claim has improved so far.
I opened issue #2078, which seems to describe almost the same problem as this one. However, that issue is concerned with busy workers rather than idle ones. @nateberkopec @dentarg What do you think about the other issue and the proposed solution (not a sleep in that form, but something similar that looks at the thread pool), injecting the latency earlier? I would make this change non-behavior-changing (disabled by default), so it could then be enabled to improve performance. I tested the changes introduced by this branch: https://docs.google.com/spreadsheets/d/1y2YqrPPgZ-RtjKiCGplJ7YkwjMXkZx6vEPq7prFbUn4/edit#gid=0.
This PR seems to have different behavior from #1646, due to a conflicting change introduced by #1648 (see comment). The result is that this PR basically has no effect, which seems consistent with the benchmark tests above.
Compare to master...wjordan:out_of_band (the branch we have been running in production for the past year), which combines the behavior of the two PRs properly by moving the `out_of_band` logic into `wait_until_not_full`.
```ruby
    (busy_threads.to_f / @max) * WORK_AVAILABLE_TIMEOUT_FACTOR
  end
end
```
```ruby
return busy_threads if @max > busy_threads
```
This line was added by #1648, and makes the behavior of this PR quite different from the one in #1646 (which is based on a commit prior to #1648). Here, the call to `@not_full.wait` will never be reached unless `busy_threads >= @max` (i.e., the thread pool is maxed out with requests), which would be an extremely rare case. I believe this is the reason this PR has no observable effect in benchmarks.
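The control flow being described can be reduced to a minimal sketch (not Puma's actual `ThreadPool`; `PoolSketch` and `waited` are illustrative names): the early return added by #1648 means the wait branch in `wait_until_not_full` is only reachable once `busy_threads >= @max`.

```ruby
# Minimal sketch of the early-return issue: the wait branch is skipped
# whenever the pool has any spare capacity.
class PoolSketch
  attr_reader :waited

  def initialize(max, busy)
    @max = max
    @busy = busy
    @waited = false
  end

  def wait_until_not_full
    # Early return from #1648: taken unless the pool is completely full.
    return @busy if @max > @busy
    # Puma would block on @not_full.wait here; we only record that the
    # wait branch was actually reached.
    @waited = true
    @busy
  end
end
```

With 5 of 16 threads busy the wait branch is never reached; only a fully saturated pool (16 of 16) gets there, which matches the "extremely rare case" described above.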
Re-ran the TechEmpower benchmark to confirm this PR is currently not working, see #2079 (comment).
Attempts to spread out work more appropriately. More loaded workers will listen less.
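The load-proportional timeout from the diff can be illustrated numerically. This is a sketch using the formula shown above; the real value of `WORK_AVAILABLE_TIMEOUT_FACTOR` lives in Puma, and `0.1` here is an assumed placeholder:

```ruby
# Assumed placeholder value for illustration only.
WORK_AVAILABLE_TIMEOUT_FACTOR = 0.1

# The formula from the diff: wait time scales linearly with load.
def work_available_timeout(busy_threads, max)
  (busy_threads.to_f / max) * WORK_AVAILABLE_TIMEOUT_FACTOR
end
```

A worker with 2 of 16 threads busy waits far less than one with 14 of 16 busy, so lightly loaded workers win the race to accept new connections, which is the evening-out effect this PR aims for.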