Use Notify instead of InternalsGuard #164

Closed
wants to merge 10 commits

Conversation

@levkk (Contributor) commented May 27, 2023

Potential fix for #154

Problem

InternalsGuard is problematic because of the deadlock. A ReentrantLock is not an option because both put and drop hold a mutable reference to the internals. RefCell won't work either: making this approach work would require two or more simultaneous mutable borrows, which RefCell does not allow because that would violate Rust's safety guarantees.
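
For illustration only (not code from this PR), here's a minimal sketch of the RefCell constraint: the second simultaneous mutable borrow panics at runtime, so holding the internals mutably from both the put and drop paths can't be expressed this way.

```rust
use std::cell::RefCell;

fn main() {
    let internals = RefCell::new(Vec::<u32>::new());

    // First mutable borrow, e.g. held while a `put` is in progress.
    let _put_borrow = internals.borrow_mut();

    // A second mutable borrow, e.g. from the `drop` path, panics at runtime
    // with "already borrowed: BorrowMutError".
    let _drop_borrow = internals.borrow_mut();
}
```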

Possible Solution

Switch to Tokio's Notify, which provides a fair (FIFO) queue for waiters. The pool itself no longer guarantees that connections go to the tasks that have waited the longest, but this may be good enough as long as the Tokio scheduler and the kernel scheduler are fair as well. Starvation is still possible if a caller retries get in an infinite loop.

Additionally, the entire make_pooled function is now bounded by the connection_timeout setting. This ensures there is no internal starvation and that is_valid() is timed out as well; without that bound, a hung is_valid() could starve all tasks and block the caller indefinitely while waiting on the future.
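
As a rough sketch of the shape this takes (illustrative names only, not the PR's actual internals: a hypothetical pool with an available list, a Notify, and a connection_timeout):

```rust
use std::sync::Mutex;
use std::time::Duration;
use tokio::sync::Notify;
use tokio::time::timeout;

struct Conn;

struct Pool {
    available: Mutex<Vec<Conn>>, // idle connections
    notify: Notify,              // signaled when a connection is returned
    connection_timeout: Duration,
}

impl Pool {
    async fn get(&self) -> Result<Conn, &'static str> {
        // Bound the whole acquisition, including the wait, by connection_timeout.
        timeout(self.connection_timeout, self.get_inner())
            .await
            .map_err(|_| "timed out waiting for a connection")
    }

    async fn get_inner(&self) -> Conn {
        loop {
            if let Some(conn) = self.available.lock().unwrap().pop() {
                return conn;
            }
            // Wait until put() calls notify_one(); waiters are queued FIFO,
            // and a permit sent while nobody waits is stored for the next call.
            self.notify.notified().await;
        }
    }

    fn put(&self, conn: Conn) {
        self.available.lock().unwrap().push(conn);
        // Wake exactly one waiter rather than all of them.
        self.notify.notify_one();
    }
}
```

The loop re-checks the available list after every wake-up, so a spurious or stale notification just results in another wait rather than a hand-off to the wrong task.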

Open Questions

The error forwarding is not clear to me. I removed it, but I'm not entirely sure what it does. May need some help here to understand if I broke something.

@codecov (bot) commented May 27, 2023

Codecov Report

Patch coverage: 96.96%; project coverage change: +0.84% 🎉

Comparison is base (ea5c162) 72.12% compared to head (0dc8eda) 72.96%.

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #164      +/-   ##
==========================================
+ Coverage   72.12%   72.96%   +0.84%     
==========================================
  Files           6        6              
  Lines         599      603       +4     
==========================================
+ Hits          432      440       +8     
+ Misses        167      163       -4     
Impacted Files         Coverage Δ
bb8/src/inner.rs       87.38% <95.91%> (+1.06%) ⬆️
bb8/src/internals.rs   95.45% <100.00%> (+2.69%) ⬆️


This was referenced Jun 14, 2023
@dertin commented Jun 18, 2023

Has there been any progress with the tests? Is this PR blocked, waiting on some discussion or support?

@levkk (Contributor, Author) commented Jun 19, 2023

Not really, just waiting for djc to take a look.

@djc (Owner) commented Sep 26, 2023

Sorry for taking so long to look at this. So if I understand correctly, this fix will make the pool less fair? And also, more tasks have to do work if a number of them are waiting for a connection to become available, whereas previously they would just spend that time sleeping?

Because, while the previous design with waiters made sure that earlier requesters would be sent the freed-up connection, in this design all waiters are notified and then make another attempt at getting a connection?

@levkk (Contributor, Author) commented Sep 26, 2023

I don't think so. Notify is fair according to the docs and notify_one notifies only one task, not all of them.

nicolasvan added a commit to OneSignal/pgcat that referenced this pull request on Oct 5, 2023: "while waiting on djc/bb8#164 to ship"
@levkk (Contributor, Author) commented Jan 4, 2024

So if I understand correctly, this fix will make the pool less fair?

I took a look at the implementation of Notify. It uses a FIFO queue [1] [2] for tasks, so the first task to .await will be the first one to be woken up. I think that's good enough to maintain fairness, since the pool will never wake up more tasks than there are available connections. The only way I can think of for a task to skip the queue is a bug in the tokio scheduler.

[1] https://docs.rs/tokio/latest/src/tokio/sync/notify.rs.html#985
[2] https://docs.rs/tokio/latest/src/tokio/sync/notify.rs.html#732
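
For illustration, a small standalone sketch (not from this PR) of that FIFO behavior: three tasks register on a Notify in order and are woken in the same order by repeated notify_one calls. The current-thread runtime and the yield_now calls are only there to make the registration order deterministic.

```rust
use std::sync::Arc;
use tokio::sync::Notify;

#[tokio::main(flavor = "current_thread")]
async fn main() {
    let notify = Arc::new(Notify::new());
    let mut handles = Vec::new();

    // Spawn three waiters; each registers with notified().await in spawn order.
    for i in 0..3 {
        let notify = notify.clone();
        handles.push(tokio::spawn(async move {
            notify.notified().await;
            println!("waiter {i} woken");
        }));
        // Let the spawned task run up to its .await so it joins the queue.
        tokio::task::yield_now().await;
    }

    // Each notify_one wakes exactly one waiter, in the order they registered.
    for _ in 0..3 {
        notify.notify_one();
        tokio::task::yield_now().await;
    }

    for handle in handles {
        handle.await.unwrap();
    }
}
```

Expected output is waiter 0, then 1, then 2: the oldest waiter gets the wake-up first.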

@djc (Owner) commented Jan 26, 2024

I tried to simplify this a bit, please take a look at #186.
