
Refactor tokio_postgres::Connection implementation #1066

Open
wants to merge 10 commits into master
Conversation

50U10FCA7

Minor changes improving the tokio_postgres::Connection implementation:

  • makes Connection::stream polling independent from polling pending requests/responses;
  • Connection::poll_read now ensures previously written messages were flushed before sending the next ones;
  • covers private items with docs.

@50U10FCA7 50U10FCA7 marked this pull request as draft September 22, 2023 21:20
@50U10FCA7 50U10FCA7 marked this pull request as ready for review September 22, 2023 21:21
@sfackler
Owner

What is the purpose of these changes?

@50U10FCA7
Author

50U10FCA7 commented Sep 22, 2023

@sfackler Thanks for the quick reply.

These changes should allow processing something like this more efficiently:

let (client, connection) =
    tokio_postgres::connect(..).await?;

let rc_client = Rc::new(client);

// Spawned on a tokio::task::LocalSet, since Rc is !Send.
let client1 = Rc::clone(&rc_client);
tokio::task::spawn_local(async move {
    client1.query(..).await
});

let client2 = Rc::clone(&rc_client);
tokio::task::spawn_local(async move {
    client2.query(..).await
});

without blocking Connection polling on a specific request/response from the Client.

Also, the Connection now waits for Connection::stream's poll_flush to return Poll::Ready (to ensure the message was actually written) before sending the next one.
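
As a minimal illustration of the idea (not the actual PR code; it assumes the stream is a futures Sink, as Framed is, and runs inside the connection's poll function with the std/futures ready! macro in scope):

    // Before handing the next frontend message to the sink, make sure that
    // everything already queued has actually been written out to the socket.
    ready!(Pin::new(&mut self.stream).poll_flush(cx))?;
    ready!(Pin::new(&mut self.stream).poll_ready(cx))?;
    Pin::new(&mut self.stream).start_send(next_message)?;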

Private docs were updated to make the code a bit more readable.

P.S. All changes are minor and don't break the public API; only the internal implementation is changed.

@sfackler
Owner

without blocking Connection polling on a specific request/response from the Client.

What blocking is happening before this change?

Also, the Connection now waits for Connection::stream's poll_flush to return Poll::Ready (to ensure the message was actually written) before sending the next one.

In what context would a message not be written?

@50U10FCA7
Author

@sfackler

What blocking is happening before this change?

poll_read on the master branch can process only one response at a time, even if the socket has already received messages belonging to the next responses (roughly the loop sketched after this list):

  • receive a new BackendMessage;
  • try to send the received BackendMessage to the Client;
  • the receiver is not ready;
  • try to send it again on the next poll (or on a delayed message, which is always an AsyncMessage and causes the Connection to be polled again, because it yields a value).
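
In other words, something like this (a simplified sketch of how I read the current behavior; poll_next_message, response_sender_is_full, pending_message and send_to_client are placeholder names, not the real internals; errors and EOF are omitted):

    // One BackendMessage per poll of the socket: if the Client's receiver is
    // not ready, reading stops until the Connection is polled again.
    loop {
        let message = ready!(self.poll_next_message(cx));
        if self.response_sender_is_full() {
            // Stash the message and wait for the next poll.
            self.pending_message = Some(message);
            return Poll::Pending;
        }
        self.send_to_client(message);
    }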

But there can be a situation when the socket has already received a full response (all of its messages) plus some messages of another one. In this case we could try to send part of the second response within the current poll (see the sketch after this list):

  • receive all available BackendMessages;
  • try to send all messages of the pending responses within the current poll.
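
Again only as a rough sketch (same placeholder helpers as above, plus a hypothetical buffered queue; errors omitted):

    // Drain everything the socket already has without waiting, then forward
    // as much as the pending responses can accept within this same poll.
    while let Poll::Ready(message) = self.poll_next_message(cx) {
        self.buffered.push_back(message);
    }
    while let Some(message) = self.buffered.pop_front() {
        if self.response_sender_is_full() {
            // Put it back and stop; the rest stays buffered for later polls.
            self.buffered.push_front(message);
            break;
        }
        self.send_to_client(message);
    }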

I think this should be more efficient, because we don't have a guarantee that the runtime will poll the Connection again right after the current poll. And because we don't poll the socket again (we are waiting on a Client's receiver), Connection::poll_read will next be polled by the runtime only when:

  1. A new request arrives from the Client;
  2. The Client's receiver is dropped or becomes able to receive a response.

If we assume that a Response's receiver is not polled for a long time (for example because the caller is busy with more important things in futures::future::select(client.query(..), another_fut)) and another Response has already been received from the socket and is ready to be sent, then I think we should send it.
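
A hypothetical example of that situation (another_fut is just a placeholder for some unrelated, already-pinned future):

    let query_fut = client.query("SELECT 1", &[]);
    futures::pin_mut!(query_fut);
    match futures::future::select(query_fut, another_fut).await {
        futures::future::Either::Left((_rows, _another)) => {
            // The query finished first.
        }
        futures::future::Either::Right((_other, _query_fut)) => {
            // Something else finished first; the query future (and its
            // response receiver) may now go unpolled for a while, even
            // though the socket has already delivered the full response.
        }
    }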

In what context would a message not be written?

In the case when Framed cannot write a full frame to the socket (I think it's possible if the peer advertises a small TCP window and the whole frame cannot be written at once, but I'm not sure it happens in real-world scenarios).

P.S. I also found that both TcpStream and UnixStream support vectored writes, but tokio_postgres::Socket doesn't use them. Is there any reason for that?
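
By "use them" I mean forwarding the vectored-write methods to the inner stream, roughly like this (a hypothetical TCP-only wrapper just to illustrate; the real tokio_postgres::Socket is structured differently):

    use std::io::{self, IoSlice};
    use std::pin::Pin;
    use std::task::{Context, Poll};
    use tokio::io::AsyncWrite;
    use tokio::net::TcpStream;

    struct VectoredSocket(TcpStream);

    impl AsyncWrite for VectoredSocket {
        fn poll_write(
            mut self: Pin<&mut Self>,
            cx: &mut Context<'_>,
            buf: &[u8],
        ) -> Poll<io::Result<usize>> {
            Pin::new(&mut self.0).poll_write(cx, buf)
        }

        // Forwarding these two is what lets callers actually benefit from the
        // inner stream's vectored-write support instead of the default
        // "write the first non-empty buffer" behavior.
        fn poll_write_vectored(
            mut self: Pin<&mut Self>,
            cx: &mut Context<'_>,
            bufs: &[IoSlice<'_>],
        ) -> Poll<io::Result<usize>> {
            Pin::new(&mut self.0).poll_write_vectored(cx, bufs)
        }

        fn is_write_vectored(&self) -> bool {
            self.0.is_write_vectored()
        }

        fn poll_flush(mut self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<io::Result<()>> {
            Pin::new(&mut self.0).poll_flush(cx)
        }

        fn poll_shutdown(mut self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<io::Result<()>> {
            Pin::new(&mut self.0).poll_shutdown(cx)
        }
    }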

@sfackler
Owner

  • receive all available BackendMessages;
  • try to send all messages of the pending responses within the current poll.

A sufficiently fast server could cause that logic to OOM by giving it an unbounded number of messages to buffer.

I think this should be more efficient, because we don't have a guarantee that the runtime will poll the Connection again right after the current poll.

Efficiency claims should be measured, not assumed.

In the case when Framed cannot write a full frame to the socket (I think it's possible if the peer advertises a small TCP window and the whole frame cannot be written at once, but I'm not sure it happens in real-world scenarios).

The next poll of the connection will continue to drive the flush.

P.S. I also found that both TcpStream and UnixStream support vectored writes, but tokio_postgres::Socket doesn't use them. Is there any reason for that?

Because there aren't any vectored writes happening in the client AFAIK.
