transport: fail NewStream() with better error message when conn is closed #4426

menghanl · 2021-05-12T21:42:17Z

No description provided.

menghanl · 2021-05-12T21:44:26Z

Found when debugging Test/KeepaliveClientStaysHealthyWithResponsiveServer #4424

dfawley · 2021-05-13T00:28:53Z

The test you changed now appears to be flaky.

dfawley · 2021-05-13T02:49:46Z

internal/transport/http2_client.go

+			t.mu.Lock()
+			err := t.closeErr
+			t.mu.Unlock()
+			if err != nil {
+				return nil, err
+			}


I don't believe the lock is necessary, but it's harmless in any case. t.cancel() is called only after t.closeErr is set, and t.closeErr can only be set once, before t.cancel().

t.ctx is derived from a parent context, so t.cancel() isn't the only way to cancel t.ctx. When the parent context is cancelled instead, the read and write of t.closeErr may race.
(That parent context is ClientConn.ctx, so this only happens when the ClientConn is closed.)

That makes sense, but is pretty complicated. How about:

func (t *http2Client) getCloseErr() error {
	t.mu.Lock()
	defer t.mu.Unlock()
	if err := t.closeErr; err != nil {
		return err
	}
	if t.ctx.Err() != nil {
		return ErrConnClosing
	}
	return nil // Not closed, why are you calling this??
}

Or is there some way to block until t.Close finishes so we know t.closeErr is valid?

How is this different?

And I don't like returning nil here. Even if it should never happen.

internal/transport/transport_test.go

dfawley · 2021-05-17T22:51:51Z

internal/transport/http2_client.go

+			t.mu.Lock()
+			err := t.closeErr
+			t.mu.Unlock()
+			if err != nil {
+				return nil, err
+			}


That makes sense, but is pretty complicated. How about:

func (t *http2Client) getCloseErr() error {
	t.mu.Lock()
	defer t.mu.Unlock()
	if err := t.closeErr; err != nil {
		return err
	}
	if t.ctx.Err() != nil {
		return ErrConnClosing
	}
	return nil // Not closed, why are you calling this??
}

Or is there some way to block until t.Close finishes so we know t.closeErr is valid?

dfawley · 2021-05-17T23:04:28Z

internal/transport/transport_test.go

+				// - if it is after the transport is closed (case <-ct.ctxDone),
+				//   we don't care about the error.


Why would the transport be closed? It was gracefully closed, but we have an active stream (from line 755) so it should never close, should it? Unless the server ends the stream, but it looks like it does not, until it receives the client's end-stream.

We start 200 goroutines to run this.
Some of them could be so slow that they run after the end of the line 755 stream.
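
The check described in the test comment can be sketched roughly as follows. This is a simplified model, not the actual test in transport_test.go: `fakeClient`, `errDraining`, `errClosed`, and `checkNewStreamErr` are hypothetical names, and only the 200-goroutine count comes from the comment above.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
	"sync/atomic"
)

// Hypothetical stand-ins for the errors a gracefully closed client may return.
var (
	errDraining = errors.New("stream rejected: transport is draining")
	errClosed   = errors.New("stream rejected: transport is closed")
)

// fakeClient is a hypothetical model of a client transport that has already
// been gracefully closed and may later be fully closed.
type fakeClient struct {
	mu     sync.Mutex
	closed bool
	done   chan struct{} // closed when the transport is fully closed
}

func (c *fakeClient) newStream() error {
	c.mu.Lock()
	defer c.mu.Unlock()
	if c.closed {
		return errClosed
	}
	return errDraining
}

func (c *fakeClient) close() {
	c.mu.Lock()
	defer c.mu.Unlock()
	if !c.closed {
		c.closed = true
		close(c.done)
	}
}

// checkNewStreamErr accepts the draining error unconditionally, and accepts
// any other error only if the transport is already closed.
func checkNewStreamErr(c *fakeClient, err error) error {
	if err == nil {
		return errors.New("newStream unexpectedly succeeded")
	}
	if errors.Is(err, errDraining) {
		return nil
	}
	select {
	case <-c.done:
		return nil // transport already closed: the error is not checked
	default:
		return fmt.Errorf("unexpected error before close: %w", err)
	}
}

func main() {
	c := &fakeClient{done: make(chan struct{})}
	var failures int32
	var wg sync.WaitGroup
	for i := 0; i < 200; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			if err := checkNewStreamErr(c, c.newStream()); err != nil {
				atomic.AddInt32(&failures, 1)
			}
		}()
	}
	c.close() // slow goroutines may run after this point
	wg.Wait()
	fmt.Println("unexpected failures:", atomic.LoadInt32(&failures))
}
```

Goroutines that are scheduled late see the closed transport, which is why the model tolerates any error once `done` is closed.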

dfawley · 2021-05-25T17:53:16Z

Tests are failing:

Error: ../internal/transport/http2_client.go:406:22: not enough arguments in call to t.controlBuf.finish

menghanl · 2021-05-25T18:19:01Z

This PR no longer works after moving controlbuf.finish() (#4447).
go test ./internal/transport -run Test/GracefulClose -count 100 is racy

NewStream() will fail with a controlbuf error after the receiver exits but before the transport is closed. The transport is closed by the sender, but it takes time for the sender to see the error, so this is a race. As a result, NewStream() cannot reliably return the transport close error.

This will need more thought, and probably more changes.
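
The window described above can be illustrated with a deliberately sequential model. Everything here is an assumption made for illustration: `raceModel`, `errControlBuf`, and `errKeepalive` are hypothetical, and the real transport runs these steps in separate goroutines rather than in order.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
)

// Hypothetical errors: a generic controlbuf failure and a specific close reason.
var (
	errControlBuf = errors.New("controlbuf finished")
	errKeepalive  = errors.New("transport closed: keepalive timeout")
)

// raceModel is a hypothetical model with two independent pieces of state:
// whether the control buffer has been finished, and the recorded close error.
type raceModel struct {
	mu       sync.Mutex
	bufDone  bool
	closeErr error
}

// finishControlBuf models the step that happens first.
func (m *raceModel) finishControlBuf() {
	m.mu.Lock()
	m.bufDone = true
	m.mu.Unlock()
}

// close models the step that happens some time later.
func (m *raceModel) close(err error) {
	m.mu.Lock()
	if m.closeErr == nil {
		m.closeErr = err
	}
	m.mu.Unlock()
}

// newStream prefers the close error, but can only return it once it exists.
func (m *raceModel) newStream() error {
	m.mu.Lock()
	defer m.mu.Unlock()
	if m.closeErr != nil {
		return m.closeErr
	}
	if m.bufDone {
		return errControlBuf
	}
	return nil
}

func main() {
	m := &raceModel{}
	m.finishControlBuf()
	fmt.Println("in the window:", m.newStream()) // generic controlbuf error
	m.close(errKeepalive)
	fmt.Println("after close:", m.newStream()) // specific close error
}
```

A caller that lands between the two steps gets only the generic error, which matches the flakiness reported for the GracefulClose test.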

dfawley · 2021-06-01T23:23:21Z

Can you file an issue to follow-up on this later? (Unless you have something you're actively working on already.)

menghanl requested a review from dfawley May 12, 2021 21:42

menghanl assigned dfawley May 12, 2021

menghanl added the "no release notes" and "Type: Internal Cleanup" (Refactors, etc) labels May 12, 2021

dfawley assigned menghanl and unassigned dfawley May 13, 2021

dfawley reviewed May 13, 2021

menghanl assigned dfawley and unassigned menghanl May 17, 2021

dfawley reviewed May 17, 2021

dfawley assigned menghanl and unassigned dfawley May 17, 2021

menghanl assigned dfawley and unassigned menghanl May 25, 2021

menghanl added 4 commits May 25, 2021 10:36

[flaky_keepalive_debug] 1

bee9fb5

[better_conn_close_error] test

6bb168a

[better_conn_close_error] test 2

f3e0c6b

[better_conn_close_error] c1

9602024

menghanl force-pushed the better_conn_close_error branch from b122a1f to 9602024 Compare May 25, 2021 17:37

dfawley assigned menghanl and unassigned dfawley May 25, 2021

[better_conn_close_error] after rebase, tests failing

5ee3d25

menghanl closed this Jun 1, 2021

menghanl deleted the better_conn_close_error branch June 21, 2021 21:19

github-actions bot locked as resolved and limited conversation to collaborators Dec 19, 2021
