GODRIVER-2101 Direct read/write retries to another mongos if possible #1358

prestonvasquez · 2023-08-18T01:38:53Z

Summary

When possible, deprioritize failed mongos during retry attempts.

Background & Motivation

Our current retry logic for sharded clusters can lead to an operation that failed with a retryable error being retried on the same mongos.

prestonvasquez · 2023-08-25T18:27:17Z

x/mongo/driver/operation.go

+		return nil, err
+	}
+
+	filteredServers := filterDeprioritizedServers(selectedServers, oss.deprioritizedServers)


This method was designed so that more filters can be added if the need arises in the future.
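To make the filtering step above concrete, here is a minimal sketch of what a function like `filterDeprioritizedServers` might do. It is not the driver's actual implementation: the `server` type and the `filterDeprioritized` name are hypothetical stand-ins, and the fallback to the full candidate list is an assumption drawn from the "when possible" wording in the summary.

```go
package main

// server is a hypothetical stand-in for the driver's server description;
// only the address matters for this sketch.
type server struct {
	Addr string
}

// filterDeprioritized returns the candidates whose address is not in the
// deprioritized list. If that would leave nothing to select from, it returns
// the original candidates unchanged so the retry can still proceed
// (assumed behavior, not confirmed by the diff shown here).
func filterDeprioritized(candidates, deprioritized []server) []server {
	if len(deprioritized) == 0 {
		return candidates
	}

	// Index the deprioritized addresses for constant-time lookups.
	skip := make(map[string]struct{}, len(deprioritized))
	for _, s := range deprioritized {
		skip[s.Addr] = struct{}{}
	}

	allowed := make([]server, 0, len(candidates))
	for _, c := range candidates {
		if _, ok := skip[c.Addr]; !ok {
			allowed = append(allowed, c)
		}
	}

	// Fall back to the full list when every candidate was filtered out.
	if len(allowed) == 0 {
		return candidates
	}
	return allowed
}
```

Keeping the filter as a separate function over plain slices is one way to leave room for further filters later, since each can be applied to the output of the previous one.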

x/mongo/driver/operation.go

matthewdale

Looks good, with one question about the behavior when CSOT is enabled 👍

matthewdale · 2023-09-11T07:00:16Z

mongo/integration/retryable_writes_prose_test.go

+			// Note that setting this value greater than 2 will result in false
+			// negatives. The current specification does not account for CSOT, which
+			// might allow for an "infinite" number of retries over a period of time.
+			// Because of this, we only track the "previous server".


Is there a task for updating the retryable reads/writes "deprioritized mongos" behavior to account for multiple retries (i.e. CSOT)? The vast majority of sharded clusters have >2 mongos nodes, so that seems like a questionably useful feature for drivers that support CSOT.

For now, there is no task to do this. Here are a couple of reasons from discussions with @comandeo:

We do not want this new mechanism to replace SDAM or interfere with it too much.

We believe that a mongos may recover from the error quickly, so there is no reason to keep excluding ones that failed earlier.

It is rare for multiple mongoses to fail with retryable errors. That would look like a network issue, which SDAM already handles.
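The consequence of tracking only the "previous server" can be illustrated with a small simulation. This is a sketch under stated assumptions, not driver code: `pickServer` and `selectionOrder` are hypothetical helpers, and selection here is deterministic (first eligible server), whereas a real driver chooses among eligible candidates differently.

```go
package main

// pickServer returns the first server that differs from the previously
// failed one, falling back to the first server when no alternative exists.
// Deterministic "first eligible" selection is an assumption of this sketch.
func pickServer(servers []string, previous string) string {
	if len(servers) == 0 {
		return ""
	}
	for _, s := range servers {
		if s != previous {
			return s
		}
	}
	return servers[0]
}

// selectionOrder simulates consecutive failed attempts and records which
// server each attempt targets. Only the most recent failure is remembered.
func selectionOrder(servers []string, attempts int) []string {
	order := make([]string, 0, attempts)
	previous := ""
	for i := 0; i < attempts; i++ {
		chosen := pickServer(servers, previous)
		order = append(order, chosen)
		previous = chosen // earlier failures are forgotten
	}
	return order
}
```

With three servers `mongos-a`, `mongos-b`, `mongos-c` and three attempts, `selectionOrder` yields `mongos-a`, `mongos-b`, `mongos-a`: the first server becomes eligible again on the third attempt, which is the multi-retry case raised in the question above.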

github-actions · 2023-09-11T19:56:29Z

API Change Report

No changes found!

prestonvasquez added 5 commits August 14, 2023 13:59

GODRIVER-2101 Expand test to use pigeonhole principle

691b976

GODRIVER-2101 Direct read/write retries to another mongos if possible

354597e

GODRIVER-2101 Revert unecessary changes

44dd3f4

GODRIVER-2101 revert changes to collection and cursor

ecea751

GORIVER-2101 resolve merge conflict

7e74cd0

comandeo mentioned this pull request Aug 21, 2023

DRIVERS-1571 Retry on different mongos when possible mongodb/specifications#1450

Merged

4 tasks

prestonvasquez added 3 commits August 21, 2023 17:31

GODRIVER-2101 Apply opServerSelector

3e242f8

GODRIVER-2101 Fix static analysis errors

4787e20

GODRIVER-2101 Remove empty line

d189587

prestonvasquez marked this pull request as ready for review August 25, 2023 17:23

prestonvasquez requested a review from a team as a code owner August 25, 2023 17:23

prestonvasquez requested review from qingyang-hu and matthewdale and removed request for a team August 25, 2023 17:23

prestonvasquez commented Aug 25, 2023

qingyang-hu reviewed Aug 28, 2023

x/mongo/driver/operation.go (outdated, resolved)

qingyang-hu reviewed Aug 28, 2023

x/mongo/driver/operation.go (outdated, resolved)

GODRIVER-2101 Use map 'ok' value

96c42ef

prestonvasquez had a problem deploying to api-report August 31, 2023 01:20 — with GitHub Actions Failure

prestonvasquez requested a review from qingyang-hu August 31, 2023 01:20

Merge branch 'master' into GODRIVER-2101

da00d59

prestonvasquez had a problem deploying to api-report August 31, 2023 15:09 — with GitHub Actions Failure

matthewdale previously approved these changes Sep 11, 2023

qingyang-hu previously approved these changes Sep 11, 2023

GODRIVER-2101 Resolve merge conflict

943ecbe

prestonvasquez dismissed stale reviews from qingyang-hu and matthewdale via 943ecbe September 11, 2023 16:42

prestonvasquez temporarily deployed to api-report September 11, 2023 16:42 — with GitHub Actions Inactive

prestonvasquez requested review from matthewdale and qingyang-hu September 11, 2023 17:04

prestonvasquez enabled auto-merge September 11, 2023 20:28

prestonvasquez disabled auto-merge September 11, 2023 20:28

prestonvasquez enabled auto-merge September 11, 2023 20:28

qingyang-hu approved these changes Sep 12, 2023

prestonvasquez added this pull request to the merge queue Sep 12, 2023

Merged via the queue into mongodb:master with commit d92f20d Sep 12, 2023
21 checks passed

prestonvasquez deleted the GODRIVER-2101 branch September 12, 2023 15:09
