
Add job management docs for cutover in physical cluster replication jobs #18525

Merged · 4 commits into main from pcr-schedule-pause-destination · May 16, 2024

Conversation

kathancox (Contributor):

Fixes DOC-8998

This PR adds detail on job management to the cutover page under physical cluster replication. This affects scheduled jobs and changefeeds.

netlify bot commented May 7, 2024:

Netlify Preview

Name Link
🔨 Latest commit 098ea87
🔍 Latest deploy log https://app.netlify.com/sites/cockroachdb-docs/deploys/66462613a7c5e00008d6034a
😎 Deploy Preview https://deploy-preview-18525--cockroachdb-docs.netlify.app
📱 Preview on mobile

@kathancox kathancox force-pushed the pcr-schedule-pause-destination branch from 117611e to 57eda19 Compare May 7, 2024 17:21

[Changefeeds]({% link {{ page.version.version }}/change-data-capture-overview.md %}) will fail on the promoted cluster immediately after cutover. We recommend that you recreate changefeeds on the promoted cluster.

[Scheduled changefeeds]({% link {{ page.version.version }}/create-schedule-for-changefeed.md %}) will continue on the promoted cluster. You will need to manage [pausing]({% link {{ page.version.version }}/pause-schedules.md %}) or [canceling]({% link {{ page.version.version }}/drop-schedules.md %}) the schedule on the original primary and promoted standby clusters.
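The pause/cancel management described above maps onto CockroachDB's schedule statements. A minimal sketch (the schedule ID below is an illustrative placeholder, not taken from this PR):

```sql
-- List schedules to find the changefeed schedule that is now
-- running on the promoted cluster.
SHOW SCHEDULES;

-- Pause the schedule (placeholder ID) on whichever cluster should
-- stop emitting, so two clusters do not write to the same sink...
PAUSE SCHEDULE 831680482536357889;

-- ...or cancel it outright if it should never resume there.
DROP SCHEDULE 831680482536357889;
```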
kathancox (Author) commented:

@msbutler I don't know if this is quite correct. I made an assumption of what would happen here because scheduled changefeeds are a one-time table scan rather than a continuous job like a regular changefeed. Please correct me!
Also, I have not added this as a limitation yet, do we want to do so?

msbutler replied:

I think what you have here is fine! Perhaps you could explain why we recommend some manual intervention: we don't recommend two clusters writing changefeeds to the same sink.

We should definitely add a known limitation for this.

@kathancox kathancox requested a review from msbutler May 7, 2024 17:42
msbutler left a comment:

Conducted a close read of the v23.2 version, assuming the v24.1 version is basically the same.


### Changefeeds

[Changefeeds]({% link {{ page.version.version }}/change-data-capture-overview.md %}) will fail on the promoted cluster immediately after cutover. We recommend that you recreate changefeeds on the promoted cluster.
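The recommended recreation step can be sketched as follows (the table name and Kafka sink URI are illustrative placeholders, not values from this PR):

```sql
-- On the promoted cluster, after cutover, start a new changefeed
-- to replace the one that failed.
CREATE CHANGEFEED FOR TABLE movr.rides
  INTO 'kafka://localhost:9092'
  WITH updated, resolved;
```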
msbutler commented:

I'm not a CDC expert, but it's probably worth mentioning why they fail (they fail after cluster restore as well, for example). I think we fail them because we don't want two separate clusters running a changefeed to the same sink, right?



@kathancox kathancox force-pushed the pcr-schedule-pause-destination branch from ab321b2 to 2d1acb9 Compare May 9, 2024 16:42
{{site.data.alerts.end}}

### Changefeeds

kathancox (Author) commented:

@msbutler I added the known limitations around scheduled changefeeds for this. Also updated the changefeed text.
@rharding6373 Could you take a look at the changefeed text here to confirm that "two clusters running the same changefeed to one sink" is the reason we fail changefeeds on full cluster restore (and, in this case, cutover)?

rharding6373 replied:

This is correct. Thanks for checking.

kathancox (Author) replied:

Thanks Rachael!

@kathancox kathancox requested a review from msbutler May 9, 2024 16:44
@@ -0,0 +1 @@
After the [cutover process]({% link {{ page.version.version }}/cutover-replication.md %}) for [physical cluster replication]({% link {{ page.version.version }}/physical-cluster-replication-overview.md %}), [scheduled changefeeds]({% link {{ page.version.version }}/create-schedule-for-changefeed.md %}) will continue on the promoted cluster. You will need to manage [pausing]({% link {{ page.version.version }}/pause-schedules.md %}) or [canceling]({% link {{ page.version.version }}/drop-schedules.md %}) the schedule on the original primary and promoted standby clusters to avoid two clusters running the same changefeed to one sink. [Tracking GitHub issue](https://github.com/cockroachdb/cockroach/issues/123776)
kathancox (Author) commented:

Note to docs team reviewers: the known limitation tracking GH issue link has different formats between v23.2 + v24.1 following the update to known limitations for GA.

@@ -0,0 +1 @@
After the [cutover process]({% link {{ page.version.version }}/cutover-replication.md %}) for [physical cluster replication]({% link {{ page.version.version }}/physical-cluster-replication-overview.md %}), [scheduled changefeeds]({% link {{ page.version.version }}/create-schedule-for-changefeed.md %}) will continue on the promoted cluster. You will need to manage [pausing]({% link {{ page.version.version }}/pause-schedules.md %}) or [canceling]({% link {{ page.version.version }}/drop-schedules.md %}) the schedule on the original primary and promoted standby clusters to avoid two clusters running the same changefeed to one sink. [#123776](https://github.com/cockroachdb/cockroach/issues/123776)

msbutler commented:

Nit: I think we should only instruct the user to pause or cancel on the newly promoted cluster.

kathancox (Author) commented (May 13, 2024):

@msbutler What is the expectation for users when the scheduled backup is paused on the promoted cluster: that they pause or cancel the backup schedule on the original cluster? I assume cancel, given the possible storage/collection collision?

kathancox (Author) replied:

@msbutler Ah, I realize now (I think...) that I got the emphasis wrong on your comment; that is, let's only talk about the newly promoted cluster. I have updated to this effect! 🙃

@kathancox kathancox requested a review from msbutler May 15, 2024 14:29
Amruta-Ranade (Contributor) left a comment:

LGTM!

@kathancox kathancox force-pushed the pcr-schedule-pause-destination branch from e37f4d2 to 453cba9 Compare May 16, 2024 15:08
@kathancox kathancox force-pushed the pcr-schedule-pause-destination branch from 453cba9 to 098ea87 Compare May 16, 2024 15:28
@kathancox kathancox merged commit dd912ec into main May 16, 2024
6 checks passed
@kathancox kathancox deleted the pcr-schedule-pause-destination branch May 16, 2024 15:40
kathancox (Author):

TFTRs!


4 participants