Fix retry #32

Closed · wants to merge 2 commits

Conversation

@hphilipps commented Sep 30, 2020

@agneum @x4m @NikolayS
This is building on #31 and fixing the gcs retry logic.

I simplified the retry logic by completely removing the Uploader. The Uploader was chunking files into 20MiB pieces and retrying each piece when the writer failed. But this cannot work:
When a writer.Write() finally fails (after doing its own retries for around 30s), it does not seem possible to reuse the same writer for the same upload, so we need to create a new writer and try again. But when we create a new writer, the SessionID of the resumable upload is lost, and the new writer will always write the file from scratch, starting at offset 0 (in 16MiB chunks), so we would also have to feed it the first 20MiB part again instead of continuing with the current part.

I simplified this by reading the whole file into a buffer and simply retrying the whole file. The downside, of course, is that we will use a lot of memory for big files, but there is no other way as long as PutObject() takes an io.Reader: it can only be read once and thus needs to be cached for retrying. If we want to avoid that, we would need to do retries higher up the stack in wal-g, or change the signature of PutObject to take a file handle that can be re-opened.
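
A rough sketch of what this amounts to, using the plain cloud.google.com/go/storage client (the function name and retry constant are illustrative, not the actual PR code):

```go
// Buffer the whole object and retry the upload with a fresh storage.Writer
// on every attempt, because a failed writer (and its resumable session)
// cannot be reused.
package gcs

import (
	"context"
	"io"
	"io/ioutil"
	"time"

	"cloud.google.com/go/storage"
)

const maxRetries = 5 // illustrative value

func uploadWithRetry(ctx context.Context, obj *storage.ObjectHandle, content io.Reader) error {
	// The io.Reader can only be consumed once, so the whole file has to be
	// kept in memory to make further attempts possible.
	data, err := ioutil.ReadAll(content)
	if err != nil {
		return err
	}

	var lastErr error
	for attempt := 0; attempt < maxRetries; attempt++ {
		w := obj.NewWriter(ctx) // fresh writer -> fresh resumable upload session
		_, werr := w.Write(data)
		cerr := w.Close()
		if werr == nil && cerr == nil {
			return nil
		}
		if werr != nil {
			lastErr = werr
		} else {
			lastErr = cerr
		}
		time.Sleep(time.Second << uint(attempt)) // simple exponential backoff
	}
	return lastErr
}
```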

If there were a way to extend the gcs client retry parameters to retry for longer than roughly 30s, we maybe wouldn't need all of this. But I couldn't find a way to configure retries for a storage.Writer so far.

I will work on making the chunk size a config option next. This is getting a bit complicated, because we will want different chunk sizes for wal-push (small files) and backup-push (big files).
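
For reference, storage.Writer has a ChunkSize field that can be set before the first Write; a sketch of how a per-kind setting could look (the uploadKind type and the chosen sizes are made up for illustration, not actual WAL-G options):

```go
// Pick the writer's chunk size per upload kind.
package gcs

import (
	"context"

	"cloud.google.com/go/storage"
)

type uploadKind int

const (
	walPush    uploadKind = iota // small, frequent files
	backupPush                   // large tarballs
)

func newWriter(ctx context.Context, obj *storage.ObjectHandle, kind uploadKind) *storage.Writer {
	w := obj.NewWriter(ctx)
	switch kind {
	case walPush:
		w.ChunkSize = 4 << 20 // 4 MiB: less buffering per small file
	case backupPush:
		w.ChunkSize = 32 << 20 // 32 MiB: fewer requests for big files
	}
	return w
}
```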

@hphilipps commented Sep 30, 2020

I checked: there is no way to change the retryDeadline or backoff strategy of the storage.Writer - this is hardcoded in an internal package :-(

https://github.com/googleapis/google-api-go-client/blob/master/internal/gensupport/resumable.go#L27

@hphilipps

I opened an issue about the hardcoded retryDeadline: googleapis/google-api-go-client#685

@hphilipps

Thinking about it - even if we got the option to set a higher retryDeadline, there are cases where it might never recover: a resumable upload session always goes to the same backend, and if that backend dies, the session may be lost forever - not sure how this works in GCS internally.

@agneum commented Oct 2, 2020

I created a PR to override retryDeadline: googleapis/google-api-go-client#686

@hphilipps

That's great @agneum, thanks!

@NikolayS commented Oct 5, 2020

@hphilipps With the approach implemented in this PR, and keeping in mind that Postgres stores data in files not exceeding 1 GiB, we are going to use up to N * 1 GiB of RAM right now (where N is the concurrency level from the WAL-G configuration).

As I understand, you're going to think about the next steps to reduce memory consumption. What could they be?

  • split into chunks,
  • teach the Reader to re-read files partially, with some offset (see the sketch below).

Right?

What do you think - how difficult would this be to implement? Wouldn't it require too many changes in WAL-G? (And potentially affect other types of storage, not just GCS.)

cc @x4m @agneum
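
A rough illustration of the offset idea: *os.File implements io.ReaderAt, so an io.SectionReader can hand out a fresh reader over the same byte range for every retry without buffering anything in memory (a sketch only, not WAL-G code):

```go
// Re-read a part of a file at a given offset without buffering it in memory.
package upload

import (
	"io"
	"os"
)

// partReader returns an io.Reader for bytes [offset, offset+length) of path,
// plus a close function. Calling it again after a failed attempt yields a
// fresh reader over exactly the same part.
func partReader(path string, offset, length int64) (io.Reader, func() error, error) {
	f, err := os.Open(path)
	if err != nil {
		return nil, nil, err
	}
	return io.NewSectionReader(f, offset, length), f.Close, nil
}
```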

@x4m commented Oct 5, 2020

Dunno, we do not observe memory consumption of more than 1 GB - why would GCS storage require more?

@x4m commented Oct 5, 2020

Actually, we usually observe ~440 MB, because the network is throttled to the same bandwidth as the disk, but we send compressed data, so there aren't that many buffers ahead of the network.

@hphilipps

@x4m @NikolayS To explain the issue again:

The wal-g storages PutObject() method takes an io.Reader to read the data it should upload to GCS. The storage.Writer of the google-go-api-client package can read this data in small chunks and upload them piece by piece, because it uses a resumable upload session to GCS, which allows small chunks to be transferred in multiple requests, and those chunks are automatically retried on error.

But if uploading one chunk keeps failing for longer than 32s, the whole resumable session fails and can't be reused, so we need to retry the whole file from the beginning. Because we only got an io.Reader, and an io.Reader can only be read once, we need to buffer the whole file in memory inside the PutObject() method to be able to retry the upload. And as @NikolayS stated, if we upload with 10 parallel threads and the files are around 400MiB on average, we already consume 4GiB of RAM (on average - in the worst case it could be much more). This might work for us, but I don't think it's a good solution in general.

Changing this in the wal-g storages package seems impossible - io.Reader is used all over the place throughout the call stack, which makes sense because we want to stream the data instead of buffering it. To get around buffering in memory, the retry would need to happen in wal-g at the place where we access the file on disk for reading.
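
Something like the following is what I mean by retrying higher up the stack - re-open the file by path on each attempt so the storage layer always gets a fresh io.Reader (putObject here just stands in for the storages interface, it is not the real signature):

```go
// Retry at the level where the file path is still known: each attempt
// re-opens the file instead of buffering its contents in memory.
package upload

import (
	"io"
	"os"
)

func pushFileWithRetry(path, objectName string, putObject func(string, io.Reader) error, attempts int) error {
	var lastErr error
	for i := 0; i < attempts; i++ {
		f, err := os.Open(path)
		if err != nil {
			return err
		}
		lastErr = putObject(objectName, f)
		f.Close()
		if lastErr == nil {
			return nil
		}
	}
	return lastErr
}
```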

@x4m commented Oct 6, 2020

So, if upload errors persist for longer than 32 seconds, we need to store the whole file?
Is that configurable?
I think we do not want to retry the whole upload from the beginning; we want to cover all network failures with retries of chunks.

@hphilipps

@x4m Exactly, the current implementation of the google-go-api-client has retryDeadline = 32s hardcoded: googleapis/google-api-go-client#685

So storage.Writer fails after retrying the failed chunk for 32s and can't be reused. And if we want to retry sending the file via a new storage.Writer, we need to give it the same data again, which we can only do if we buffered the data within PutObject. Or we do the retry at the place in wal-g where the file is opened, so we can just read from disk (or the OS cache) again instead of buffering in memory - but the way wal-g creates tarballs looks very complicated, and I'm not sure yet whether it would be easy, or even possible, to add retry logic there for the case where uploading a tarball fails down the road.

Of course we don't want to resend the whole file if we can avoid it. This is exactly why storage.Writer does a chunked upload internally and only retries the failed chunks. But if it gets 503s for longer than 32s, then possibly the GCS backend server handling the resumable upload session died (a GCS engineer told me that all chunks for a session always go to the same backend), the session is lost anyway, and we need to start uploading the whole file again.

@x4m commented Oct 7, 2020

In AWS we send multiparts, which are persistent and do not depend on the liveness of a backend server.
I think in GCS we can create a lot of small objects and then combine them into a big one.
If WAL-G dies in the middle, it's not critical for us - wal-g delete will clean up the garbage eventually anyway.
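
For what it's worth, the Go client exposes this as ObjectHandle.ComposerFrom; a minimal sketch (bucket and object names are placeholders):

```go
// Upload parts as independent objects, then combine them with a compose
// request. GCS allows up to 32 source objects per compose call, so very
// large files need chained composes.
package gcs

import (
	"context"

	"cloud.google.com/go/storage"
)

func composeParts(ctx context.Context, bkt *storage.BucketHandle, dst string, partNames []string) error {
	srcs := make([]*storage.ObjectHandle, 0, len(partNames))
	for _, name := range partNames {
		srcs = append(srcs, bkt.Object(name))
	}
	// Each part was uploaded (and retried) independently; composing does not
	// depend on any resumable upload session staying alive.
	_, err := bkt.Object(dst).ComposerFrom(srcs...).Run(ctx)
	return err
}
```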

@agneum commented Oct 7, 2020

@hphilipps @NikolayS @x4m
To sum up, it seems we have several possible ways forward.

In my view, the most preferable is multipart uploading with object composition. I'm going to implement it.

@hphilipps

@agneum That's a great finding - composite objects sound like the perfect solution to avoid buffering too much in memory for retrying! Let me know if I can help in any way - I'm happy to review your solution.

And I totally agree with your evaluation of the other possible solutions - they all seem either incomplete or very hard to accomplish.

@NikolayS commented Oct 7, 2020

Special thanks to @x4m for guiding us today.

@agneum commented Oct 11, 2020

Here is a PR #33 that implements multipart uploading.

@hphilipps

Closing in favour of #33.

@hphilipps closed this Dec 14, 2020