feat(request): optionally retry network failures #353

nbrustein · 2015-11-17T13:57:23Z

Retries can be turned on by setting

Stripe.max_retries_on_network_failure = 2 # or any other positive integer

Once max_retries_on_network_failure is set to a positive integer, any request that fails on a network error will be retried. There will be a sleep between retries, the length of which is determined by an exponential backoff algorithm.

If a post request is made without an idempotency key and retries are on, then an idempotency key will be added automatically, since it is unsafe to retry without one.
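The rule above could look roughly like the following. This is a minimal sketch, not the code from this PR: the helper name `with_idempotency_key`, the plain-hash header representation, and the use of `SecureRandom.uuid` to generate the key are all assumptions made for illustration.

```ruby
require "securerandom"

# Hypothetical helper (not from the PR): returns headers that are safe to
# retry with. A POST without an idempotency key gets a random one; any other
# request, or a POST that already carries a key, is returned unchanged.
def with_idempotency_key(method, headers)
  return headers unless method == :post
  return headers if headers.key?("Idempotency-Key")

  # merge builds a new hash, so the caller's headers are not mutated
  headers.merge("Idempotency-Key" => SecureRandom.uuid)
end
```

A key supplied by the caller always wins, so an application that already manages its own idempotency keys would see no change in behavior.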

max_retry_sleep_seconds is a float that can be set to configure the maximum time to sleep between retries (default is 2 seconds).

base_retry_sleep_seconds is a float that can be set to configure the time to wait before the initial retry (default is 0.5 seconds).
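To make the backoff concrete, here is a minimal sketch of a capped exponential delay using the two defaults from the description. The name `sleep_time` appears in the diff, but its body is not shown in this thread, so the doubling formula and the keyword arguments below are assumptions.

```ruby
# Hypothetical sketch: delay in seconds before the given retry, where
# retry_count is 1 for the first retry. The delay starts at +base+, doubles
# on each subsequent retry, and never exceeds +max+.
def sleep_time(retry_count, base: 0.5, max: 2.0)
  delay = base * (2**(retry_count - 1))
  [delay, max].min
end
```

With these defaults the first three retries would wait 0.5, 1.0, and 2.0 seconds, and every later retry would wait 2.0 seconds.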

nbrustein · 2015-11-17T14:00:14Z

This is a re-submission of #280

@kyleconroy, I rebased, added the exponential backoff, and am now only adding an idempotency key to post requests. A couple questions:

Which README should I update? The README in this project has very little currently, so I don't think that's the place for it.
Are you sure we don't need to add an idempotency key to DELETE requests? Seems like if the first one had already hit the server successfully, then the second one will get an InvalidRequest error.

kyleconroy · 2015-11-17T17:37:46Z

lib/stripe.rb

@@ -232,6 +268,15 @@ def self.request_headers(api_key)
    end
  end

+  # the build machines run ruby 1.8.7, and so do not have SecureRandom


We no longer support Ruby 1.8.7, so we can assume that SecureRandom is defined.

kyleconroy · 2015-11-17T17:50:40Z

The main README is the best place we have right now for this documentation.
Agreed, we should add the tokens for both POST and DELETE.

That leaves the interface. max_retries_on_network_failure is too verbose. Maybe network_retry_attempts, network_retry_max_sleep, network_retry_min_sleep instead?

cc @russelldavis @brandur for thoughts.

russelldavis · 2015-11-17T19:04:56Z

How about just max_network_retries?

kyleconroy · 2015-11-17T19:11:17Z

@russelldavis are you proposing that we don't expose the other values?

russelldavis · 2015-11-17T19:20:20Z

Oh, sorry, I was just thinking of the first value. How about:

max_network_retries
initial_network_retry_delay
max_network_retry_delay

kyleconroy · 2015-11-17T19:42:24Z

lib/stripe.rb

+      retry_count = retry_count + 1
+      sleep sleep_time(retry_count)
+      response = execute_request_with_rescues(request_opts, api_base_url, retry_count)
+      if self.on_successful_retry


What is the use case for this callback?

I wanted to know if we were ever actually hitting this thing, so I put some logging in that callback. That's how I know this code has actually caught errors for us. (A bit of an edge case, for sure. I wouldn't complain too much if you wanted to remove it)
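For readers following the callback discussion, here is a rough sketch of a retry loop with an optional success hook. It is not the PR's implementation: the method name `with_network_retries`, its keyword arguments, the list of rescued errors, and the fixed 0.5 second delay are all assumptions chosen to keep the example short.

```ruby
require "timeout"

# Hypothetical retry wrapper (illustrative only). Runs the block and retries
# it up to +max_retries+ times when one of +errors+ is raised. If an attempt
# succeeds after at least one retry, +on_successful_retry+ is called with the
# number of retries that were needed. +sleeper+ is injectable so the wait can
# be skipped in tests.
def with_network_retries(max_retries,
                         errors: [Timeout::Error, Errno::ECONNREFUSED],
                         on_successful_retry: nil,
                         sleeper: ->(seconds) { sleep(seconds) })
  retry_count = 0
  begin
    result = yield
    on_successful_retry.call(retry_count) if on_successful_retry && retry_count > 0
    result
  rescue *errors
    # Out of retries: re-raise the error that was just rescued
    raise if retry_count >= max_retries

    retry_count += 1
    sleeper.call(0.5) # placeholder delay; a real loop would back off
    retry
  end
end
```

Passing a lambda that logs the retry count is the kind of use described in the reply above: it shows whether retries are ever triggered in production.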

brandur · 2015-11-17T20:24:13Z

Looks good to me generally!

Honestly, related to #313, this would be much more cleanly implemented as a Faraday or Excon middleware, but given that we're not there yet, it makes sense not to block on that.

One possible idea here is to just remove the options for max_retry_sleep_seconds and base_retry_sleep_seconds and just use reasonable defaults. Allow users to just set the maximum number of retries instead, and just add the additional configuration back if there's any demand for it (unlikely IMO). This would have the advantage of keeping the API a little smaller in case we want to re-implement it.

russelldavis · 2015-11-17T20:46:09Z

lib/stripe.rb

    raise APIConnectionError.new(message + "\n\n(Network error: #{e.message})")
  end
+
+  def self.should_retry?(e, retry_count)
+    return false unless self.max_retries_on_network_failure > retry_count


nit: this seems easier to follow if rewritten as:

return false if retry_count >= self.max_retries_on_network_failure
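A complete predicate built around the suggested guard might look like this. Only the first line of `should_retry?` is visible in the diff, so the rest is an assumption: the list of error classes is illustrative, and the limit is passed as a third argument here instead of being read from the module setting.

```ruby
require "socket"  # defines SocketError
require "timeout" # defines Timeout::Error

# Hypothetical sketch: true when the error looks like a network failure and
# the retry budget has not been used up yet.
def should_retry?(error, retry_count, max_network_retries)
  return false if retry_count >= max_network_retries

  # Illustrative list only; the PR's actual list is not shown in this thread
  network_errors = [Timeout::Error, SocketError, Errno::ECONNREFUSED, Errno::ECONNRESET]
  network_errors.any? { |klass| error.is_a?(klass) }
end
```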

booleanbetrayal · 2015-11-17T21:37:02Z

looking forward to seeing this PR land!

kyleconroy · 2015-11-17T22:25:30Z

Agreed, keeping the surface area of this change low (one settable attribute) seems like a smart choice. I'd also be a fan of getting rid of the retry callback.

nbrustein · 2015-11-18T19:53:15Z

@kyleconroy, summarizing the discussion above, I'm going to do the following:

Rename max_retries_on_network_failure to max_network_retries
Rename max_retry_sleep_seconds to max_network_retry_delay
Rename base_retry_sleep_seconds to initial_network_retry_delay
Document retry stuff in README.rdoc (this still seems a little strange to me given what else is in that file currently, but I don't mind adding it)
For max_network_retry_delay and initial_network_retry_delay I can hardcode them and make them not configurable, or I can just leave them configurable but not bother to document them. (Let me know what you prefer.)
Remove on_successful_retry (Or I can just leave it, add a comment for why it's there, and not document it. Let me know what you prefer.)

Anything else I missed? Thanks for the comments.

kyleconroy · 2015-11-18T19:55:40Z

Please hardcode the values, making them not configurable
Please remove on_successful_retry

Other than that, looks good!
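The agreed interface, one settable attribute with hard-coded delays, could be sketched as follows. `RetryConfig` is a hypothetical stand-in for the library's top-level module, and the default of 0 (retries off) and the constant values (taken from the defaults in the PR description) are assumptions, not confirmed details of the merged code.

```ruby
# Hypothetical stand-in module (not the library's real code)
module RetryConfig
  # Hard-coded on purpose: exposed as constants, with no setters
  INITIAL_NETWORK_RETRY_DELAY = 0.5
  MAX_NETWORK_RETRY_DELAY = 2.0

  # Assumed default: retries stay off until the user opts in
  @max_network_retries = 0

  class << self
    # The only retry setting that users can change
    attr_accessor :max_network_retries
  end
end

RetryConfig.max_network_retries = 2
```

Keeping the delays as constants leaves room to reimplement the retry logic later without breaking a public setting.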

nbrustein · 2015-11-18T20:35:29Z

Should be good to go now.

booleanbetrayal · 2015-11-23T17:28:43Z

@kyleconroy - anything else blocking this feature?

kyleconroy · 2015-11-23T17:36:15Z

@nbrustein max_retries_on_network_failure needs to be renamed to max_network_retries. generate_random_idempotency_key should only use SecureRandom. We no longer support 1.8.7, so we can assume it's available.

nbrustein · 2015-11-23T19:34:02Z

@kyleconroy those changes are in. Let me know if there's anything else.

kyleconroy · 2015-11-23T19:50:00Z

LGTM @brandur can you take one last look as well?

brandur · 2015-11-23T21:17:22Z

@kyleconroy +1 from me.

@nbrustein Thanks for the patch!

kyleconroy · 2015-11-23T21:38:27Z

Thanks again @nbrustein. We'll cut a new release in around a week, as we'll want to wait for the holidays to be over.

booleanbetrayal · 2015-11-23T21:41:18Z

👍

dziemian007 · 2016-01-05T15:02:50Z

Hi @kyleconroy, could you please update rubygems so we can get this change without specifying the GitHub repository in the Gemfile? Thanks!

brandur · 2016-01-05T17:27:24Z

@dziemian007 Released as 1.32.0. Sorry about the delay!

booleanbetrayal · 2016-01-05T17:42:58Z

🎉

nbrustein mentioned this pull request Nov 17, 2015

feat(request): optionally retry all network failures #280

Closed

kyleconroy reviewed Nov 17, 2015

russelldavis reviewed Nov 17, 2015

nbrustein force-pushed the retry-network-failures branch 2 times, most recently from 15d818b to 66ced30 Compare November 18, 2015 20:26

feat(request): optionally retry all requests that fail on a network failure

a31986f

nbrustein force-pushed the retry-network-failures branch from 66ced30 to a31986f Compare November 23, 2015 19:31

kyleconroy added a commit that referenced this pull request Nov 23, 2015

Merge pull request #353 from nbrustein/retry-network-failures

4e01b13

feat(request): optionally retry network failures

kyleconroy merged commit 4e01b13 into stripe:master Nov 23, 2015
