`AutoRefreshingProvider` caches erroneous responses indefinitely #1765

samwho · 2020-06-02T10:36:17Z

The implementation of `AutoRefreshingProvider` caches an error forever, and this behaviour is causing us problems when using the library in production.

Code in question:

rusoto/rusoto/credential/src/lib.rs

Lines 274 to 298 in 69e7c91

    
    #[async_trait]
    impl<P: ProvideAwsCredentials + Send + Sync + 'static> ProvideAwsCredentials
        for AutoRefreshingProvider<P>
    {
        async fn credentials(&self) -> Result<AwsCredentials, CredentialsError> {
            loop {
                let mut guard = self.current_credentials.lock().await;
                match guard.as_ref() {
                    // no result from the future yet, let's keep using it
                    None => {
                        let res = self.credentials_provider.credentials().await;
                        *guard = Some(res);
                    }
                    Some(Err(e)) => return Err(e.clone()),
                    Some(Ok(creds)) => {
                        if creds.credentials_are_expired() {
                            *guard = None;
                        } else {
                            return Ok(creds.clone());
                        };
                    }
                }
            }
        }
    }

I think there are a couple of potential solutions to this:

1. Treat the `Some(Err(e)) => return Err(e.clone()),` match arm the same way expired credentials are treated: set the guard to `None` and loop again. This means it'll keep trying in the face of errors, so it probably needs some sort of backoff / give-up mechanism.
2. Cache failures for a defined length of time, probably shorter than the success cache length.
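
To make the second option concrete, here is a minimal, synchronous, std-only sketch of caching failures for a bounded time. It is not rusoto's API: `Creds`, `CredsError`, `ErrorAwareCache`, and the `error_ttl` parameter are hypothetical stand-ins for `AwsCredentials`, `CredentialsError`, and `AutoRefreshingProvider`, and the real provider is async and uses an async mutex.

```rust
use std::sync::Mutex;
use std::time::{Duration, Instant};

/// Hypothetical stand-in for rusoto's `AwsCredentials`.
#[derive(Clone, Debug, PartialEq)]
pub struct Creds {
    pub key: String,
    pub expires_at: Instant,
}

/// Hypothetical stand-in for rusoto's `CredentialsError`.
#[derive(Clone, Debug, PartialEq)]
pub struct CredsError(pub String);

/// A cached outcome plus the instant after which it must be refetched.
struct Entry {
    result: Result<Creds, CredsError>,
    stale_at: Instant,
}

/// Caches successes until they expire and failures for `error_ttl` only.
pub struct ErrorAwareCache<F>
where
    F: Fn() -> Result<Creds, CredsError>,
{
    fetch: F,
    error_ttl: Duration,
    current: Mutex<Option<Entry>>,
}

impl<F> ErrorAwareCache<F>
where
    F: Fn() -> Result<Creds, CredsError>,
{
    pub fn new(fetch: F, error_ttl: Duration) -> Self {
        Self {
            fetch,
            error_ttl,
            current: Mutex::new(None),
        }
    }

    pub fn credentials(&self) -> Result<Creds, CredsError> {
        let mut guard = self.current.lock().unwrap();
        let now = Instant::now();

        // Serve the cached outcome (success or failure) while it is fresh.
        if let Some(entry) = guard.as_ref() {
            if now < entry.stale_at {
                return entry.result.clone();
            }
        }

        // Otherwise refetch. A success stays cached until the credentials
        // expire; a failure stays cached only for `error_ttl`.
        let result = (self.fetch)();
        let stale_at = match &result {
            Ok(creds) => creds.expires_at,
            Err(_) => now + self.error_ttl,
        };
        *guard = Some(Entry {
            result: result.clone(),
            stale_at,
        });
        result
    }
}
```

With a short `error_ttl`, a transient failure from the underlying provider is returned to callers for a bounded window and then retried, rather than being returned forever. The first option (retry immediately with backoff) would instead loop inside `credentials` and would need its own give-up condition.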

If you agree that this behaviour needs changing and you let me know which approach you prefer, I'd be happy to contribute a PR. 🙂

If this is working as intended I would be keen to hear what workarounds you suggest. The behaviour we seem to be seeing is that sometimes requesting the current role from the instance profile in EC2 will fail, and that failure gets cached forever, causing our binary to get stuck in a non-functional state.


iliana · 2020-06-04T21:35:24Z

I don't think this is working as intended and would be happy to review a PR to change the error behavior here.

samwho · 2020-06-05T09:46:43Z

🙏

Do you have a preferred approach?

iliana · 2020-06-12T18:11:54Z

Not off the top of my head. Our goal is to match the behaviors of other AWS SDKs when it comes to fundamentals like credentials, request signing, retries, etc., so your best reference is probably botocore.

nbaztec · 2021-07-29T10:46:08Z

#1933 fixes it, but it seems the project isn't maintained anymore.

lzuosym mentioned this issue Jan 27, 2023

cherry pick the changes to fix the autorefresh bug sportsball-ai/rusoto#1

Merged

ranandfigma mentioned this issue Feb 28, 2024

fix creds on error figma/rusoto#2

Merged
