Move delete failure detection to the cache fetcher. #296

dylanahsmith · 2016-10-01T03:00:20Z

Fixes #211

Problem

IdentityCache::MemoizedCacheProxy#delete was assuming the delete failed if cache_fetcher.delete returned falsey. However, that falsey value could either mean the memcached request failed or the key wasn't present, and different cache stores aren't consistent about whether nil or false means as a return value for their delete method as I detailed in #211 (comment).

This was confusing because we were logging something like [IdentityCache] delete failed for IDC:5:blob:App:18240036282024875059:41 which sounds like a probably with expiring the cache, but it could just mean that the value wasn't cached yet.

Solution

If IdentityCache::MemoizedCacheProxy#cache_fetcher is an IdentityCache::CacheFetcher, then it writes a IdentityCache::DELETED value instead of using a delete memcached request. In that case the return value isn't as ambiguous, it just returns falsey if the request failed. So, I was able to move the failure detection into IdentityCache::CacheFetcher#delete where we can rely on it and put a generic [IdentityCache] delete for #{key} message in IdentityCache::FallbackFetcher#delete.

Different cache stores return a different value from their delete methods, so we don't try to detect whether it failed, since a falsey return value could mean the key wasn't present or it couldn't send the command to the server. This is less ambiguous for write, which is used by the cache fetcher, so failure detection was moved into there.

sirupsen

Writing a test to ensure no-one moves this back to cause this problem again would be helpful. You can likely reproduce this with mocks, based on your comment.

sirupsen · 2016-10-01T13:03:09Z

lib/identity_cache/cache_fetcher.rb

@@ -11,7 +11,9 @@ def write(key, value)
    end

    def delete(key)
-      @cache_backend.write(key, IdentityCache::DELETED, :expires_in => IdentityCache::DELETED_TTL.seconds)
+      result = @cache_backend.write(key, IdentityCache::DELETED, expires_in: IdentityCache::DELETED_TTL.seconds)
+      IdentityCache.logger.debug { "[IdentityCache] delete #{ result ? 'recorded' : 'failed' } for #{key}" }


The reason you're using blocks here is to avoid allocating the logging string unless you're at the debugging level?

I was just doing it the way we were doing it before, but I think that is probably the reason.

That is the reason.

sirupsen · 2016-10-01T13:04:23Z

lib/identity_cache/fallback_fetcher.rb

@@ -11,7 +11,9 @@ def write(key, value)
    end

    def delete(key)
-      @cache_backend.delete(key)
+      result = @cache_backend.delete(key)


Why is it OK to do this in the fallback fetcher? What is the fallback fetcher?

The fallback fetcher is just for compatibility for cache backends that don't support CAS operations. memcached_store is the only active support cache store that I know of that supports CAS.

The difference is that now it will just unconditionally log [IdentityCache] delete for #{key} with debug logging and won't attempt to indicate whether it failed or not.

dylanahsmith · 2016-10-01T20:47:11Z

Sure, I'll add a test for this. Also, I think we should probably use a higher log level for logging that the delete failed, since normally we wouldn't log at the debug level in production, yet this could be useful to investigating cache corruption.

camilo · 2016-10-19T04:26:12Z

So, TL;DR the one reason to do this is to make the non-CAS capable adaptors logging less noisy? Or is most of that logging just useless now?

This LGTM in isolation but I wonder if we should make CAS a requirement it has improved a bunch of cache desynchronization things for us

cc @fbogsany

sirupsen approved these changes Oct 1, 2016

View reviewed changes

casperisfine force-pushed the master branch from 97719a0 to daaba73 Compare May 5, 2020 10:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move delete failure detection to the cache fetcher. #296

Move delete failure detection to the cache fetcher. #296

dylanahsmith commented Oct 1, 2016

sirupsen left a comment

sirupsen Oct 1, 2016 •

edited

dylanahsmith Oct 1, 2016

camilo Oct 19, 2016

sirupsen Oct 1, 2016

dylanahsmith Oct 1, 2016

dylanahsmith commented Oct 1, 2016

camilo commented Oct 19, 2016

Move delete failure detection to the cache fetcher. #296

Are you sure you want to change the base?

Move delete failure detection to the cache fetcher. #296

Conversation

dylanahsmith commented Oct 1, 2016

Problem

Solution

sirupsen left a comment

Choose a reason for hiding this comment

sirupsen Oct 1, 2016 • edited

Choose a reason for hiding this comment

dylanahsmith Oct 1, 2016

Choose a reason for hiding this comment

camilo Oct 19, 2016

Choose a reason for hiding this comment

sirupsen Oct 1, 2016

Choose a reason for hiding this comment

dylanahsmith Oct 1, 2016

Choose a reason for hiding this comment

dylanahsmith commented Oct 1, 2016

camilo commented Oct 19, 2016

sirupsen Oct 1, 2016 •

edited