Caching remote discovery results #2976

Open
guruz opened this issue Mar 17, 2015 · 17 comments

Comments

@guruz
Contributor

guruz commented Mar 17, 2015

What about caching discovery results in the journal?

Normally this should not be needed as we're always syncing all files/dirs and storing the proper ETag in the journal.
However, in practice I think there are often issues like fatal sync errors, network timeouts, etc., which then mean a complete re-discovery on the next sync.

Something to chat about next week.

@ogoffart @dragotin @ckamm

@guruz guruz added this to the 1.9 - Multi-account milestone Mar 17, 2015
@MorrisJobke
Contributor

We currently notice some problems with randomly broken connections to an LDAP server, so the user backend can't properly return the home directory. To avoid randomly guessing what the correct home directory could be, we return an exception instead (owncloud/core#15332). In this case the sync client can also see such responses (which will cause an HTTP 500). If this happens during discovery, the whole discovery process is aborted and starts from scratch (and maybe never finishes because of this). Is it possible to just repeat the last (failed) request (because the error was most likely only temporary) and then resume the discovery from that point? We should maybe also improve the error message that is handed back, to say "this was just a temporary error and could be fixed with the next request" (a 503 with a Retry-After header).

That would make the client a bit more robust for these cases.
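A minimal sketch, assuming the discovery job can inspect the finished QNetworkReply, of how such a temporary 503 could be recognised and its Retry-After hint honoured so that only the failed PROPFIND is repeated instead of aborting the whole discovery. isTemporaryError() and scheduleRetry() are hypothetical names, not existing client code:

// Sketch only: recognise a "try again later" response during discovery.
// Retry-After may also be an HTTP-date; only the seconds form is handled here.
#include <QNetworkReply>
#include <QNetworkRequest>
#include <QByteArray>

static bool isTemporaryError(QNetworkReply *reply, int *retryAfterSecs)
{
    const int httpCode =
        reply->attribute(QNetworkRequest::HttpStatusCodeAttribute).toInt();
    if (httpCode == 503) {
        const QByteArray retryAfter = reply->rawHeader("Retry-After");
        *retryAfterSecs = retryAfter.isEmpty() ? 30 : retryAfter.toInt();
        return true;
    }
    return false;
}

// Hypothetical call site in the discovery job's error handler:
//   int delay = 0;
//   if (isTemporaryError(reply, &delay))
//       scheduleRetry(currentFolder, delay);  // repeat only this PROPFIND
//   else
//       abortSync();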

@guruz
Contributor Author

guruz commented Apr 4, 2017

@ogoffart @ckamm @jturcotte @PVince81 @DeepDiver1975 Now that we're thinking of doing the recursive/infinity PROPFIND I'm wondering if it makes sense to think about this again.

The infinity PROPFIND could be implemented by having it fill a local cache:
map (remote_dirName + ETag) -> contents

The issue was originally about any discovery problem. It would also help with flaky network connections, basically every time the discovery would abort.

This should be implemented in a way that doesn't use so much memory again (e.g. sqlite3?).
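A rough sketch of what such a (remote_dirName + ETag) -> contents cache could look like as a plain in-memory map; all names are made up for illustration, and a real implementation might back it with sqlite3 as mentioned above to keep memory usage bounded:

// Sketch of an in-memory discovery cache keyed by remote directory name,
// valid only while the stored ETag still matches the one seen on the server.
#include <string>
#include <unordered_map>

struct CachedFolder {
    std::string etag;      // directory ETag at the time it was listed
    std::string contents;  // e.g. the raw PROPFIND multistatus XML
};

using DiscoveryCache = std::unordered_map<std::string, CachedFolder>;

bool lookup(const DiscoveryCache &cache, const std::string &dir,
            const std::string &currentEtag, std::string *contents)
{
    const auto it = cache.find(dir);
    if (it == cache.end() || it->second.etag != currentEtag)
        return false;  // unknown or stale: fall back to a real PROPFIND
    *contents = it->second.contents;
    return true;
}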

@guruz guruz reopened this Apr 4, 2017
@guruz guruz added this to the 2.4.0 milestone Apr 4, 2017
@ogoffart
Contributor

ogoffart commented Apr 4, 2017

That cache already exists. It's called the database (sync journal).

@guruz
Contributor Author

guruz commented Apr 4, 2017 via email

@guruz
Contributor Author

guruz commented Apr 5, 2017

...which means it could be flushed after a successful sync.

@guruz
Contributor Author

guruz commented Apr 5, 2017

Similar to how it was implemented before: 980c176

@dragotin
Contributor

dragotin commented Apr 5, 2017

@ogoffart 💟
@guruz isn't it true that we only do real propfinds into a tree if the ETag of the directory has changed, and read from the client journal instead if not?

IIRC, what we discussed a long time ago was to keep the entire tree in memory so we don't have to rebuild the whole tree again; maybe that should be discussed again.

@ogoffart
Contributor

ogoffart commented Apr 5, 2017

It's true, we could have another table in the database if we want to persist the values between failed syncs.

@dragotin
Contributor

dragotin commented Apr 5, 2017

@ogoffart What would that look like? And what would make you confident that it is still valid? What would make it different from the existing table?

@guruz
Contributor Author

guruz commented Apr 5, 2017

If we want to persist it between failed syncs but for simplicity not between client restarts, we could use a sqlite3 temporary DB which gets automatically cleaned up and if small enough stays in memory: http://stackoverflow.com/a/32004907/2941

I was so far only thinking about a simple key-value store: (remote_dirName + ETag) -> dir_contents
The dir_contents could even be the plain XML returned from PROPFIND. We dice the incoming infinity PROPFIND into single PROPFINDs, or if we request a single PROPFIND, we put it in directly.
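A sketch of that key-value store on top of an sqlite3 temporary database; the table and column names are made up for illustration. Opening an empty filename gives sqlite3's private temporary database, which is deleted automatically when the connection closes and is typically kept in the page cache while it stays small:

// Sketch only: back the PROPFIND cache with an sqlite3 temporary database.
#include <sqlite3.h>

sqlite3 *openDiscoveryCache()
{
    sqlite3 *db = nullptr;
    if (sqlite3_open("", &db) != SQLITE_OK)  // "" = auto-deleted temporary DB
        return nullptr;
    sqlite3_exec(db,
        "CREATE TABLE propfind_cache ("
        "  path TEXT NOT NULL,"          // remote directory name
        "  etag TEXT NOT NULL,"          // directory ETag at listing time
        "  xml  BLOB,"                   // raw PROPFIND response for that dir
        "  PRIMARY KEY (path, etag))",
        nullptr, nullptr, nullptr);
    return db;
}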

@guruz
Contributor Author

guruz commented Apr 5, 2017

And what would make you confident that it is still valid?

Having ETag in the cache key.

@ogoffart
Contributor

ogoffart commented Apr 5, 2017

This is the current algorithm:

remote_discovery(folder) {
    folder_from_network = run_propfind(folder);
    folder_from_db = load_from_db(folder);
    if (folder_from_network.etag == folder_from_db.etag) {
        readFromDb(folder);
    } else {
        file_tree_walk(folder_from_network);  // recursively go to every folder
    }
}

The new algorithm would look like:

remote_discovery(folder) {
    folder_from_network = run_propfind(folder);
    folder_from_db = load_from_db_recursively(folder);
    if (folder_from_network.etag == folder_from_db.etag) {
        readFromDb(folder);
    } else {
        folder_from_cache = load_from_cache(folder);
        if (folder_from_network.etag == folder_from_cache.etag) {
            readFromCache(folder);
        } else {
            file_tree_walk(folder_from_network);  // recursively go to every folder
            add_to_cache(folder_from_network);
        }
    }
}

sounds pretty simple

If we want to persist it between failed syncs but for simplicity not between client restarts,

I think if it's in the db, it's just as simple to make it persist across client restarts. We'd clear the cache after every successful sync anyway.
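Continuing the illustrative propfind_cache table sketched earlier, the add_to_cache / load_from_cache helpers from the pseudocode and the clean-up after a successful sync could look roughly like this (error handling omitted, all names hypothetical):

// Sketch only: cache helpers on the illustrative propfind_cache table.
#include <sqlite3.h>
#include <string>

void addToCache(sqlite3 *db, const std::string &path,
                const std::string &etag, const std::string &xml)
{
    sqlite3_stmt *stmt = nullptr;
    sqlite3_prepare_v2(db,
        "INSERT OR REPLACE INTO propfind_cache (path, etag, xml) VALUES (?, ?, ?)",
        -1, &stmt, nullptr);
    sqlite3_bind_text(stmt, 1, path.c_str(), -1, SQLITE_TRANSIENT);
    sqlite3_bind_text(stmt, 2, etag.c_str(), -1, SQLITE_TRANSIENT);
    sqlite3_bind_blob(stmt, 3, xml.data(), int(xml.size()), SQLITE_TRANSIENT);
    sqlite3_step(stmt);
    sqlite3_finalize(stmt);
}

bool loadFromCache(sqlite3 *db, const std::string &path,
                   const std::string &etag, std::string *xml)
{
    sqlite3_stmt *stmt = nullptr;
    sqlite3_prepare_v2(db,
        "SELECT xml FROM propfind_cache WHERE path = ? AND etag = ?",
        -1, &stmt, nullptr);
    sqlite3_bind_text(stmt, 1, path.c_str(), -1, SQLITE_TRANSIENT);
    sqlite3_bind_text(stmt, 2, etag.c_str(), -1, SQLITE_TRANSIENT);
    const bool hit = sqlite3_step(stmt) == SQLITE_ROW;
    if (hit) {
        const char *blob = static_cast<const char *>(sqlite3_column_blob(stmt, 0));
        xml->assign(blob ? blob : "", blob ? sqlite3_column_bytes(stmt, 0) : 0);
    }
    sqlite3_finalize(stmt);
    return hit;  // a miss means: do the real PROPFIND and file_tree_walk
}

// Called after every successful sync, so the cache never outlives the journal.
void clearDiscoveryCache(sqlite3 *db)
{
    sqlite3_exec(db, "DELETE FROM propfind_cache", nullptr, nullptr, nullptr);
}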

@ckamm
Contributor

ckamm commented Apr 7, 2017

I agree that it makes sense to cache PROPFIND responses - even if the rest of the sync failed and the resulting changes aren't propagated yet because of it.

@ckamm
Contributor

ckamm commented Feb 15, 2018

I agree with @ogoffart - it should be relatively straightforward since our metadata table already functions as a propfind cache. Just having a second table with the same kind of staleness and retrieval logic should work. The cached data could be cleared once it arrives in the journal.

@guruz guruz modified the milestones: 2.5.0, 2.6.0 Mar 27, 2018
@ogoffart ogoffart self-assigned this Jul 4, 2018
@ckamm ckamm modified the milestones: 2.6.0, 2.7.0 Mar 25, 2019
@ckamm ckamm changed the title [Brainstorming] Discovery cache Caching remote discovery results Apr 11, 2019
@ckamm ckamm self-assigned this Apr 11, 2019
@ckamm
Contributor

ckamm commented Apr 11, 2019

I've started looking at this.

@ogoffart
Contributor

I think it would be better to do the discovery and propagation in parallel.

Now that the discovery has been refactored, it should be easier.

Then the cache wouldn't make sense anymore.

@ckamm
Contributor

ckamm commented Apr 12, 2019

@ogoffart Yeah, anything that reduces time between server query and writing to the db would do, and parallel propagation would have other advantages. Need to deal with deletes though. Let's discuss!
