Proposal: Add a mode where all actual file system interactions are delayed until the build completes #3321

jakemac53 · 2022-06-10T18:15:15Z

Problem

Today we eagerly delete all invalidated outputs, and then incrementally re-output them during the build.

This causes problems for the analyzer (or any other tool watching the file system), because it sees a large number of file updates spread over a long period of time. The analyzer ends up re-analyzing the same files multiple times as their dependencies are written, and it reports spurious errors from the start of the build until it completes.

This leads to a bad IDE experience for users, especially in large projects with long builds.

Solution

The general idea is to delay all file system interactions until a build is complete, and then do the minimal amount of file system interactions possible. If a file was invalidated but re-written with the same content, we shouldn't touch it at all. If it changed, we should overwrite the existing file directly rather than deleting and re-creating it, and so on.
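To make the "minimal interactions" idea concrete, here is a rough, language-neutral sketch in Python rather than Dart. It is not build_runner code: `plan_flush`, its parameters, and the file names are all hypothetical, and SHA-256 is just one possible digest. It takes the current on-disk contents plus the deletes and writes recorded during a build, and returns only the operations that are actually needed.

```python
import hashlib
from typing import Dict, List, Set, Tuple


def digest(data: bytes) -> str:
    """Content digest used to detect unchanged outputs (SHA-256 is an assumption)."""
    return hashlib.sha256(data).hexdigest()


def plan_flush(
    on_disk: Dict[str, bytes],
    pending_writes: Dict[str, bytes],
    pending_deletes: Set[str],
) -> List[Tuple[str, str]]:
    """Return the minimal list of (action, path) pairs to apply after the build."""
    actions: List[Tuple[str, str]] = []
    for path in sorted(pending_deletes):
        # A file that was invalidated and then re-written is handled below.
        if path not in pending_writes and path in on_disk:
            actions.append(("delete", path))
    for path, data in sorted(pending_writes.items()):
        old = on_disk.get(path)
        if old is None:
            actions.append(("create", path))
        elif digest(old) != digest(data):
            actions.append(("overwrite", path))
        # Same content as before: do not touch the file at all.
    return actions


# Hypothetical build: three outputs invalidated, two regenerated, one new.
on_disk = {"a.g.dart": b"A", "b.g.dart": b"B", "c.g.dart": b"C"}
pending_writes = {"a.g.dart": b"A", "b.g.dart": b"B2", "d.g.dart": b"D"}
pending_deletes = {"a.g.dart", "b.g.dart", "c.g.dart"}
plan = plan_flush(on_disk, pending_writes, pending_deletes)
```

In this example `a.g.dart` is never touched, `b.g.dart` is overwritten in place, `c.g.dart` is deleted, and `d.g.dart` is created, so a file watcher would see three events instead of the six (three deletes, three writes) that the eager strategy produces.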

High level design

Add a new lifecycle method to RunnerAssetWriter, onBuildComplete or similar, which is called when a build finishes. Most implementations will just delegate the call to any other writer they wrap, or possibly call super if they extend another writer.
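As a sketch of what "just delegate" could look like, here is a small Python model. The class names are made up and this is not the real RunnerAssetWriter interface. The base class gives the hook a no-op default, and a wrapper that does no buffering of its own simply forwards the call to the writer it wraps.

```python
from typing import List, Tuple


class Writer:
    """Hypothetical base interface, standing in for RunnerAssetWriter."""

    def write(self, path: str, data: bytes) -> None:
        raise NotImplementedError

    def delete(self, path: str) -> None:
        raise NotImplementedError

    def on_build_complete(self) -> None:
        """Lifecycle hook; by default there is nothing to flush."""


class RecordingDiskWriter(Writer):
    """Stand-in for a writer that touches the disk; it only logs calls here."""

    def __init__(self) -> None:
        self.events: List[Tuple[str, str]] = []

    def write(self, path: str, data: bytes) -> None:
        self.events.append(("write", path))

    def delete(self, path: str) -> None:
        self.events.append(("delete", path))

    def on_build_complete(self) -> None:
        self.events.append(("build_complete", ""))


class ForwardingWriter(Writer):
    """A wrapper with no state of its own: every call goes to the inner writer."""

    def __init__(self, inner: Writer) -> None:
        self._inner = inner

    def write(self, path: str, data: bytes) -> None:
        self._inner.write(path, data)

    def delete(self, path: str) -> None:
        self._inner.delete(path)

    def on_build_complete(self) -> None:
        # Forward the hook so a buffering writer deeper in the chain still fires.
        self._inner.on_build_complete()


disk = RecordingDiskWriter()
chain = ForwardingWriter(ForwardingWriter(disk))
chain.write("lib/a.g.dart", b"// generated")
chain.on_build_complete()
```

The point of the forwarding is that the hook reaches the innermost writer no matter how many wrappers sit in front of it.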

Add a new DelayedAssetWriter which only records actions. It wraps another writer, but does not delegate calls to it during the build. This will capture both pending deletes and pending writes as they happen, and it will need to cache the bytes of the writes. It will implement onBuildComplete and perform all the actual file system interactions there. This class will also need to take an AssetReader in its constructor, if we want it to be able to compare digests and be smart about not doing unnecessary writes.
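A minimal sketch of that recording behaviour, again in Python with invented names (`DelayedWriter`, `InMemoryDisk`) rather than the real Dart classes. During the build nothing reaches the wrapped writer; at build completion the recorded state is compared against what the reader sees, and only the differences are applied.

```python
import hashlib
from typing import Dict, List, Optional, Set, Tuple


def _digest(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()


class InMemoryDisk:
    """Stand-in for the wrapped writer and the reader; logs real operations."""

    def __init__(self, files: Optional[Dict[str, bytes]] = None) -> None:
        self.files: Dict[str, bytes] = dict(files or {})
        self.ops: List[Tuple[str, str]] = []

    def read(self, path: str) -> Optional[bytes]:
        return self.files.get(path)

    def write(self, path: str, data: bytes) -> None:
        self.files[path] = data
        self.ops.append(("write", path))

    def delete(self, path: str) -> None:
        self.files.pop(path, None)
        self.ops.append(("delete", path))


class DelayedWriter:
    """Records writes and deletes; applies them only in on_build_complete."""

    def __init__(self, inner: InMemoryDisk, reader: InMemoryDisk) -> None:
        self._inner = inner
        self._reader = reader
        self.pending_writes: Dict[str, bytes] = {}
        self.pending_deletes: Set[str] = set()

    def write(self, path: str, data: bytes) -> None:
        self.pending_deletes.discard(path)  # a later write cancels the delete
        self.pending_writes[path] = data    # cache the bytes in memory

    def delete(self, path: str) -> None:
        self.pending_writes.pop(path, None)
        self.pending_deletes.add(path)

    def on_build_complete(self) -> None:
        for path in sorted(self.pending_deletes):
            if self._reader.read(path) is not None:
                self._inner.delete(path)
        for path, data in sorted(self.pending_writes.items()):
            old = self._reader.read(path)
            if old is None or _digest(old) != _digest(data):
                self._inner.write(path, data)  # skip unchanged content
        self.pending_writes.clear()
        self.pending_deletes.clear()
```

A quick usage example: with `disk = InMemoryDisk({"a": b"1", "b": b"2", "c": b"3"})` and `w = DelayedWriter(disk, disk)`, deleting all three files and then re-writing `a` unchanged and `b` with new bytes leaves `disk.ops` empty until `w.on_build_complete()` runs, after which it holds one delete (`c`) and one write (`b`).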

We will also need a new AssetReader implementation that holds a reference to this DelayedAssetWriter and reads files from its cache, so that later build steps can see outputs that have not reached the disk yet. Some more design will need to be done to figure out exactly the best place to slot that reader in.
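One way to picture that reader is as an overlay: pending writes win, pending deletes hide files, and everything else falls through to disk. The Python sketch below uses made-up names (`PendingState`, `OverlayReader`) and a plain dict in place of the real file system; it is only meant to show the lookup order, not where the reader would sit in build_runner.

```python
from typing import Dict, Set


class PendingState:
    """Hypothetical view of the delayed writer's in-memory cache."""

    def __init__(self) -> None:
        self.pending_writes: Dict[str, bytes] = {}
        self.pending_deletes: Set[str] = set()


class OverlayReader:
    """Reads from the pending cache first, then from the underlying disk."""

    def __init__(self, pending: PendingState, disk: Dict[str, bytes]) -> None:
        self._pending = pending
        self._disk = disk  # a dict stands in for the real file system

    def can_read(self, path: str) -> bool:
        if path in self._pending.pending_writes:
            return True
        if path in self._pending.pending_deletes:
            return False  # still on disk, but logically gone
        return path in self._disk

    def read(self, path: str) -> bytes:
        if path in self._pending.pending_writes:
            return self._pending.pending_writes[path]
        if path in self._pending.pending_deletes or path not in self._disk:
            raise FileNotFoundError(path)
        return self._disk[path]


pending = PendingState()
disk = {"lib/a.dart": b"source", "lib/a.g.dart": b"stale output"}
reader = OverlayReader(pending, disk)

pending.pending_deletes.add("lib/a.g.dart")       # invalidated at build start
stale_visible = reader.can_read("lib/a.g.dart")   # hidden although still on disk
pending.pending_deletes.discard("lib/a.g.dart")
pending.pending_writes["lib/a.g.dart"] = b"fresh output"
fresh = reader.read("lib/a.g.dart")               # served from memory
```

Handling the pending deletes matters as much as the writes: without that check, a builder could read a stale output that is still physically on disk.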

Risks

Cache Size

This strategy requires all files written in a given build to fit in memory together, with an unbounded cache size. Some potential options we could consider:

Have a bounded-size cache, and write files in batches if the cache starts to fill up.
Only use this strategy for Dart files, and not other files. Some other tools might not get the benefit, but it would still fix the analyzer problem.
Only use this strategy for files under a certain size.
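The three options above are not mutually exclusive. Purely as an illustration, here is a Python sketch that combines them. `BoundedWriteBuffer`, its parameters, and the thresholds are invented for this example, and the `flush` callback stands in for whatever actually writes to disk.

```python
from typing import Callable, Dict


class BoundedWriteBuffer:
    """Buffers small Dart files; writes everything else through immediately."""

    def __init__(
        self,
        flush: Callable[[Dict[str, bytes]], None],
        max_total_bytes: int,
        max_file_bytes: int,
    ) -> None:
        self._flush = flush
        self._max_total = max_total_bytes
        self._max_file = max_file_bytes
        self._buffer: Dict[str, bytes] = {}
        self._size = 0

    def _should_delay(self, path: str, data: bytes) -> bool:
        # Options 2 and 3: only Dart files, and only below a size threshold.
        return path.endswith(".dart") and len(data) <= self._max_file

    def _drop(self, path: str) -> None:
        old = self._buffer.pop(path, None)
        if old is not None:
            self._size -= len(old)

    def write(self, path: str, data: bytes) -> None:
        self._drop(path)  # forget any older buffered copy of this path
        if not self._should_delay(path, data):
            self._flush({path: data})  # write-through, never cached
            return
        self._buffer[path] = data
        self._size += len(data)
        if self._size > self._max_total:
            self.flush_all()  # option 1: cache is full, write a batch early

    def flush_all(self) -> None:
        """Called when the cache overflows and again when the build completes."""
        if self._buffer:
            self._flush(dict(self._buffer))
            self._buffer.clear()
            self._size = 0


batches = []
buf = BoundedWriteBuffer(batches.append, max_total_bytes=10, max_file_bytes=8)
buf.write("a.dart", b"12345")         # buffered (5 bytes)
buf.write("big.dart", b"123456789")   # over the per-file limit: written through
buf.write("data.json", b"{}")         # not a Dart file: written through
buf.write("b.dart", b"123456")        # total is 11 > 10: batch flushed early
buf.flush_all()                       # nothing left to flush
```

The trade-off is visible here: every early batch is a burst of file events in the middle of the build, so a bounded cache reduces churn for watchers but does not remove it.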

Could be breaking for some builders

If a builder today calls out to an external process which reads from the file system, that process won't see updated files until the build completes.

These builders are already encouraged to use package:scratch_space, which won't have this problem. All our own builders do this.


jakemac53 · 2022-06-10T18:15:46Z

cc @scheglov @devoncarew @jacob314 @natebosch @srawlins

natebosch · 2022-06-18T02:56:05Z

LGTM.

Do we already have a way that written assets are communicated through to the AssetReader, or are we always going through the file system today?

I think we should consider any flags related to this as experimental until we nail down which heuristics are actually worthwhile in practice. I don't want to be stuck with multiple strategies to tweak cache size if we find out that our first attempt doesn't work well, so we should communicate that clearly by putting "experiment" in the flag names.

jakemac53 · 2022-06-21T15:36:55Z

> Do we already have a way that written assets are communicated through to the AssetReader, or are we always going through the file system today?

I don't believe that we communicate between the reader and writer today. We end up reading the file from disk even if it was just written, and we don't cache it in memory.

jakemac53 added the type-enhancement A request for a change that isn't a bug label Jun 10, 2022

jakemac53 assigned scheglov and unassigned scheglov Jun 10, 2022

simolus3 mentioned this issue Dec 3, 2022

Add option to delay file system writes #3418

Closed
