Store information for files with lint warnings and errors in cache #9948

galvarez421 · 2018-02-06T00:31:50Z

The version of ESLint you are using.
4.15.0

The problem you want to solve.
Make cache feature (and its corresponding documentation) more useful and intuitive. Currently, the cache feature does not seem as useful or intuitive as it might be in that only information for files that pass linting with no warnings or errors is saved in the cache. I wonder if the cache feature could offer even more of a performance benefit if the cache were to store information for files with lint warnings or errors as well such that they don't have to run through a linting process if they haven't been changed since they were last linted with the cache feature enabled?

Your take on the correct solution to problem.
Perhaps we can extend the usage of the cache such that information for files with lint warnings or errors is also saved to the cache? @IanVS, following up on this comment

ilyavolodin · 2018-02-06T00:56:32Z

That sounds like a reasonable enhancement. I would be a bit worried about files with lining errors/warning with --fix flag and caching, since I think caching will happen before autofixes are applied.

not-an-aardvark · 2018-02-06T01:04:09Z

I'm concerned that this would make cache files quite large, because formatters need to be provided with the original source text of a file. If a cache file needed to maintain that information, it could effectively be the same size as the all of the javascript files combined. (However, maybe this could be mitigated by omitting the source text of a file from the results object, since we can just reread it from the filesystem if we know it hasn't changed).

platinumazure · 2018-02-06T02:04:04Z

However, maybe this could be mitigated by omitting the source text of a file from the results object, since we can just reread it from the filesystem if we know it hasn't changed

Agreed. We can still get a huge win here if we replace the full linting process with a read-and-report process for unchanged files with errors.

Full lint process (done on any file with errors even after linting once):

Read file from filesystem
Parse file contents
Prepare to lint (build selector events, set up scope analysis and visitor keys, etc.)
Traverse file and invoke visitors to lint file
Calculate output by applying fixes
Write output to filesystem
Cache results
Call formatter

Reading cached file results and reporting errors (assuming file mtime is earlier than cache write time so no need to re-lint):

Read cache to see that errors exist
Read file from filesystem
Call formatter with cached errors

My feeling is that this could be a massive benefit to someone doing a large refactor or introducing a new linting rule into their codebase. And it would be a small benefit to people who lint the codebase frequently even if they're only changing one file at a time.

galvarez421 · 2018-02-06T12:49:15Z

@ilyavolodin If auto fix occurs after results for a file have been cached, wouldn't that cause the file to be considered newer than the cache during the next lint and thereby cause the cache to be ignored for that file? In other words, wouldn't the cache process still work as expected (although maybe the timing of the autofix relative to the caching wouldn't)? Or are you concerned not about caching not working as expected but rather about the caching not being as effective as it could be due to the timing of autofixes? Based on the description by @platinumazure of the lint process, it seems that autofixes are applied before the results are outputted and cached, so maybe this is a non-issue.

platinumazure · 2018-02-06T13:05:25Z

Apologies, I don't know if caching is done before or after autofix. I was just thinking it should be afterwards, ideally speaking. If a lint run results in fixable errors only, then we could apply fixes and store a cache entry saying all went well.

…

On Feb 6, 2018 6:49 AM, "Gerry Alvarez" ***@***.***> wrote: @ilyavolodin <https://github.com/ilyavolodin> If auto fix occurs after results for a file have been cached, wouldn't that cause the file to be considered newer than the cache during the next lint and thereby cause the cache to be ignored for that file? In other words, wouldn't the cache process still work as expected (although maybe the timing of the autofix relative to the caching wouldn't)? Or are you concerned not about caching not working as expected but rather about the caching not being as effective as it could be due to the timing of autofixes? Based on the description by @platinumazure <https://github.com/platinumazure> of the lint process, it seems that autofixes are applied before the results are outputted and cached, so maybe this is a non-issue. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#9948 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AARWei7gycw8VShGWqK6gZJARuCzS8oPks5tSEpMgaJpZM4R6UGq> .

platinumazure · 2018-07-04T12:05:37Z

I'm finally digging into this. At this point, it does look like we write to the cache after the file is linted and autofixed, so I think we should be able to cache files which failed linting (though we may need to remove the full source text from the file entry cache, as discussed above).

platinumazure · 2018-07-06T22:02:31Z

Relabeling as "cli" based on previous CLIEngine issues (which don't have to do with linting/fixing) being labeled as "cli".

* Chore: Extract current cache logic into lint-result-cache module * Chore: Moved config hash validation to LintResultCache * Chore: Added tests for lint-result-cache * Chore: Small cleanup * Chore: Removing unnecessary comments * Chore: Remove unnecessary test fixture file * Chore: options.cache to this.options.cache

platinumazure · 2018-07-12T00:22:43Z

TSC Summary:

eslint --cache currently only caches files that passed linting. This is useful for users who have fully integrated ESLint into their codebase, so most of their files should be passing at any given time. However, for users that have not yet fully integrated ESLint (i.e., users who are still cleaning up lint errors), the cache becomes useless because most files are not passing at any given time, and we currently don't cache files which failed linting. It was suggested that files which failed linting could be cached, as long as we check that they haven't been modified between the cache time and the next lint run (in other words, following the same logic we follow for successfully linted files in the cache).

One challenge is that we currently include the entire source code of files with linting problems in the lint results object. This could result in the cache file growing significantly if there are a lot of files which failed linting. A possible approach to avoid this is to remove the source property when caching the lint results, and then when retrieving results from the cache, simply read the file from the filesystem again. This will slow the overall cache performance for files that failed linting, but still be much faster than reading, parsing, and linting the file again. (PR #10571 follows this approach.)

TSC Question:

Should we add files which failed linting to the lint results cache?

platinumazure · 2018-07-19T21:50:52Z

This issue was accepted in today's TSC meeting.

* Chore: Extract current cache logic into lint-result-cache module * Chore: Moved config hash validation to LintResultCache * Chore: Added tests for lint-result-cache * Chore: Small cleanup * Update: Cache files that failed linting (fixes #9948) * Chore: Remove unused "removeEntry" API from LintResultCache * Ensure empty source is handled correctly * Remove unnecessary test fixture * Don't cache files with output property

eslint-deprecated bot added the triage An ESLint team member will look at this issue soon label Feb 6, 2018

platinumazure mentioned this issue Feb 7, 2018

--cache flag not working #9802

Closed

platinumazure mentioned this issue Jul 4, 2018

Docs: Only successfully linted files are cached (fixes #9802) #10557

Merged

platinumazure mentioned this issue Jul 4, 2018

Chore: Extract lint result cache logic (refs #9948) #10562

Merged

platinumazure added a commit that referenced this issue Jul 6, 2018

Update: Cache files that failed linting (fixes #9948)

9bb123b

platinumazure added cli Relates to ESLint's command-line interface and removed core Relates to ESLint's core APIs and features labels Jul 6, 2018

platinumazure mentioned this issue Jul 6, 2018

Update: Cache files that failed linting (fixes #9948) #10571

Merged

platinumazure added the tsc agenda This issue will be discussed by ESLint's TSC at the next meeting label Jul 12, 2018

not-an-aardvark closed this as completed in #10571 Jul 21, 2018

eslint-deprecated bot locked and limited conversation to collaborators Jan 18, 2019

eslint-deprecated bot added the archived due to age This issue has been archived; please open a new issue for any further discussion label Jan 18, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Store information for files with lint warnings and errors in cache #9948

Store information for files with lint warnings and errors in cache #9948

galvarez421 commented Feb 6, 2018

ilyavolodin commented Feb 6, 2018

not-an-aardvark commented Feb 6, 2018

platinumazure commented Feb 6, 2018

galvarez421 commented Feb 6, 2018

platinumazure commented Feb 6, 2018 via email

platinumazure commented Jul 4, 2018

platinumazure commented Jul 6, 2018

platinumazure commented Jul 12, 2018 •

edited

platinumazure commented Jul 19, 2018

Store information for files with lint warnings and errors in cache #9948

Store information for files with lint warnings and errors in cache #9948

Comments

galvarez421 commented Feb 6, 2018

ilyavolodin commented Feb 6, 2018

not-an-aardvark commented Feb 6, 2018

platinumazure commented Feb 6, 2018

galvarez421 commented Feb 6, 2018

platinumazure commented Feb 6, 2018 via email

platinumazure commented Jul 4, 2018

platinumazure commented Jul 6, 2018

platinumazure commented Jul 12, 2018 • edited

platinumazure commented Jul 19, 2018

platinumazure commented Jul 12, 2018 •

edited