CrawlerProcess: initialize the reactor only once #5436

Gallaecio · 2022-03-02T15:07:24Z

Pending:

- Test coverage
- Check whether CrawlerRunner works as expected (should it initialize the reactor in any scenario?)

codecov · 2022-03-02T15:18:28Z

Codecov Report

Merging #5436 (edbf397) into 2.6 (23537a0) will decrease coverage by 0.07%.
The diff coverage is 100.00%.

❗ Current head edbf397 differs from pull request most recent head 3bf6bae. Consider uploading reports for the commit 3bf6bae to get more accurate results

@@            Coverage Diff             @@
##              2.6    #5436      +/-   ##
==========================================
- Coverage   88.77%   88.70%   -0.08%     
==========================================
  Files         163      163              
  Lines       10666    10667       +1     
  Branches     1818     1788      -30     
==========================================
- Hits         9469     9462       -7     
- Misses        922      929       +7     
- Partials      275      276       +1

| Impacted Files | Coverage Δ | |
|---|---|---|
| scrapy/crawler.py | `88.82% <100.00%> (+0.18%)` | ⬆️ |
| scrapy/core/downloader/handlers/__init__.py | `83.63% <0.00%> (-9.10%)` | ⬇️ |
| scrapy/downloadermiddlewares/cookies.py | `95.78% <0.00%> (-2.11%)` | ⬇️ |
| scrapy/shell.py | `67.96% <0.00%> (-0.79%)` | ⬇️ |
| scrapy/http/response/__init__.py | `97.43% <0.00%> (-0.04%)` | ⬇️ |
| scrapy/http/request/__init__.py | `97.77% <0.00%> (-0.03%)` | ⬇️ |
| scrapy/utils/python.py | `87.64% <0.00%> (ø)` | |
| scrapy/commands/check.py | `71.01% <0.00%> (ø)` | |

Gallaecio · 2022-03-07T09:34:48Z

CrawlerRunner is fine: unlike with CrawlerProcess, existing tests would have failed had CrawlerRunner initialized the reactor.
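
For readers following along, here is a minimal, self-contained sketch of the behaviour described in this thread. It is not Scrapy's code: `ToyReactor`, `ToyCrawler`, `ToyRunner`, and `ToyProcess` are hypothetical stand-ins, and the assumption is simply that a reactor may be installed at most once per process. The runner never installs it, while the process installs it only for the first crawler it creates.

```python
class ToyReactor:
    """Hypothetical stand-in for a global reactor that may be installed once."""

    installed = 0

    @classmethod
    def install(cls):
        # Installing a second time is an error, which is the failure mode
        # a multi-spider run would hit if every crawler tried to install it.
        if cls.installed:
            raise RuntimeError("reactor already installed")
        cls.installed += 1


class ToyCrawler:
    """Hypothetical crawler that installs the reactor only when asked to."""

    def __init__(self, spider_name, init_reactor=False):
        self.spider_name = spider_name
        if init_reactor:
            ToyReactor.install()


class ToyRunner:
    """Never touches the reactor; the caller is expected to manage it."""

    def __init__(self):
        self.crawlers = []

    def _create_crawler(self, spider_name):
        return ToyCrawler(spider_name)

    def crawl(self, spider_name):
        crawler = self._create_crawler(spider_name)
        self.crawlers.append(crawler)
        return crawler


class ToyProcess(ToyRunner):
    """Installs the reactor for the first crawler only."""

    def __init__(self):
        super().__init__()
        self._initialized_reactor = False

    def _create_crawler(self, spider_name):
        # Only the first crawler gets init_reactor=True.
        init_reactor = not self._initialized_reactor
        self._initialized_reactor = True
        return ToyCrawler(spider_name, init_reactor=init_reactor)
```

In this sketch, calling `crawl()` twice on a `ToyProcess` installs the toy reactor exactly once, whereas a variant that passed `init_reactor=True` for every crawler would raise on the second call.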

odimko · 2022-04-13T15:49:23Z

@wRAR when is it going to be released?

Gallaecio · 2022-04-13T17:41:47Z

2 weeks ago according to plan :)

We have had some busy weeks. My current hope is to release before the end of the month.

odimko · 2022-04-13T19:08:57Z

Awesome, @Gallaecio 💯
Thanks for letting me know. Looking forward to it 👍🏻

Gallaecio marked this pull request as ready for review March 2, 2022 15:50

wRAR reviewed Mar 2, 2022

Gallaecio changed the base branch from master to 2.6 March 2, 2022 16:29

Gallaecio added 3 commits March 2, 2022 17:30

CrawlerProcess: initiate the reactor only once

3ecbea4

CrawlerProcess: test a multi-spider scenario

96fc4da

initiated → initialized

3bf6bae

Gallaecio force-pushed the fix-crawlerprocess-regression branch from 08b0fdf to 3bf6bae Compare March 2, 2022 16:30

Gallaecio modified the milestone: 2.6.2 Mar 4, 2022

elacuesta changed the title ~~CrawlerProcess: initiate the reactor only once~~ CrawlerProcess: initialize the reactor only once Mar 10, 2022

wRAR approved these changes Mar 25, 2022

wRAR merged commit 35b44f3 into scrapy:2.6 Mar 25, 2022

Gallaecio mentioned this pull request Apr 7, 2022

2.6.0 breaks calling multiple Spider in CrawlerProcess() #5435

Closed

Laerte mentioned this pull request Apr 8, 2022

scrapy check fails when there is more than one spider #5467

Closed
