Skip to content

Commit

Permalink
Merge pull request #6347 from aisha-partha/master
Browse files Browse the repository at this point in the history
Closes #6342. Setting METAREFRESH_IGNORE_TAGS to [‘noscript’] by default
  • Loading branch information
wRAR committed May 10, 2024
2 parents 4300a1d + fe163d9 commit 10a843a
Show file tree
Hide file tree
Showing 3 changed files with 7 additions and 4 deletions.
4 changes: 4 additions & 0 deletions docs/topics/downloader-middleware.rst
Original file line number Diff line number Diff line change
Expand Up @@ -884,6 +884,10 @@ Meta tags within these tags are ignored.
The default value of :setting:`METAREFRESH_IGNORE_TAGS` changed from
``['script', 'noscript']`` to ``[]``.

.. versionchanged:: VERSION
The default value of :setting:`METAREFRESH_IGNORE_TAGS` changed from
``[]`` to ``['noscript']``.

.. setting:: METAREFRESH_MAXDELAY

METAREFRESH_MAXDELAY
Expand Down
2 changes: 1 addition & 1 deletion scrapy/settings/default_settings.py
Original file line number Diff line number Diff line change
Expand Up @@ -239,7 +239,7 @@
MEMUSAGE_WARNING_MB = 0

METAREFRESH_ENABLED = True
METAREFRESH_IGNORE_TAGS = []
METAREFRESH_IGNORE_TAGS = ["noscript"]
METAREFRESH_MAXDELAY = 100

NEWSPIDER_MODULE = ""
Expand Down
5 changes: 2 additions & 3 deletions tests/test_downloadermiddleware_redirect.py
Original file line number Diff line number Diff line change
Expand Up @@ -395,9 +395,8 @@ def test_ignore_tags_default(self):
"""content="0;URL='http://example.org/newpage'"></noscript>"""
)
rsp = HtmlResponse(req.url, body=body.encode())
req2 = self.mw.process_response(req, rsp, self.spider)
assert isinstance(req2, Request)
self.assertEqual(req2.url, "http://example.org/newpage")
response = self.mw.process_response(req, rsp, self.spider)
assert isinstance(response, Response)

def test_ignore_tags_1_x_list(self):
"""Test that Scrapy 1.x behavior remains possible"""
Expand Down

0 comments on commit 10a843a

Please sign in to comment.