Set CONTENT_LENGTH for chunked requests #2287

eugeneius · 2020-05-29T00:56:13Z · edited by nateberkopec

Description

Chunked requests don't contain a Content-Length header, but Puma buffers the entire request body upfront, which means it can determine the length before dispatching to the application.
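As a rough illustration of why this matters to applications, here is a minimal sketch of a handler that reads exactly `CONTENT_LENGTH` bytes from the request input. `read_body` is a hypothetical helper written for this example, not code from Puma, Rack, or Rails, and the env hashes are built by hand rather than produced by a real server.

```ruby
require "stringio"

# Hypothetical handler: reads exactly CONTENT_LENGTH bytes from rack.input.
read_body = lambda do |env|
  # nil.to_i is 0, so a missing CONTENT_LENGTH means "read zero bytes".
  length = env["CONTENT_LENGTH"].to_i
  env["rack.input"].read(length)
end

# Hand-built env for a chunked request with no CONTENT_LENGTH set.
without_length = {
  "HTTP_TRANSFER_ENCODING" => "chunked",
  "rack.input" => StringIO.new("hello world")
}

# The same request with the length of the decoded body advertised.
with_length = {
  "HTTP_TRANSFER_ENCODING" => "chunked",
  "CONTENT_LENGTH" => "11",
  "rack.input" => StringIO.new("hello world")
}

body_without = read_body.call(without_length) # empty string
body_with = read_body.call(with_length)       # full body
```

An application written this way sees an empty body unless the server sets `CONTENT_LENGTH`, even though the buffered body is fully available.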

The Rack spec doesn't mandate the presence of the CONTENT_LENGTH header, but it does refer to it as a "CGI key" and draws a distinction between it and the HTTP Content-Length header:

https://github.com/rack/rack/blob/v2.2.2/SPEC.rdoc

> The environment must not contain the keys HTTP_CONTENT_TYPE or HTTP_CONTENT_LENGTH (use the versions without HTTP_). The CGI keys (named without a period) must have String values.
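To make the distinction concrete, here is a small sketch of how a server might translate request header names into environment keys under that rule. `rack_env_key` and `CGI_KEYS` are hypothetical names used only for illustration; this is not how Puma or Rack implement the mapping.

```ruby
# Header names that the Rack spec says must appear without the HTTP_ prefix.
CGI_KEYS = %w[CONTENT_TYPE CONTENT_LENGTH].freeze

# Hypothetical helper: maps an HTTP header name to its Rack env key.
def rack_env_key(header_name)
  key = header_name.upcase.tr("-", "_")
  CGI_KEYS.include?(key) ? key : "HTTP_#{key}"
end

length_key = rack_env_key("Content-Length")      # "CONTENT_LENGTH"
coding_key = rack_env_key("Transfer-Encoding")   # "HTTP_TRANSFER_ENCODING"
```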

RFC 3875, which defines the CGI protocol including CONTENT_LENGTH, says:

https://tools.ietf.org/html/rfc3875#section-4.1.2

> The server MUST set this meta-variable if and only if the request is accompanied by a message-body entity. The CONTENT_LENGTH value must reflect the length of the message-body after the server has removed any transfer-codings or content-codings.

"Removing a transfer-coding" is precisely what Puma is doing when it parses a chunked request.

RFC 7230, the most recent specification of HTTP 1.1, includes a pseudo-code algorithm for decoding chunked requests that roughly matches the behaviour implemented here:

https://tools.ietf.org/html/rfc7230#section-4.1.3

I say "roughly" because we don't update the Transfer-Encoding header or parse trailers.
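For readers who want to see the shape of that algorithm, here is a minimal Ruby sketch of decoding a fully buffered chunked body and deriving the length from the result. `decode_chunked` is a hypothetical helper written for this example; it is not Puma's parser, it assumes the whole body is already in memory, and like this PR it ignores trailers and leaves the Transfer-Encoding header alone.

```ruby
require "stringio"

# Hypothetical helper, not Puma's implementation: decodes a complete
# chunked body, loosely following the outline in RFC 7230 section 4.1.3.
def decode_chunked(raw)
  io = StringIO.new(raw)
  body = String.new # binary-encoded buffer for the decoded bytes

  loop do
    size_line = io.gets("\r\n") or raise ArgumentError, "missing chunk size"
    # Drop any chunk extension after ";" and parse the size as hexadecimal.
    hex = size_line.chomp("\r\n").split(";", 2).first.to_s.strip
    size = Integer(hex, 16)
    break if size.zero? # the zero-length chunk ends the body

    data = io.read(size)
    raise ArgumentError, "truncated chunk" if data.nil? || data.bytesize < size
    body << data
    raise ArgumentError, "missing CRLF after chunk" unless io.read(2) == "\r\n"
  end

  # Trailers, if any, are left unread in this sketch.
  body
end

env = {}
decoded = decode_chunked("5\r\nhello\r\n6;ext=1\r\n world\r\n0\r\n\r\n")
# CGI keys must have String values, so the length is stored as a String.
env["CONTENT_LENGTH"] = decoded.bytesize.to_s
```

The length is taken from the decoded body rather than from the bytes on the wire, which is the "after the server has removed any transfer-codings" reading of RFC 3875.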

Your checklist for this pull request

I have reviewed the guidelines for contributing to this repository.
I have added an entry to History.md if this PR fixes a bug or adds a feature. If it doesn't need an entry in History.md, I have added [changelog skip] to the pull request title.
I have added appropriate tests if this PR fixes a bug or adds a feature.
My pull request is 100 lines added/removed or less so that it can be easily reviewed.
If this PR doesn't need tests (docs change), I added [ci skip] to the title of the PR.
If this closes any issues, I have added "Closes #issue" to the PR description or my commit messages.
I have updated the documentation accordingly.
All new and existing tests passed, including Rubocop.


evanphx

Tracking and advertising the size of the chunked body seems like a good change. Seems unlikely anyone was depending on CONTENT_LENGTH being absent when chunked encoding was used.

TheRusskiy · 2020-11-06T22:06:02Z

Hey, is there any chance this could be backported to the 4.x series? This seems like a pretty serious omission, and 5.x is full of breaking changes.
Thank you.

nateberkopec · 2020-11-09T17:02:01Z

@TheRusskiy We do not have a previous-major-version maintenance policy. I've created a new 4-3-stable branch, if you'd like to make a pull request against it and backport.

TheRusskiy · 2020-11-28T14:04:30Z

@nateberkopec backporting here: #2496

eugeneius mentioned this pull request May 29, 2020

Fixes raw_post being empty for "Transfer-Encoding: chunked" rails/rails#37423

Closed

nateberkopec added the bug label May 29, 2020

evanphx approved these changes May 29, 2020

nateberkopec merged commit f9ddd58 into puma:master May 31, 2020

TheRusskiy mentioned this pull request Nov 28, 2020

Backport set CONTENT_LENGTH for chunked requests #2496

Merged

8 tasks

tooooooooomy mentioned this pull request Dec 21, 2022

raw_post being empty for "Transfer-Encoding: chunked" rails/rails#46784

Closed

pedro108 mentioned this pull request Feb 9, 2023

Fix raw_post empty bug when when Transfer-Encoding: chunked rails/rails#47336

Merged

3 tasks
