Allow custom headers in multipart/form-data requests #1936

adriangb · 2021-11-12T18:55:22Z

Multipart requests allow HTTP headers to be set on individual form items.
This pull request allows multipart file uploads to specify those additional headers on a per-file basis.

Similar pull request against the Starlette project: encode/starlette#1311

Edit by @tomchristie: Updated description

tomchristie · 2022-01-07T10:02:11Z

httpx/_multipart.py

+                if "Content-Type" in headers:
+                    raise ValueError(
+                        "Content-Type cannot be included in multipart headers"
+                    )


Perhaps we don't need to do this check.

It's odd behaviour for the developer to set the content_type and then override it with the actual value provided in the custom headers. But it's not broken.

My preference would be that we don't do the explicit check here. In the case of conflicts I'd probably have header values take precedence.

I'm not absolute on this one, but slight preference.

Makes sense. I thought it'd be a good idea to check what requests does here. It looks like it silently ignores the header in the header. That is:

requests.post("http://example.com", files=[("test", ("test_filename", b"data", "text/plain", {"Content-Type": "text/csv"}))])

Gets sent as text/plain.

Digging into why this is the case, it seems like it's just an implementation detail in urllib3. It happens here.

I'm not sure what the right thing to do here is, but if you feel like it's best to go with no error and making header values take precedence, I'm happy to implement that.

Another alternative would be to have the 3rd parameter be either a string representing the content type or a headers dict. We can't really make the 3rd parameter always be a headers dict because that would be a breaking change for httpx.
This would eliminate the edge case, but deviates from requests' API. It seems pretty reasonable that if I'm specifying headers I'm doing advanced stuff and so specifying the content type in the headers directly would not be an issue.

I'm not sure what the right thing to do here is, but if you feel like it's best to go with no error and making header values take precedence, I'm happy to implement that.

I reckon let's do that, yeah.

Another alternative would be to have the 3rd parameter be either a string representing the content type or a headers dict. We can't really make the 3rd parameter always be a headers dict because that would be a breaking change for httpx.

I actually quite like that yes, neat idea. The big-tuples API is... not helpful really. But let's probably just go with the path of least resistance here. Perhaps one day we'll want an httpx 2.0, where we gradually start deprecating the various big-tuples bits of API in favour of a neater style.

I reckon let's do that, yeah.

👍 donzo

Another alternative would be to have the 3rd parameter be either a string representing the content type or a headers dict. We can't really make the 3rd parameter always be a headers dict because that would be a breaking change for httpx.

I actually quite like that yes, neat idea. The big-tuples API is... not helpful really. But let's probably just go with the path of least resistance here. Perhaps one day we'll want an httpx 2.0, where we gradually start deprecating the various big-tuples bits of API in favour of a neater style.

Agreed! I added a comment in the code explaining the reasoning behind the big tuple API (inherited from requests) and how we might want to change it in the future.

tomchristie · 2022-01-11T10:35:07Z

httpx/_multipart.py

+                    filename, fileobj = value  # type: ignore
+            else:
+                # corresponds to (filename, fileobj, content_type, headers)
+                headers = {k.title(): v for k, v in headers.items()}


I don't think we should .title() case here.

Ah... I see the comparison case. Huh. Fiddly.

httpx/_multipart.py

tomchristie · 2022-01-11T10:44:56Z

httpx/_multipart.py

+        if content_type is not None and "Content-Type" not in headers:
+            # note that unlike requests, we ignore the content_type
+            # provided in the 3rd tuple element if it is also included in the headers
+            # requests does the opposite


Okay maybe we should instead do it the other way. If the 4-tuple is used, just ignore the content_type variable. That'd be okay enough, matches requests more closely, and we can forget about fiddly case-based header checking.

requests does the opposite: it ignores the header in the 4th tuple element. so we'll still need the case-based header checking if we want to do exactly what requests does. either way, we need to know if the content type header exists in the 4th element tuple so we can either ignore the 3rd element or overwrite it with the 3rd element.

tomchristie

I'm happy with this pull request except that I would rather we don't force-change the casing on the headers. That introduces a hidden little bit of behaviour surprise that I'd rather avoid.

Obvs we do still want to do a case-insensitive comparison for the Content-Type case tho.

tomchristie · 2022-01-12T10:49:52Z

httpx/_multipart.py

            content_type = guess_content_type(filename)

+        if content_type is not None and "Content-Type" not in headers:


Perhaps...

has_content_type_header = any(["content-type" in key.lower() for key in headers]) if content_type is not None and not has_content_type_header: ...

?

I adapted it to any("content-type" in key.lower() for key in headers) (so it'll stop early).
Also removed the {header.title() ...} line.

httpx/_multipart.py

tomchristie · 2022-01-13T08:49:09Z

Lovely stuff. 👍

adriangb added 7 commits November 12, 2021 12:47

feat: allow passing multipart headers

b6a4c5b

Add test for including content-type in headers

01fc2fc

lint

79fcd72

Merge branch 'master' into multipart-advanced

08f96bf

Merge branch 'master' into multipart-advanced

9421685

Merge branch 'master' into multipart-advanced

a43ffe7

Merge branch 'master' into multipart-advanced

cbb9f05

adriangb requested a review from tomchristie December 24, 2021 13:26

adriangb added 2 commits January 5, 2022 11:40

Merge branch 'master' into multipart-advanced

0a913b9

Merge branch 'master' into multipart-advanced

c55188d

tomchristie reviewed Jan 7, 2022

View reviewed changes

Merge branch 'master' into multipart-advanced

8e04a40

Kludex mentioned this pull request Jan 7, 2022

Replace HTTP client on TestClient from requests to httpx encode/starlette#1376

Merged

4 tasks

adriangb added 2 commits January 10, 2022 08:21

Merge branch 'master' into multipart-advanced

6fe8c06

override content_type with headers

ec474fd

tomchristie reviewed Jan 11, 2022

View reviewed changes

httpx/_multipart.py Outdated Show resolved Hide resolved

tomchristie reviewed Jan 11, 2022

View reviewed changes

compare tuples based on length

79d1521

tomchristie reviewed Jan 12, 2022

View reviewed changes

adriangb added 2 commits January 12, 2022 08:43

incorporate suggestion

195c661

remove .title() on headers

1c9dc26

tomchristie reviewed Jan 13, 2022

View reviewed changes

httpx/_multipart.py Show resolved Hide resolved

tomchristie changed the title ~~feat: allow specification of additional headers in multipart/form-data requests~~ Allow custom headers in multipart/form-data requests Jan 13, 2022

tomchristie merged commit 0f1ff50 into encode:master Jan 13, 2022

adriangb deleted the multipart-advanced branch January 13, 2022 09:16

tomchristie mentioned this pull request Jan 26, 2022

Version 0.22.0 #2048

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow custom headers in multipart/form-data requests #1936

Allow custom headers in multipart/form-data requests #1936

adriangb commented Nov 12, 2021 •

edited by tomchristie

tomchristie Jan 7, 2022

adriangb Jan 7, 2022 •

edited

tomchristie Jan 10, 2022

adriangb Jan 10, 2022 •

edited

tomchristie Jan 11, 2022

tomchristie Jan 11, 2022

tomchristie Jan 11, 2022

adriangb Jan 11, 2022

tomchristie left a comment

tomchristie Jan 12, 2022

adriangb Jan 12, 2022

tomchristie commented Jan 13, 2022

		content_type = guess_content_type(filename)

		if content_type is not None and "Content-Type" not in headers:

Allow custom headers in multipart/form-data requests #1936

Allow custom headers in multipart/form-data requests #1936

Conversation

adriangb commented Nov 12, 2021 • edited by tomchristie

Choose a reason for hiding this comment

adriangb Jan 7, 2022 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

adriangb Jan 10, 2022 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tomchristie left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tomchristie commented Jan 13, 2022

adriangb commented Nov 12, 2021 •

edited by tomchristie

adriangb Jan 7, 2022 •

edited

adriangb Jan 10, 2022 •

edited