Allow for garbage after comment data #178

davide-romanini · 2020-08-13T11:08:30Z

Many zip in the wild could have some garbage after comment data, and normal zip tools silently ignore the garbage and preserve the good data.
This PR just removes the check that, in case of garbage, removes all the comment, preserving more zip data.

rylev

Thanks for this. Could you add a test for this?

Plecra · 2020-08-19T13:33:55Z

I'm not sure this is permitted by the ZIP spec. @davide-romanini Could you give us example(s!) of ZIP files like this?

davide-romanini · 2020-08-19T17:01:06Z

@rylev I've added a test with a small sample of the issue.
@Plecra specs may or may be not allow for this. But reality is often different, especially for a format so old.
In my case I found that python zipfile generates garbage when opening a zip in mode a and setting a comment with a shorter length than the initial one:

>>> from zipfile import ZipFile
>>> with ZipFile('comment_garbage.zip', 'a') as z:
...     z.comment = b'long comment bla bla bla'
...
>>> with ZipFile('comment_garbage.zip', 'a') as z:
...     z.comment = b'short.'
...
>>>

This doesn't cause any problem for other zip reading libraries, so I think it should be handled even if not officially supported by the spec.

rylev · 2020-08-25T14:58:28Z

@Plecra This seems reasonable to me. Thoughts?

Plecra · 2020-08-25T20:46:49Z

@davide-romanini Thanks for such a simple example 😀 This has made it very clear.

7zip lists it as a warning, but the file doesn't seem to break any of the common ZIP readers. So the question is how far we should go in supporting these files: The comment length is the only thing that bounds the time it takes to open the ZIP files (which is important for negative ID), so will we still only search within the first ~65536 bytes? I'd like to check InfoZIP's implementation for how they handle this, in case there's another oddity.

Plecra · 2020-11-15T21:32:59Z

No need for this to wait any longer. Thanks for your work @davide-romanini

davide-romanini added 2 commits August 13, 2020 13:02

allow for garbage after comment data

5c4f090

fix fmt

b91f48a

rylev suggested changes Aug 17, 2020

View reviewed changes

Plecra mentioned this pull request Aug 19, 2020

Release 0.5.7 #180

Closed

add test for handling comment garbage

5eefdf8

Plecra merged commit f5061c2 into zip-rs:master Nov 15, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow for garbage after comment data #178

Allow for garbage after comment data #178

davide-romanini commented Aug 13, 2020

rylev left a comment

Plecra commented Aug 19, 2020

davide-romanini commented Aug 19, 2020

rylev commented Aug 25, 2020

Plecra commented Aug 25, 2020

Plecra commented Nov 15, 2020

Allow for garbage after comment data #178

Allow for garbage after comment data #178

Conversation

davide-romanini commented Aug 13, 2020

rylev left a comment

Choose a reason for hiding this comment

Plecra commented Aug 19, 2020

davide-romanini commented Aug 19, 2020

rylev commented Aug 25, 2020

Plecra commented Aug 25, 2020

Plecra commented Nov 15, 2020