Handling invalid unicode escapes #192

GULPF · 2019-03-04T10:24:39Z

JSON5.parse uses the ES6 behavior and accepts invalid Unicode sequences like "\uDEAD". However, this is not the case for the CLI. The CLI instead outputs U+FFFD (the replacement character). This probably happens because of Node trying to ensure that the output is valid UTF-8. I think it's best if these codepoints are escaped in the output, so that "\uDEAD" generates the escaped output "\uDEAD".

The text was updated successfully, but these errors were encountered:

jordanbtucker · 2019-03-04T22:46:59Z

I wasn't able to reproduce this issue. Can you please provide some code that demonstrates this?

GULPF · 2019-03-04T23:17:32Z

input.json5 contains "\uDEAD". After running json5 input.json5 -o output.json, output.json contains the following bytes (in hex): 22 EF BF BD 22, which is a string only containing the replacement character.

jordanbtucker · 2019-03-06T19:09:34Z

Thanks for the example. That clarified the issue for me.

Your assumption about why this happens is correct. Since both JSON and JSON5 allow for invalid code unit sequences, the CLI should preserve them, however this would be a breaking change. The best approach may be to add a CLI flag in v2.2 that escapes invalid code point sequences. In v3.0 we can make that behavior the default and replace the CLI flag with one that replaces invalid code unit sequences with \uFFFD.

jordanbtucker self-assigned this Mar 4, 2019

jordanbtucker added the bug 🐛 label Mar 6, 2019

jordanbtucker added the pull-request-welcome label Mar 6, 2019

jordanbtucker added this to the v3.0.0 milestone Feb 23, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handling invalid unicode escapes #192

Handling invalid unicode escapes #192

GULPF commented Mar 4, 2019 •

edited

jordanbtucker commented Mar 4, 2019

GULPF commented Mar 4, 2019 •

edited

jordanbtucker commented Mar 6, 2019

Handling invalid unicode escapes #192

Handling invalid unicode escapes #192

Comments

GULPF commented Mar 4, 2019 • edited

jordanbtucker commented Mar 4, 2019

GULPF commented Mar 4, 2019 • edited

jordanbtucker commented Mar 6, 2019

GULPF commented Mar 4, 2019 •

edited

GULPF commented Mar 4, 2019 •

edited