Support transcoding with replacement #147
Comments
cc @clausecker
Copy paste of some relevant parts...
I guess the "UTF-8 decoder capability and stress test" is relevant here, although I wish it provided a set of pairs { UTF-8 input, expected UTF-32 output } instead of a specification based on visual inspection. My understanding is that UTF-8 decoding should never stop on an error. Instead, it should always signal the error by substituting U+FFFD, and it should always re-synchronize on every valid first byte.
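To make the "never stop, substitute U+FFFD, re-synchronize" behaviour concrete, here is a minimal Python sketch of such a decoder, followed by a few { UTF-8 input, expected UTF-32 output } pairs. The function name `decode_utf8_lossy` is hypothetical and is not part of this project's API. The sketch assumes one particular replacement policy: whenever a sequence is rejected, exactly one byte is replaced and decoding resumes at the next byte. That can emit more U+FFFD characters than the "maximal subpart" practice recommended by the Unicode Standard, so treat the expected outputs as specific to this policy.

```python
REPLACEMENT = 0xFFFD  # U+FFFD REPLACEMENT CHARACTER


def decode_utf8_lossy(data: bytes) -> list:
    """Decode UTF-8 into a list of code points without ever stopping on error.

    Assumed policy (hypothetical, for illustration only): a rejected byte
    produces one U+FFFD and decoding restarts at the very next byte, so a
    valid first byte is never swallowed by a preceding broken sequence.
    """
    out = []
    i, n = 0, len(data)
    while i < n:
        b0 = data[i]
        if b0 < 0x80:  # ASCII
            out.append(b0)
            i += 1
            continue
        # Pick the number of continuation bytes and the smallest code point
        # allowed for that length (this rejects overlong encodings).
        if 0xC2 <= b0 <= 0xDF:
            need, cp, lowest = 1, b0 & 0x1F, 0x80
        elif 0xE0 <= b0 <= 0xEF:
            need, cp, lowest = 2, b0 & 0x0F, 0x800
        elif 0xF0 <= b0 <= 0xF4:
            need, cp, lowest = 3, b0 & 0x07, 0x10000
        else:
            # Stray continuation byte or a byte that can never start a sequence.
            out.append(REPLACEMENT)
            i += 1
            continue
        # Consume up to `need` continuation bytes (10xxxxxx).
        j = i + 1
        while j < n and j - i <= need and (data[j] & 0xC0) == 0x80:
            cp = (cp << 6) | (data[j] & 0x3F)
            j += 1
        complete = (j - i) == need + 1
        is_surrogate = 0xD800 <= cp <= 0xDFFF
        if complete and lowest <= cp <= 0x10FFFF and not is_surrogate:
            out.append(cp)
            i = j
        else:
            out.append(REPLACEMENT)
            i += 1  # re-synchronize: retry from the next byte


    return out


# Pairs of { UTF-8 input, expected UTF-32 output } under the policy above.
PAIRS = [
    (b"a\xffb", [0x61, 0xFFFD, 0x62]),              # invalid byte between ASCII
    (b"\xe2\x82\xac", [0x20AC]),                    # valid 3-byte sequence
    (b"\xe2\x82A", [0xFFFD, 0xFFFD, 0x41]),         # truncated sequence, then 'A'
    (b"\xc0\x80", [0xFFFD, 0xFFFD]),                # overlong encoding of U+0000
    (b"\xed\xa0\x80", [0xFFFD, 0xFFFD, 0xFFFD]),    # encoded surrogate U+D800
    (b"\xf0\x9f\x98\x80", [0x1F600]),               # valid 4-byte sequence
]

for raw, expected in PAIRS:
    assert decode_utf8_lossy(raw) == expected
```

A table of pairs like `PAIRS` is the kind of machine-checkable test data asked for above. A different replacement policy would keep the same inputs and change only the expected outputs.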
We’ll definitely support replacement in the future!
Add functions that transcode even when the input is invalid, by replacing invalid character sequences with a replacement character.