Failed to detect CP932 encoded file #280

junfujisawa · 2023-06-01T05:19:13Z

I have a CP932-encoded file that contains a CP932-specific kanji character (such as U+9AD9). When I try to detect the encoding using the chardet.detect() function, I get the following result:
{'encoding': None, 'confidence': 0.0, 'language': None}

If I use the chardet.detect_all() function, I get a different result:
[{'encoding': 'SHIFT_JIS', 'confidence': 0.9472926785393057, 'language': 'Japanese'}]

Both are different from the expected result, 'CP932'.
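
A minimal reproduction sketch follows, in case it helps. The sample text is an assumption made for illustration and is not the content of the original file; the only detail taken from the report is the character U+9AD9. The helper name can_decode is hypothetical. The first part uses only the standard library to show why the distinction matters: Python's strict shift_jis codec cannot decode bytes that the cp932 codec produces for this character. The chardet calls are guarded so the script still runs if chardet is not installed, and their output is not asserted here.

```python
# Sample text is an assumption for illustration; only U+9AD9 comes from the report.
text = "\u9ad9\u6a4b\u3055\u3093\u306e\u4f4f\u6240\u9332\u3067\u3059\u3002"
data = text.encode("cp932")  # U+9AD9 is encodable in cp932


def can_decode(raw: bytes, codec: str) -> bool:
    """Return True if raw decodes cleanly with the given codec (hypothetical helper)."""
    try:
        raw.decode(codec)
        return True
    except UnicodeDecodeError:
        return False


# cp932 round-trips the text; the strict shift_jis codec rejects the same bytes.
print("cp932 ok:", can_decode(data, "cp932"))
print("shift_jis ok:", can_decode(data, "shift_jis"))

# chardet is a third-party package; skip the detection step if it is missing.
try:
    import chardet
except ImportError:
    chardet = None

if chardet is not None:
    print(chardet.detect(data))
    print(chardet.detect_all(data))
```

If the detector reports SHIFT_JIS for such a file, decoding with that name fails on the CP932-only bytes, which is why a 'CP932' result is the useful one here.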
