Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Failed to detect CP932 encoded file #280

Open
junfujisawa opened this issue Jun 1, 2023 · 0 comments
Open

Failed to detect CP932 encoded file #280

junfujisawa opened this issue Jun 1, 2023 · 0 comments

Comments

@junfujisawa
Copy link

I have a CP932 encoded file which contains CP932 specific Kanji character (such as U+9AD9). When I try to detect the encoding by using chardet.detect() function, I get the following result:
{'encoding': None, 'confidence': 0.0, 'language': None}

If I use chardet.detect_all() function, I get different result.
[{'encoding': 'SHIFT_JIS', 'confidence': 0.9472926785393057, 'language': 'Japanese'}]

Both are different from the expected result, 'CP932'.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant