New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Release 5.0.0 #254
Merged
Merged
Release 5.0.0 #254
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Remove support for Python < 3.6
Co-authored-by: LSY <grizlupo@daum.net>
…, Greek, and Turkish
…hat use ASCII letters
Always remove XML tags when detecting single-byte charset encodings that use ASCII letters
Likely characters count for 25% now, and too many control characters decrease confidence, just like in uchardet.
charade merger is no longer recent, and `master` no longer supports Python 2.
Added Project Urls for documentation, Github Repo and Github Issues. Co-authored-by: Nirjas Jakilim <Nirzak@users.noreply.github.com>
…235) Fixes the warning: [WARNING] The 'rev' field of repo 'https://github.com/ambv/black' appears to be a mutable reference (moving tag / branch). Mutable references are never updated after first install and are not supported. See https://pre-commit.com/#using-the-latest-version-for-a-repository for more details. Hint: `pre-commit autoupdate` often fixes this. Fixed by running the command "pre-commit autoupdate".
Co-authored-by: Dan Blanchard <dan.blanchard@gmail.com>
Help to ensure new code contributions meet the coding standards of the project.
This approach avoids mixing code with configuration and therefore requires less custom code at build-time and installation-time. Removing this custom setup.py code reduces cross-project boilerplate. https://setuptools.readthedocs.io/en/latest/userguide/declarative_config.html
* slight increase in performance * update black version to 22.3.0 * reformat code * reformat code
* support for UTF-16 and UTF-32 detection missing BOMs * Changes per PR comments - Restored file suffix filter in test.py - Added functionality to identify valid unicode, to enhance detection - Generated some non-trivial unicode examples using supplementary plane 1 * clean up poorly written comments * Run black on PR * Fix some minor linting issues Co-authored-by: Jason Zavaglia <jason.zavaglia@gmail.com>
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
In addition to that change, it features the following user-facing changes:
SingleByteCharSetProber
confidence to match latest uchardet (Tweak SingleByteCharSetProber confidence #209)detect_all
return child prober confidences (Make detect_all return child prober confidences #210)pressent
topresent
#220, Added Helpful Project Urls #221, Simple maintenance improvements #244 from too many to mention)