Version 3.0.0 #223

Ousret · 2022-10-18T19:24:42Z

3.0.0 (2022-10-20)

Added

Extend the capability of explain=True when cp_isolation contains at most two entries (min one), will log in details of the Mess-detector results
Support for alternative language frequency set in charset_normalizer.assets.FREQUENCIES
Add parameter language_threshold in from_bytes, from_path and from_fp to adjust the minimum expected coherence ratio
normalizer --version now specify if the current version provides extra speedup (meaning mypyc compilation whl)

Changed

Build with static metadata (not pyproject.toml yet)
Make language detection stricter
Optional: Module md.py can be compiled using Mypyc to provide an extra speedup up to 4x faster than v2.1

Fixed

CLI with opt --normalize fail when using full path for files
TooManyAccentuatedPlugin induce false positive on the mess detection when too few alpha characters have been fed to it
Sphinx warnings when generating the documentation

Removed

Coherence detector no longer returns 'Simple English' instead returns 'English'
Coherence detector no longer returns 'Classical Chinese' instead returns 'Chinese'
Breaking: Method first() and best() from CharsetMatch
UTF-7 will no longer appear as "detected" without a recognized SIG/mark (is unreliable/conflicts with ASCII)
Breaking: Class aliases CharsetDetector, CharsetDoctor, CharsetNormalizerMatch and CharsetNormalizerMatches
Breaking: Top-level function normalize
Breaking: Properties chaos_secondary_pass, coherence_non_latin and w_counter from CharsetMatch
Support for the backport unicodedata2

codecov-commenter · 2022-10-18T19:28:27Z

Codecov Report

Merging #223 (ef42849) into master (db134f3) will not change coverage.
The diff coverage is 100.00%.

❗ Current head ef42849 differs from pull request most recent head 28a8c6f. Consider uploading reports for the commit 28a8c6f to get more accurate results

@@           Coverage Diff           @@
##           master     #223   +/-   ##
=======================================
  Coverage   89.89%   89.89%           
=======================================
  Files          10       10           
  Lines        1187     1187           
=======================================
  Hits         1067     1067           
  Misses        120      120

Impacted Files	Coverage Δ
charset_normalizer/version.py	`100.00% <100.00%> (ø)`

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

release date

Ousret added 2 commits October 18, 2022 21:22

🔖 bump version 3.0.0

5da2ba8

📝 docs user::support languages update

e596fa9

Ousret and others added 7 commits October 18, 2022 21:31

🔧 update universal-wheel stage (missing build pkg)

3d28cde

Merge branch 'master' into release-3.0

bc5b939

Merge branch 'master' into release-3.0

e34c589

🔧 use a dedicated reqs.txt for the optional build

ef42849

Merge branch 'master' into release-3.0

28a8c6f

📝 Update CHANGELOG

013b67e

release date

Merge remote-tracking branch 'origin/release-3.0' into release-3.0

fd46874

Ousret merged commit 0ec52ef into master Oct 20, 2022

Ousret deleted the release-3.0 branch October 20, 2022 08:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Version 3.0.0 #223

Version 3.0.0 #223

Ousret commented Oct 18, 2022 •

edited

codecov-commenter commented Oct 18, 2022 •

edited

Version 3.0.0 #223

Version 3.0.0 #223

Conversation

Ousret commented Oct 18, 2022 • edited

3.0.0 (2022-10-20)

Added

Changed

Fixed

Removed

codecov-commenter commented Oct 18, 2022 • edited

Codecov Report

Ousret commented Oct 18, 2022 •

edited

codecov-commenter commented Oct 18, 2022 •

edited