Add Unicode Script into built-in rules. #751

Merged 6 commits on Dec 23, 2022

Commits on Dec 22, 2022

  1. Add CJK unicode into built-in rules.

    Make this change to add `CJK`, `HAN`, `HANGUL`, `KATAKANA`, `HIRAGANA` to built-in rules.
    
    https://unicode.org/faq/han_cjk.html
    
    - Chinese - `HAN`
    - Japanese - `KATAKANA`, `HIRAGANA`
    - Korean - `HANGUL`
    
    This makes it easy to match CJK characters.
    huacnlee committed Dec 22, 2022
    524bfd7
  2. Rewrite pest_meta::UNICODE_PROPERTY_NAMES to pest::unicode::unicode_property_names.
    
    -  will generate property names using a macro.
    -  has been removed.
    huacnlee committed Dec 22, 2022
    57dfab7
  3. Improve the unicode char_property_functions macro to support the original BY_NAME values generated by `ucd-generate`.
    
    And export all property names from Unicode (Script).
    huacnlee committed Dec 22, 2022
    89504ed
  4. 0b0248d
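
The language-to-script mapping in the first commit message (Chinese as `HAN`, Japanese as `KATAKANA` and `HIRAGANA`, Korean as `HANGUL`) can be illustrated with a small std-only sketch. This is not how pest implements these rules. The `CjkScript` enum and the `classify` function are hypothetical names, and the code point ranges below are rough approximations of each script, whereas the real Unicode Script property covers more ranges and excludes some shared punctuation.

```rust
/// Hypothetical labels mirroring the rule names from the commit message.
#[derive(Debug, PartialEq, Eq, Clone, Copy)]
enum CjkScript {
    Han,
    Hiragana,
    Katakana,
    Hangul,
}

/// Approximate classification by code point range (an assumption made for
/// illustration; the Unicode Script property is defined by larger tables).
fn classify(c: char) -> Option<CjkScript> {
    match c as u32 {
        0x3041..=0x3096 => Some(CjkScript::Hiragana), // main Hiragana letters
        0x30A1..=0x30FA => Some(CjkScript::Katakana), // main Katakana letters
        0xAC00..=0xD7A3 => Some(CjkScript::Hangul),   // precomposed Hangul syllables
        0x4E00..=0x9FFF => Some(CjkScript::Han),      // CJK Unified Ideographs block
        _ => None,
    }
}

fn main() {
    for c in "中あア한a".chars() {
        println!("{} -> {:?}", c, classify(c));
    }
}
```

A grammar that has access to the built-in rules would not need such a function; the sketch only shows what kind of distinction the script-based rules draw.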

Commits on Dec 23, 2022

  1. 3b6d1dc
  2. Update pest/src/unicode/mod.rs

    tomtau committed Dec 23, 2022
    34bc234
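
The Dec 22 commits mention a `BY_NAME` table produced by `ucd-generate` and a list of property names derived from it. As a rough sketch of that idea, the code below assumes a table that maps each property name to sorted, inclusive code point ranges. The table shape, the abbreviated range data, and the helpers `property_names` and `in_property` are assumptions for illustration only; the tables actually used in pest may be shaped differently.

```rust
use std::cmp::Ordering;

/// Sorted, non-overlapping, inclusive code point ranges (assumed shape).
type Ranges = &'static [(u32, u32)];

// Abbreviated, illustrative data; real script tables contain more ranges.
const HIRAGANA: Ranges = &[(0x3041, 0x3096), (0x309D, 0x309F)];
const KATAKANA: Ranges = &[(0x30A1, 0x30FA), (0x30FD, 0x30FF)];

/// Name-to-ranges table, kept sorted by name so it can be binary searched.
const BY_NAME: &[(&str, Ranges)] = &[("Hiragana", HIRAGANA), ("Katakana", KATAKANA)];

/// Hypothetical helper: list every property name in the table.
fn property_names() -> Vec<&'static str> {
    BY_NAME.iter().map(|(name, _)| *name).collect()
}

/// Hypothetical helper: `None` if the name is unknown, otherwise whether
/// `c` falls inside one of the property's ranges.
fn in_property(name: &str, c: char) -> Option<bool> {
    let idx = BY_NAME.binary_search_by(|(n, _)| (*n).cmp(name)).ok()?;
    let cp = c as u32;
    let found = BY_NAME[idx].1.binary_search_by(|&(lo, hi)| {
        if hi < cp {
            Ordering::Less
        } else if lo > cp {
            Ordering::Greater
        } else {
            Ordering::Equal
        }
    });
    Some(found.is_ok())
}

fn main() {
    println!("{:?}", property_names());
    println!("{:?}", in_property("Hiragana", 'あ'));
    println!("{:?}", in_property("Katakana", 'あ'));
}
```

Deriving the name list from the same table that drives lookups keeps the two from drifting apart, which appears to be the motivation for replacing a hand-maintained constant.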