Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problem with UTF-8 character in wordlist file #121

Open
jonasbn opened this issue Oct 2, 2022 · 4 comments
Open

Problem with UTF-8 character in wordlist file #121

jonasbn opened this issue Oct 2, 2022 · 4 comments
Assignees
Labels
bug Something isn't working wontfix This will not be worked on

Comments

@jonasbn
Copy link
Collaborator

jonasbn commented Oct 2, 2022

I am observing an issue with the action in the repository jonasbn/perl-task-date-holidays

The word: Rezić is reported as a spelling mistake even when listed in the word list file (.wordslist.txt).

REF: relevant jonasbn/perl-task-date-holidays@150683d of jonasbn/perl-task-date-holidays/.wordlist.txt as head has been altered.

This is the configuration:

matrix:
- name: Markdown
  aspell:
    lang: en
    ignore-case: true
  dictionary:
    wordlists:
    - .wordlist.txt
    encoding: utf-8
  pipeline:
  - pyspelling.filters.markdown:
  - pyspelling.filters.html:
      comments: false
      ignores:
      - code
      - pre
  sources:
  - '**/*.md'
  default_encoding: utf-8

REF: perl-task-date-holidays/.spellcheck.yaml

@jonasbn jonasbn self-assigned this Oct 2, 2022
@jonasbn jonasbn added the bug Something isn't working label Oct 2, 2022
jonasbn referenced this issue in jonasbn/perl-task-date-holidays Oct 2, 2022
@facelessuser
Copy link

Try specifyingRezic in your English dictionary. It may simply be due to how ASPELL normalizes characters in an English dictionary.

@jonasbn
Copy link
Collaborator Author

jonasbn commented Oct 2, 2022

Thanks @facelessuser I will try that

@facelessuser
Copy link

I'm kind of digging into the settings. I mainly use English words, so I don't have experience often with using some foreign words and such, so I haven't dug into all the Unicode normalization options and such. There may be an even better approach, but I may have to play around to see what that is.

@facelessuser
Copy link

From the Aspell documentation:

If a word contains a character that the language can’t handle it will still be ignored (for example a Cyrillic letter in a Latin based language).

I imagine this may simply be an issue of using certain characters within an English dictionary.

@jonasbn jonasbn added the wontfix This will not be worked on label Jan 5, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working wontfix This will not be worked on
Projects
None yet
Development

No branches or pull requests

2 participants