New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
utf8 case folding and/or comparisons? #135
Comments
Case-folding and case-insensitive comparisons are complex and locale dependent. Will be very tricky to vectorise, even if normalised input can be assumed. |
At https://github.com/SnellerInc/sneller we have Unicode support and even for upper/lower-case we ended up with huge lookup tables. Here we might consider using gathers, and check the outcome. Although I'm quite sure performance will be mainly bounded by cache misses penalties. |
@WojciechMula I didn't know you are one of the sneller developer, very impressive project BTW! |
Yeah, a lot of great stuff is there. :) Getting back to the main topic, there was an attempt to express lookup table as a huge vectorized if-ladder and it was very slow. |
Hi,
Thank you for your work on this. Would it be possible to use this (or any other SIMD-based) approach for fast case folding and/or case-insensitive comparison?
The text was updated successfully, but these errors were encountered: