New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make StringDictionaryBuilder
faster
#1851
Comments
I also have an implementation that I could contribute, it would need some changes for generic key types and maybe for null values. It's based on using |
Exciting, lets see how they compare 😄 |
StringDictionaryBuilder
faster
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
A while back I implemented an optimized string dictionary builder for IOx. This contains two major tricks to provide better performance:
raw_entry_mut
to not duplicate string values into the hashmapI have an implementation of this for arrow that needs a bit more polish, but leads to a 60% speedup over the current implementation in arrow. Unfortunately it depends on #1850 as it needs to be able to read the string data from an in-progress
StringBuilder
Describe the solution you'd like
Implement #1850 and then add this functionality
Describe alternatives you've considered
We could not do this
The text was updated successfully, but these errors were encountered: