Stargate Micro-Benchmarks (feat: jmh)

As per what's on the tin: this repo contains various trivial micro-benchmarks I use to validate (or disprove) minor performance improvements, especially ones that are unlikely to have globally measurable effect on overall performance.

Results

Illegal Chars Check test

On my dev machine I get numbers like this (reordered fastest-to-slowest):

java -jar target/microbenchmarks.jar .\*IllegalCharsCheck.\*

Benchmark                                Mode  Cnt        Score         Error  Units
IllegalCharsCheck.checkUsingBitset      thrpt    6  7766475.485 ± 4833682.790  ops/s
IllegalCharsCheck.switchInLoop          thrpt    6  7667539.164 ± 1236293.708  ops/s
IllegalCharsCheck.explicitLoopingType2  thrpt    6  5398358.111 ±  220305.428  ops/s
IllegalCharsCheck.explicitLoopingType1  thrpt    6  3624089.868 ± 1733100.773  ops/s
IllegalCharsCheck.guavaBasedCheck       thrpt    6  1808928.499 ±  758903.593  ops/s
IllegalCharsCheck.regExpBasedCheck      thrpt    6  1779749.670 ±   44544.703  ops/s
IllegalCharsCheck.defaultNaiveLooping   thrpt    6  1591499.269 ±   53157.329  ops/s

So: 3 groups in terms of performance:

Using switch or long-backed bitset is the fastest: due to simplicity I'd prefer switch
Explicitly looping, using String.indexOf() can be quite fast too, simple
Use of Regexp generated from chars is similar to Guava (perhaps Guava uses that approach); naive Stream usage has high overhead as well

and the performance difference is something like 4x - 5x.

Illegal Chars Replace test

On my dev machine I get numbers like this (reordered fastest-to-slowest):

java -jar target/microbenchmarks.jar .\*Encode.\*

EncodeIllegalChars.switchBasedEncode  thrpt    6  3314744.639 ± 197872.090  ops/s
EncodeIllegalChars.naiveEncode        thrpt    6  1420263.651 ± 167113.841  ops/s

So once again, switch-based scan-once is faster, here almost by factor of 3 (which makes sense as it is 6 individual char scans over single scan with switch of 6 matches). It is quite possible that time taken may be dominated by the few cases where replacements do occur (and String is allocated; both approaches avoid String creation for no-replacements cases).

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
src/main/java/io/stargate/microb		src/main/java/io/stargate/microb
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

src/main/java/io/stargate/microb

src/main/java/io/stargate/microb

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

pom.xml

pom.xml

Repository files navigation

Stargate Micro-Benchmarks (feat: jmh)

Results

Illegal Chars Check test

Illegal Chars Replace test

About

Releases

Packages

Languages

License

tatu-at-datastax/stargate-micro-benchmarks

Folders and files

Latest commit

History

Repository files navigation

Stargate Micro-Benchmarks (feat: jmh)

Results

Illegal Chars Check test

Illegal Chars Replace test

About

Resources

License

Stars

Watchers

Forks

Languages