stream-processing-with-spark/HLLAccumulator

HLLAccumulator

HyperLogLog-based accumulator for Apache Spark

A custom AccumulatorV2 implementation that uses HyperLogLogPlus to estimate the number of unique elements seen. Its value is an approximate cardinality of the observed elements, not an exact count.
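As an illustration of the idea, here is a minimal sketch of how such an accumulator could be written. It is not the code in this repository: the class name `HLLSketchAccumulator` and the default precision parameters `p = 14` and `sp = 25` are assumptions made for the example. It also assumes Spark 2.x or later and the stream-lib library (`com.clearspring.analytics`) are on the classpath.

```scala
import com.clearspring.analytics.stream.cardinality.HyperLogLogPlus
import org.apache.spark.util.AccumulatorV2

// Hypothetical class name and default precisions; assumptions for this sketch only.
// p is the dense precision and sp the sparse precision of HyperLogLogPlus.
class HLLSketchAccumulator[T](p: Int = 14, sp: Int = 25)
    extends AccumulatorV2[T, Long] {

  // The sketch is Serializable, so it travels with the accumulator to executors.
  private var hll = new HyperLogLogPlus(p, sp)

  // An empty sketch estimates a cardinality of zero.
  override def isZero: Boolean = hll.cardinality() == 0L

  // Build an independent accumulator holding the same observations.
  override def copy(): AccumulatorV2[T, Long] = {
    val acc = new HLLSketchAccumulator[T](p, sp)
    acc.hll.addAll(hll)
    acc
  }

  // Discard all observations by starting from a fresh sketch.
  override def reset(): Unit = {
    hll = new HyperLogLogPlus(p, sp)
  }

  // Record one element; duplicates do not change the estimate.
  override def add(v: T): Unit = {
    hll.offer(v.asInstanceOf[AnyRef])
    ()
  }

  // Fold a partial result from another task into this one.
  override def merge(other: AccumulatorV2[T, Long]): Unit = other match {
    case o: HLLSketchAccumulator[_] => hll.addAll(o.hll)
    case _ =>
      throw new UnsupportedOperationException(
        s"Cannot merge ${getClass.getName} with ${other.getClass.getName}")
  }

  // Approximate number of distinct elements observed so far.
  override def value: Long = hll.cardinality()
}
```

In a Spark job, an accumulator like this would typically be registered with `sparkContext.register(acc, "unique-elements")` (the name is arbitrary), updated inside an action such as `rdd.foreach(x => acc.add(x))`, and read on the driver through `acc.value`. Merging sketches rather than sets keeps the data sent back from each task small and of bounded size, at the cost of an approximate result.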

Local deployment

This sample application is meant to be deployed locally, by publishing it to the local Ivy repository with sbt:

sbt publishLocal
