by Anduia 761 days ago
Using a hash table is effective because you only compare elements within the same hash bucket, not against the entire set. However, hash tables can become inefficient on very large datasets because of their memory usage and processing time, which is where approximate counts shine.
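To make the memory point concrete, here is a minimal sketch of exact distinct counting with a hash-based set. The function name `count_distinct_exact` is made up for illustration and is not from the thread. The set holds one entry per distinct element, so memory grows with the number of distinct values seen.

```python
def count_distinct_exact(stream):
    """Exact distinct count using a hash-based set (hypothetical helper)."""
    seen = set()  # holds every distinct element, so memory is O(distinct)
    for item in stream:
        # A lookup or insert only compares against items in the same bucket.
        seen.add(item)
    return len(seen)


if __name__ == "__main__":
    print(count_distinct_exact(["a", "b", "a", "c", "b"]))
```

On a stream with billions of distinct values, `seen` would have to hold all of them, which is the case where an approximate counter becomes attractive.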
1 comments

This algorithm still generates a lot of random numbers. I would guess that this is much less overhead than hashing, but it still seems like it could be significant.
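The thread does not name the algorithm, so the following is only a sketch under an assumption: that it is a coin-flip sampling estimator in the style of the CVM distinct-elements algorithm. The names `estimate_distinct`, `max_size`, and `draws` are hypothetical. The extra `draws` counter is there to show how many random numbers get generated: one per stream item, plus one per kept element each time the buffer is halved.

```python
import random


def estimate_distinct(stream, max_size, rng=None):
    """Sampling-based distinct-count estimate (CVM-style sketch, an assumption).

    Returns (estimate, draws), where draws is the number of random
    numbers generated along the way.
    """
    rng = rng or random.Random()
    p = 1.0        # current sampling probability
    kept = set()   # bounded sample of elements
    draws = 0
    for item in stream:
        kept.discard(item)       # forget any earlier decision for this item
        draws += 1
        if rng.random() < p:     # keep the item with probability p
            kept.add(item)
        # Variation: halve repeatedly instead of failing when the buffer
        # is still full after one halving round.
        while len(kept) >= max_size:
            draws += len(kept)   # one coin flip per kept element
            kept = {x for x in kept if rng.random() < 0.5}
            p /= 2
    return len(kept) / p, draws


if __name__ == "__main__":
    estimate, draws = estimate_distinct(["a", "b", "a", "c", "b"], max_size=100)
    print(estimate, draws)
```

When the buffer never fills, `p` stays at 1 and the estimate is exact, at a cost of one random draw per item. Whether those draws are cheaper than hashing each item depends on the generator and the hash function, so it is worth measuring rather than guessing.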