by dj_axl 2 hours ago
> A 100-bit bloom filter holding 100,000 keys is saturated instantly. Every bit is set. It returns “maybe present” for every key you ask about — which means it filters nothing, and every read falls through to a full file scan.

Hahaha. (Seems like the bloom filter library isn't configured with a maximum false positive rate and/or set to auto-expand.)

Edit: Actually there's a BloomFalsePositive setting, so maybe it never gets used? It's also possible this isn't a library at all but a custom implementation.
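
For a sense of scale, here's a rough back-of-the-envelope sketch using the standard bloom filter approximations. It is not the project's code: the function names and the 1% target are my own assumptions, and only the 100-bit / 100,000-key numbers come from the quote.

```python
import math

def false_positive_rate(m_bits, n_keys, k_hashes):
    """Standard approximation: p = (1 - e^(-k*n/m))^k."""
    return (1.0 - math.exp(-k_hashes * n_keys / m_bits)) ** k_hashes

def bits_needed(n_keys, target_fpr):
    """Bits required for a target rate, assuming the optimal hash count:
    m = -n * ln(p) / (ln 2)^2."""
    return math.ceil(-n_keys * math.log(target_fpr) / (math.log(2) ** 2))

def optimal_hashes(m_bits, n_keys):
    """Optimal number of hash functions: k = (m / n) * ln 2, at least 1."""
    return max(1, round(m_bits / n_keys * math.log(2)))

# The case from the quote: 100 bits, 100,000 keys, even with a single hash.
saturated = false_positive_rate(100, 100_000, 1)

# Sizing the same filter for an assumed 1% false positive target instead.
m = bits_needed(100_000, 0.01)
k = optimal_hashes(m, 100_000)
sized = false_positive_rate(m, 100_000, k)

print(f"100 bits: fpr={saturated:.4f}")
print(f"{m} bits, {k} hashes: fpr={sized:.4f}")
```

With these formulas the 100-bit filter comes out at a false positive rate of essentially 1, while a 1% target for 100,000 keys needs a little under a million bits (on the order of 120 KB). Deriving the bit count from a target rate like this is presumably what a setting such as BloomFalsePositive is meant to do, if it is actually wired up.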

1 comment

I guess you've never made a silly mistake, found it, and admitted it.

The author wrote this as a learning exercise and is sharing the process.