| HN Mirror

The last time I gave it a serious try, back in 2019, I gave it ~120000 non-spam samples (several years of real emails) and ~25000 spam samples (1 month of spam).

After that it was getting about 5% false-positive (so 1 in 20 real emails went to spam) and about 3% false-negative. For me, 3% false negative means 25 spams to inbox a day.

Gmail gives me about 0.5% false positive (1 in 200) and 0.01% false negatives.