Hacker News new | ask | show | jobs
by ethan_zhao 108 days ago
Author here. I built this for 3mins.news, an AI news aggregator covering 180+ sources in 17 languages. The trickiest part was figuring out that articles in different languages about the same event share zero tokens — MinHash/LSH gives you Jaccard similarity of 0.

Happy to answer questions about the pgvector setup, Cloudflare Workers constraints, or the clustering algorithm tuning.