|
|
|
|
|
by ethan_zhao
108 days ago
|
|
Author here. I built this for 3mins.news, an AI news aggregator covering 180+ sources in 17 languages. The trickiest part was figuring out that articles in different languages about the same event share zero tokens — MinHash/LSH gives
you Jaccard similarity of 0. Happy to answer questions about the pgvector setup, Cloudflare Workers constraints, or the clustering algorithm tuning. |
|