| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by kshivendu 210 days ago

My article is an architecture breakdown of how Exa AI built web search that's better than Google — and what is the bare minimum cost to build web scale search today with napkin math.

Please check it out if you're curious about: - How modern AI search engines like Exa, Perplexity, and Parallel Web Systems operate under the hood. - Learning napkin math style estimation (technique popularized by legends like Jeff Dean) - How vector compression tricks like matryoshka embeddings + binary quantization change the economics of billion scale search

Please take my estimates with a grain of salt since my goal is to just get in the right ballpark. Also feel free to comment/DM if you see any wrong or suboptimal assumptions :)