Hacker News new | ask | show | jobs
by abadger9 1398 days ago
As someone who works in this space, ^ this. I would say don't overthink any component, use common crawl (https://commoncrawl.org/) to build your initial index, use a pagerank implementation that's been thoroughly researched and published, and use off the shelf components from the apache foundation when you can.
1 comments

abadger9, nice username, do you have any cool portfolios ref the same kinda work?