Hacker News new | ask | show | jobs
by rosha 3128 days ago
I recently had a similar idea which I ended up putting into real world, so I built a small search engine "UkookU" for used vehicles from scratch, it did not take me a long time to get the data and maintain it as I did not need to build any scrapers or care about sites blocking/abuse as I used a third party scrapping API called ProxyCrawl https://proxycrawl.com which allows me to make a proof of concept of the idea for free, so that shortened the time I needed to build the hadoop engine, etc.

I am thinking now recently to build a recruiting talent pool service, which is based on aggregated data from LinkedIn, Google, Bing, Yahoo, Facebook and many other sites and I am pitching something around it as I can get all the data with ProxyCrawl.

I am also thinking recently to do something about keyword ontologies, small markets around me using google/yandex data and offer it as a free helpful tool in the form of a mobile app. How do you normally get your data for your projects?