by player_piano 71 days ago
Hi, yes, my apologies. One thing I'm currently fixing is the workflow for bringing in the many listings that sites like GovDeals cover but that are not part of the available APIs. Scraping sites like GovDeals is kind of shady and not something I want to do, so I am ingesting and cleaning a lot of data from state/government websites myself. While I fix that, I've removed those references from the site.
3 comments

FWIW, GovDeals does not care as long as your scraping load is reasonable; at least they didn’t years ago when I asked them. They prefer that personal scrapers (i.e. buyers looking for deals) stick to precise searches, but they were okay with properly throttled site-wide scraping if it’s a public site.
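
For anyone wondering what "properly throttled" could look like in practice, here is a rough sketch in Python. The `Throttle` class, the `throttled` helper, and the 2-second interval are all my own illustrative assumptions, not anything GovDeals publishes or recommends. It only spaces out calls; the actual fetching (and checking the site's terms and robots.txt) is left to you.

```python
import time
from typing import Callable, Iterable, Iterator, Optional


class Throttle:
    """Enforce a minimum interval between successive calls to wait().

    Hypothetical helper for illustration; clock and sleep are injectable
    so the behaviour can be checked without really sleeping.
    """

    def __init__(
        self,
        min_interval: float,
        clock: Callable[[], float] = time.monotonic,
        sleep: Callable[[float], None] = time.sleep,
    ) -> None:
        self.min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._last: Optional[float] = None  # time of the previous call, if any

    def wait(self) -> float:
        """Pause until min_interval has elapsed since the previous call.

        Returns the number of seconds slept (0.0 if no pause was needed).
        """
        delay = 0.0
        if self._last is not None:
            elapsed = self._clock() - self._last
            delay = max(0.0, self.min_interval - elapsed)
            if delay > 0:
                self._sleep(delay)
        self._last = self._clock()
        return delay


def throttled(items: Iterable[str], throttle: Throttle) -> Iterator[str]:
    """Yield items one at a time, pausing via the throttle before each one."""
    for item in items:
        throttle.wait()
        yield item


if __name__ == "__main__":
    # Assumed interval: at most one request every 2 seconds.
    # The page names are placeholders; plug in your own fetch logic.
    for page in throttled(["page-1", "page-2", "page-3"], Throttle(2.0)):
        print("would fetch", page)
```

A fixed minimum interval is about the simplest policy there is. If the site starts returning errors or slowing down, backing off further is the polite thing to do.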

They make money not by optimizing per-item profit or by exploiting information asymmetry, but by getting as many eyeballs on their site as possible to drive demand (and thus drive auction prices up). They’re happy to be scraped as long as scrapers don’t bring them down, because their core competency is giving municipal and state governments an aggregated platform and making the process easier from a bureaucratic point of view.

If you do the work of marketing for them (especially for free!), that’s a plus in their eyes. You’re not a competitor, because they are the ones who actually deal with government departments, handling things like payments and paperwork.

Indexing GovDeals is not shady. You are just providing links to their website via search. That's how Google works.

Just my two cents, but GovDeals is probably the best clearinghouse. If you don’t have data from GovDeals it’s a non-starter.