by Spivak 1688 days ago
You don't have to mine the site at all. There's a schema for the data that basically all cooking sites support if they want SEO https://developers.google.com/search/docs/advanced/structure...

Perhaps I'm dim, but I don't understand how that means you don't have to mine the site. Doesn't that simply mean you could process all of the URLs in the database here by making the HTTP request, grabbing those `recipeIngredient` sections, converting units and such, and then populating your simpler database from that?

That seems like pretty textbook (albeit very simplistic) mining to me.
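
To make that concrete, here is a rough sketch of the "grabbing those `recipeIngredient` sections" step. It assumes the page embeds its schema.org Recipe markup as JSON-LD inside a `<script type="application/ld+json">` tag, which is common but not guaranteed (some sites use microdata instead). The names `JsonLdCollector` and `recipe_ingredients` are made up for illustration, and the HTTP request, unit conversion, and database steps are left out.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collects the raw text of every <script type="application/ld+json"> block."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buf = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buf = []

    def handle_data(self, data):
        # Script content can arrive in several chunks, so buffer it.
        if self._in_jsonld:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buf))
            self._in_jsonld = False


def _walk(node):
    """Yield every dict nested anywhere inside a decoded JSON value."""
    if isinstance(node, dict):
        yield node
        for value in node.values():
            yield from _walk(value)
    elif isinstance(node, list):
        for item in node:
            yield from _walk(item)


def _is_recipe(node):
    # "@type" may be a single string or a list of type names.
    node_type = node.get("@type")
    return node_type == "Recipe" or (
        isinstance(node_type, list) and "Recipe" in node_type
    )


def recipe_ingredients(html):
    """Return the recipeIngredient strings found in a page's JSON-LD, if any."""
    collector = JsonLdCollector()
    collector.feed(html)
    collector.close()

    ingredients = []
    for block in collector.blocks:
        try:
            data = json.loads(block)
        except json.JSONDecodeError:
            continue  # Skip malformed blocks rather than failing the whole page.
        for node in _walk(data):
            if _is_recipe(node):
                found = node.get("recipeIngredient", [])
                if isinstance(found, str):
                    found = [found]
                ingredients.extend(found)
    return ingredients
```

You would still have to fetch each URL yourself (for example with `urllib.request.urlopen`) and pass the decoded body to `recipe_ingredients`, so the structured data makes the parsing easier but the crawl is still a crawl.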