Hacker News new | ask | show | jobs
by user24 3686 days ago
On Tuesday the Guardian reported that the BBC (fondly aka Auntie) would be archiving their recipes, so I quickly scraped the site and wrote a search engine for it.

Not sure how the site will evolve, if at all, but it was a fun side project!

Code is here if you want to play: https://github.com/user24/auntiesrecipes

2 comments

I thought archiving meant it would be no longer on their website, it would be awesome if they just released it all under something like the GPL. Could do some fun machine learning stuff with it...
I'd love them to release it as open data so that I'm not in murky waters.

If they move the recipes I can update the links. If they take the recipes down (which they've said they won't, now) I have got all the data so could rehost them.

edit: but, if you want to do some fun ML work, my scraper should help get you started!

Cool - you might want to look at using levenshtein distance algo to improve the search.
nice idea. Eventually, if I add more recipes, I'll need to move the search serverside too.