|
I can't find any details on the license or copyright status of these recipes. It looks like a scraper was used (Scrapy). The only note I could find in the paper just says:

> Additional recipes were gathered from multiple cooking web pages, using automated scripts in a web scraping process.

Is it legal to republish these recipes without explicit permission from the origin sites? I would be wary of using these for anything without more clarity.

EDIT: There's no license information in the full dataset, but it does list the source URL of the scraped recipes. Summary of sites used:

value,count
www.cookbooks.com,896341
www.food.com,290565
www.epicurious.com,94398
www.myrecipes.com,64862
www.allrecipes.com,61398
www.yummly.com,51963
www.tasteofhome.com,51594
tastykitchen.com,50320
food52.com,48501
recipes-plus.com,20524
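
For anyone who wants to check the breakdown themselves, here is a minimal sketch of how a per-site tally like the one above could be produced. It is not necessarily how I (or the dataset authors) did it. The column name `link`, the helper name `count_source_sites`, and the sample rows are all assumptions made for illustration; adjust them to whatever the dataset file actually uses.

```python
import csv
import io
from collections import Counter
from urllib.parse import urlparse


def count_source_sites(rows, url_field="link"):
    """Tally recipes per source host from an iterable of dict rows.

    `url_field` is a hypothetical column name; change it to match the file.
    """
    counts = Counter()
    for row in rows:
        url = (row.get(url_field) or "").strip()
        if not url:
            continue
        # Scraped URLs may lack a scheme; add one so urlparse can find the host.
        if "://" not in url:
            url = "http://" + url
        host = urlparse(url).netloc.lower()
        if host:
            counts[host] += 1
    return counts


# Made-up in-memory sample standing in for the real CSV file.
sample = io.StringIO(
    "title,link\n"
    "Pie,www.cookbooks.com/Recipe-Details.aspx?id=1\n"
    "Soup,http://www.food.com/recipe/soup-2\n"
    "Stew,www.cookbooks.com/Recipe-Details.aspx?id=3\n"
)
counts = count_source_sites(csv.DictReader(sample))

# Print in the same value,count layout as the summary, largest first.
print("value,count")
for host, n in counts.most_common():
    print(f"{host},{n}")
```

To run it on the real data, replace `sample` with an open file handle, e.g. `open(path, newline="", encoding="utf-8")`, where `path` points at your local copy of the dataset.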
|
“A mere listing of ingredients is not protected under copyright law. However, where a recipe or formula is accompanied by substantial literary expression in the form of an explanation or directions, or when there is a collection of recipes as in a cookbook, there may be a basis for copyright protection.”
https://www.copyright.gov/help/faq/faq-protect.html