Diffbot uses machine learning, NLP and computer vision to automatically extract data from web pages. We offer a host of APIs and services around this technology to hundreds of (paying) customers. We recently announced our profitability and the raising of $10M in Series A funding to bolster our significantly expanded efforts: http://www.diffbot.com/company/news/