Hacker News new | ask | show | jobs
by kmike84 3638 days ago
There are many methods for different tasks. What do you mean by 'data extraction', do you have some specific examples?
1 comments

After the OCR of documents, I have used mostly regex to extract information from semi-structured documents. One example would be invoices (invoice number, total amount etc.), another would be to extract product names, SKU numbers etc. from various documents.