|
|
|
Ask HN: How Do I Get Started with OCR(Optical Character Recognition)?
|
|
8 points
by muralimadhu
2499 days ago
|
|
I have no background in machine learning or computer vision. What I do have is a problem statement. I want to be able to parse and get structured text out of financial documents like W2s and Paystubs. For ex)parse out company name, salary etc from a W2. Off the shelf solutions like AWS Textract doesnt work very well. So far I have only been treating OCR as a blackbox. If I were to build an OCR service myself for a specialized set of financial documents, what theory and tools would I need to learning assuming I have a CS background, but not an ML background? Thanks in advance |
|
Is it because you want to own the IP or for learning purposes?