http://blog.konradvoelkel.de/2010/01/linux-ocr-and-pdf-probl...
http://www.konradvoelkel.com/2013/03/scan-to-pdfa/