by rainbowjelly 1004 days ago
> For the scientific literature, we need a ChatGPT equivalent to reconstruct LaTeX source that can reproduce each page. (We really need a successor to LaTeX that isn't such an arcane language, and can author fixed and flowable text with equal ease.)

Check out Nougat: OCRing scientific papers with a deep net trained end to end. It was released by Meta a few days ago.

“PDF format leads to a loss of semantic information, particularly for mathematical expressions. We propose Nougat (Neural Optical Understanding for Academic Documents), a Visual Transformer model that performs an Optical Character Recognition (OCR) task for processing scientific documents into a markup language, and demonstrate the effectiveness of our model on a new dataset of scientific documents.”

https://facebookresearch.github.io/nougat/