by rainbowjelly
1004 days ago
> For the scientific literature, we need a ChatGPT equivalent to reconstruct LaTeX source that can reproduce each page. (We really need a successor to LaTeX that isn't such an arcane language, and can author fixed and flowable text with equal ease.)

Check out Nougat: OCRing scientific papers with a deep net trained end to end. It was released by Meta a few days ago.

“PDF format leads to a loss of semantic information, particularly for mathematical expressions. We propose Nougat (Neural Optical Understanding for Academic Documents), a Visual Transformer model that performs an Optical Character Recognition (OCR) task for processing scientific documents into a markup language, and demonstrate the effectiveness of our model on a new dataset of scientific documents.”

https://facebookresearch.github.io/nougat/