|
|
|
|
|
by maleldil
486 days ago
|
|
The API is pretty nice and easy to get started, but I couldn't get good results with parsing scientific paper PDFs, unfortunately (including OCR). Are there plans to use other backends? Docling works alright, and LLMs like Gemini Flash are interesting too. |
|
You should try using a different PSM to see if you get better results.
If it's scientific texts specifically, look at grobid