Hacker News
by _boffin_ 394 days ago
You’re aware that PDFs are containers that can hold content in various formats, and that this content can be layered in different ways (stacked on top of each other, interleaved throughout the page, or arranged in unexpected and unspecified ways that aren’t “parsable”), right?

I would wager that they’re using OCR/LLM in their pipeline.

1 comment

Could be. But the conversion is free, which leads me to believe LLMs are not involved.