Hacker News new | ask | show | jobs
by vonneumannstan 325 days ago
So what you're saying is: the solution to PDF parsing is make a new file format altogether lol. Very helpful.
1 comments

Not at all. PDFs support embedded content, and JSON (or similar) is a fine way to store that content. So is plain text if it comes to it.