by 3abiton 309 days ago
This is an interesting approach! Why not offload PDF extraction to other frameworks that apply OCR to convert PDF -> .md?
1 comments

I may explore this when I finish the vectordb implementation I started.