| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by kerupian 778 days ago

I watched the demo. This looks quite interesting to me.

I have done some work on PDFs before and I know extracting info. from PDF is hard.

Kudos to you for building a search for scanned PDFs.

Do I have to manage Chunking for the search engine?

You mentioned about APIs. Do you support multiple clouds? For example, I have some data Dropbox, S3, GDrive, and R2. Will I be able to connect all these clouds?

Can you tell me more about data security?

Either way, looks impressive for data engineering and ML pipelines.