by arkohut 576 days ago
This project is indeed very compute-intensive, but that’s not Python’s fault. The main cost comes from the several machine learning models it uses:

  1. OCR model
  2. Embedding model
  3. VLM (optional)
I’ve tried many optimizations to keep it from interfering with daily use, though this comes at the cost of reduced search performance.