Hacker News new | ask | show | jobs
by ma1ms 1154 days ago
I also have created a question answering over pdf using LLMs, but with several advantages: I have used SentenceTransformers to make it faster and free of cost. You can use OpenAI embeddings or other ones. I have used Qdrant cloud (free tier) to host my embeddings and textual documents for fast search and retrieval.

By doing this, I was able to reduce costs (credit usage) significantly, while still achieving high performance. Plus, the smaller embeddings were faster and available for free. I only needed to call the OpenAI API for response generation, and by optimizing the context and selecting the most relevant search results, I was able to achieve highly accurate responses. Here's the repo: https://github.com/mallahyari/drqa