by CSMastermind 808 days ago
It sounds like you want RAG (retrieval-augmented generation) rather than training or even fine-tuning a model.

Have you looked into the OpenAI APIs? They make this relatively easy, assuming you have some basic programming knowledge.


I'm currently looking to implement RAG locally, using Qdrant [1], for instance.

Just playing around for now, but it makes sense to have a runnable example for our users too :) [2].

[1] https://qdrant.tech/
[2] https://aimeetingbot.com

I briefly looked into the Assistants API recently (if that's what you're referring to), but it seems relatively new, if I'm not mistaken.
Are you referring to the "Assistants" API?
I was not. That's relatively new, though if I'm not mistaken it might make the process easier.

I mean calling the embeddings API and then having local software that finds relevant documents and appends them to your queries.
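
For anyone who wants to see the shape of that, here is a rough sketch of the retrieve-then-append loop. It is only an illustration under some loud assumptions: `embed` is a toy bag-of-words stand-in I made up so the example runs offline (in practice you would swap in a call to a real embeddings API), the sample documents are invented, and `retrieve` and `build_prompt` are hypothetical helper names, not part of any library.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embeddings API call (assumption, not a real API).

    Returns a sparse word-count vector so the example runs without a network.
    """
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, docs, doc_vectors, k=2):
    """Return the k documents most similar to the query."""
    q = embed(query)
    ranked = sorted(
        range(len(docs)),
        key=lambda i: cosine(q, doc_vectors[i]),
        reverse=True,
    )
    return [docs[i] for i in ranked[:k]]


def build_prompt(query, context_docs):
    """Append the retrieved documents to the user's query."""
    context = "\n".join(f"- {d}" for d in context_docs)
    return (
        "Answer using the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}"
    )


# Invented sample documents, embedded once up front.
docs = [
    "Qdrant is a vector database for similarity search.",
    "Fine-tuning updates model weights on new training examples.",
    "Embeddings map text to vectors so similar text ends up close together.",
]
doc_vectors = [embed(d) for d in docs]

query = "What is a vector database?"
top = retrieve(query, docs, doc_vectors, k=1)
prompt = build_prompt(query, top)
print(prompt)  # this string is what you would send to the chat model
```

With real embeddings, only `embed` should need to change. Once the document set grows, the linear scan in `retrieve` is the part a vector store would replace.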