Hacker News new | ask | show | jobs
by james-revisoai 1147 days ago
Langchain --> query_with_sources function (semantic search context)

If you finetune, you'll get mostly the style and sometimes worse factual performance. Source through semantic search context/RAG is a better approach.

Source: Creator of AnyQuestions.ai which works like that plus some extra neat tricks. (released the first demo in August 2022 before the onslaught)

1 comments

When are you actually billed? First, when you create embeddings at the beginning. Second, every time you call the OpenAI chat complete API?
Yes, that's correct. Then OpenAI bills you around the end of the month so you have some leeway of credit.

You can use local embeddings, such as sentence transformer models, too, to save on embeddings (and maintain embeddings you can move to another provider with new queries etc)