Hacker News new | ask | show | jobs
by drusepth 433 days ago
RAG still has lots of benefits for anyone paying per input token (e.g. over APIs).
1 comments

Not to mention latency
And grounding for the model. Smaller models with tend to hallucinate a little less (anecdotally).