Hacker News new | ask | show | jobs
by simonw 608 days ago
You misinterpreted my comment there. When I said "I have yet to see a convincing demo" I was talking about the idea of fine-tuning a model to answer questions against documentation. The rest of my comment described RAG - the exact same approach that tgittos is recommending.

Here are a few of my own RAG implementations - getting a basic version working really is something that can be done in a few hours... but getting a GOOD version working takes a LOT longer than that.

- https://simonwillison.net/2023/Jan/13/semantic-search-answer... - my first attempt at RAG, before I knew it was called that, using custom SQLite SQL functions

- https://til.simonwillison.net/llms/embed-paragraphs#user-con... - a Bash script implementation of RAG

- https://simonwillison.net/2024/Jun/21/search-based-rag/ - an implementation of RAG using SQLite full-text search (as opposed to embedding vectors), built on https://www.val.town/