Hacker News new | ask | show | jobs
by vanschelven 384 days ago
Including literal 404s... As an outsider it has always struck me as absurd that they don't just do the equivalent of wget over all provided sources.
1 comments

Or why the LLM doesn’t do a lookup into a subset of the training data as a database and reject the output if it seems to be wrong. A billion of the most urls and the entirety of Wikipedia, arkiv and stackoverflow would go a long way.
If that could be done, then we would be using that and skipping the llms entirely
Can’t see why that couldn’t be done? You save a http request for a ton of the urls.
Because if the llm could tell right from wrong, it wouldn't have to do this in the first place. It's like the bible clainming it's true because the bible says it's true. Circular logic.