|
|
|
|
|
by kgeist
851 days ago
|
|
It's verbose because it was finetuned to be "helpful" (the way OpenAI sees it). You can fix it with a system prompt, or finetune the base model with the format you want. Same with grounding it with RAG. Sure if you take a vanilla LLM and make no effort to adapt it to your app's needs, it's going to have subpar results. At least, verbosity is not an inherent problem of LLMs, it's a specific issue of a specific finetune. Hallucinations are a real problem indeed. However, being wrong and being very confident about your answer is something LLMs share with humans. LLMs can already have value if they're wrong less often than the average human (and some benchmarks suggest so). |
|
For example, 100 is defined as the average IQ, but it's almost certain that every human who comments on HN has an IQ much above that.
It's also problematic when LLMs are better not because they're getting better, but the average human is declining.