|
|
|
|
|
by mediaman
1132 days ago
|
|
The mistake is in believing that LLM's output should be deterministic to be useful. Human output is not deterministic. Fields with text-heavy output are already being upended by this. Being able to summarize long legal briefs, identify contract problems, do classification of discovery documents, or even write first drafts of common legal forms is already upending the legal discipline. Chat-based customer support agents are seeing 25% productivity improvements based on two-year-old models for new employees, according to a study published in NBER. Things like BabyAGI and other sequential "do anything" tools appear to be close to useless now, and unfortunately that is what is catching a lot of hype on Twitter. But actual industry applications are much quieter (often NDA) and much more impactful. |
|
It's about the LLM itself having any way to determine whether what it says is true.