by unblough
976 days ago
You are unable to. “LLMs can’t self-correct in reasoning tasks, DeepMind study finds”: https://news.ycombinator.com/item?id=37823543 Anyone who says otherwise is either ignorant of the underlying function of LLMs or trying to sell you something.
This is just wrong lol.
GPT-4 logits calibration pre-RLHF - https://imgur.com/a/3gYel9r
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback - https://arxiv.org/abs/2305.14975
Teaching Models to Express Their Uncertainty in Words - https://arxiv.org/abs/2205.14334
Language Models (Mostly) Know What They Know - https://arxiv.org/abs/2207.05221