Hacker News new | ask | show | jobs
by famouswaffles 978 days ago
>You are unable to.

This is just wrong lol.

GPT-4 logits calibration pre RLHF - https://imgur.com/a/3gYel9r

Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback - https://arxiv.org/abs/2305.14975

Teaching Models to Express Their Uncertainty in Words - https://arxiv.org/abs/2205.14334

Language Models (Mostly) Know What They Know - https://arxiv.org/abs/2207.05221

1 comments

> This is just wrong lol.

The needless condescension of your “lol” feels a bit premature.

How can you have self correction without superintelligence?