| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by famouswaffles 978 days ago

>You are unable to.

This is just wrong lol.

GPT-4 logits calibration pre RLHF - https://imgur.com/a/3gYel9r

Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback - https://arxiv.org/abs/2305.14975

Teaching Models to Express Their Uncertainty in Words - https://arxiv.org/abs/2205.14334

Language Models (Mostly) Know What They Know - https://arxiv.org/abs/2207.05221

1 comments

unblough 977 days ago

> This is just wrong lol.

The needless condescension of your “lol” feels a bit premature.

How can you have self correction without superintelligence?

link