Hacker News new | ask | show | jobs
by animal-husband 489 days ago
Text generated prior to a decision to “explain” it is reasoning for the relevant intents and purposes.

Text generated after a decision to “explain” it is largely nonsense.

1 comments

The true test would be seeing the behavior change depending on the presence of reasoning
The words thinking and reasoning used here are imprecise. It’s just generating text like always. If the text is after “ai-thoughts:” then it’s “thinking” and if it’s after “ai-response” then it’s “responding” not “thinking” but it is always a big ole model choosing the most likely next token potentially with some random sampling
That is what was observed - o1 family models performed the “cheat”, non-reasoning models didn’t.