| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by animal-husband 489 days ago
	Text generated prior to a decision to “explain” it is reasoning for the relevant intents and purposes. Text generated after a decision to “explain” it is largely nonsense.

1 comments

Kinrany 489 days ago

The true test would be seeing the behavior change depending on the presence of reasoning

link

2099miles 489 days ago

The words thinking and reasoning used here are imprecise. It’s just generating text like always. If the text is after “ai-thoughts:” then it’s “thinking” and if it’s after “ai-response” then it’s “responding” not “thinking” but it is always a big ole model choosing the most likely next token potentially with some random sampling

link

animal-husband 489 days ago

That is what was observed - o1 family models performed the “cheat”, non-reasoning models didn’t.

link