by nyrikki 468 days ago
Subbarao Kambhampati, who seems to only use X, is a good resource. He points out that the CoT text is not of semantic importance.

This work from his team shows how few 'reasoning' traces are valid.

https://atharva.gundawar.com/searchformer_response_analysis....

This paper shows how the scratch space takes transformers from TC0 (where they sit without it) up to PTIME.

https://arxiv.org/abs/2502.02393

OpenAI may be able to do more in the long term because they don't show the <think> tokens and can spend more of that scratch space on improving answers rather than appeasing users, but time will tell.

Remember that probabilistically checkable proofs show how random data can improve computation.

The AI field has always had a problem with wishful mnemonics.

But it is probably not a binary choice. If we could get the scratch space to reliably simulate Dijkstra's shunting-yard algorithm and convert to postfix, as an example, that would be great.
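
For anyone who hasn't seen it, here is roughly what the shunting-yard conversion to postfix looks like. This is only a minimal sketch: the operator table, the `to_postfix` name, and the assumption that the input is already split into tokens are mine for illustration, not taken from any of the linked work.

```python
# Assumed operator table for illustration: symbol -> (precedence, associativity).
OPS = {"+": (1, "L"), "-": (1, "L"), "*": (2, "L"), "/": (2, "L"), "^": (3, "R")}


def to_postfix(tokens):
    """Convert a list of infix tokens to postfix using an operator stack."""
    output, stack = [], []
    for tok in tokens:
        if tok in OPS:
            prec, assoc = OPS[tok]
            # Pop operators that bind tighter, or equally tight when
            # the incoming operator is left-associative.
            while stack and stack[-1] in OPS:
                top_prec = OPS[stack[-1]][0]
                if top_prec > prec or (top_prec == prec and assoc == "L"):
                    output.append(stack.pop())
                else:
                    break
            stack.append(tok)
        elif tok == "(":
            stack.append(tok)
        elif tok == ")":
            # Flush operators back to the matching "(".
            while stack and stack[-1] != "(":
                output.append(stack.pop())
            if not stack:
                raise ValueError("unbalanced ')'")
            stack.pop()  # discard the "("
        else:
            # Anything else is treated as an operand.
            output.append(tok)
    # Drain whatever operators remain.
    while stack:
        top = stack.pop()
        if top == "(":
            raise ValueError("unbalanced '('")
        output.append(top)
    return output


print(" ".join(to_postfix("3 + 4 * 2 / ( 1 - 5 ) ^ 2 ^ 3".split())))
# prints: 3 4 2 * 1 5 - 2 3 ^ ^ / +
```

The point is that the stack is the "scratch space" here: every push and pop is an intermediate step that has to be exactly right, which is a much stronger requirement than producing text that merely looks like reasoning.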