Hacker News new | ask | show | jobs
by singularity2001 499 days ago
I wonder whether even those models which emit thinking tokens in reality do most of the work within the latent space so the difference is only superficial