Hacker News
by seattleeng 452 days ago
It’s more like this: conditioning the response on a prefix like “Ok, so…” puts the model in a better region of latent space for answering logically, instead of just spitting out a more or less random token.
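
To make that concrete, here's a toy sketch of the idea, not a claim about how any real model works internally. It assumes a hypothetical latent "mode" (`reasoning` vs `guessing`) that the prefix shifts probability toward, and every number in it is made up for illustration.

```python
# Toy illustration only: all probabilities are invented, not measured from any model.
# The latent "mode" is a hypothetical stand-in for "where in latent space the model is".

# P(mode | prefix): a prefix like "Ok, so" is assumed to shift mass toward reasoning.
P_MODE_GIVEN_PREFIX = {
    "": {"reasoning": 0.3, "guessing": 0.7},
    "Ok, so": {"reasoning": 0.8, "guessing": 0.2},
}

# P(correct answer | mode): assumed higher when the model is in the reasoning mode.
P_CORRECT_GIVEN_MODE = {"reasoning": 0.9, "guessing": 0.25}


def p_correct(prefix: str) -> float:
    """Marginalize over the latent mode: sum_z P(correct | z) * P(z | prefix)."""
    modes = P_MODE_GIVEN_PREFIX[prefix]
    return sum(P_CORRECT_GIVEN_MODE[z] * p for z, p in modes.items())


if __name__ == "__main__":
    for prefix in ("", "Ok, so"):
        print(f"prefix={prefix!r}: P(correct) = {p_correct(prefix):.3f}")
```

The point is just that the prefix never touches the answer directly; it only changes which mode the answer gets sampled from, and that alone moves the odds.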