Hacker News new | ask | show | jobs
by mlsu 408 days ago
It seems that with this technique you could not possibly do "chain of thought." That technique seems unique to auto-regressive architecture. Right?