Hacker News new | ask | show | jobs
by 112233 253 days ago
Given how badly most models degrade once reaching a particular context size (any whitepapers on this welcome), reasoning does seem like quick hack, instead of a thought out architecture.