Hacker News
by 112233, 253 days ago
Given how badly most models degrade once they reach a particular context size (any whitepapers on this are welcome), reasoning does seem like a quick hack rather than a thought-out architecture.