Y
Hacker News
new
|
ask
|
show
|
jobs
by
limoce
152 days ago
I think separating thinking tokens from "representing" tokens might be a better approach, like what those thinking models does