Hacker News new | ask | show | jobs
by zahlman 118 days ago
My assumption has been that emitting those tokens is part of the inference, analogous to humans "thinking out loud".
1 comments

You're absolutely right!