Y
Hacker News
new
|
ask
|
show
|
jobs
by
ChadNauseam
224 days ago
That's an implementation detail I believe. But what I meant was just greedy decoding (picking the token with the highest logit in the LLM output), which can be implemented very easily