Y
Hacker News
new
|
ask
|
show
|
jobs
by
ac29
10 days ago
None of those settings set the speculative decoder to accept 100% of drafted token. I assume you are looking at --draft-p-min 0.0, if so, you are misunderstanding what it does.