Hacker News
by fredliu 1002 days ago
I might be wrong, but it looks like this could help with speculative decoding, which can already vastly improve inference speed?