Y
Hacker News
new
|
ask
|
show
|
jobs
by
Glemllksdf
60 days ago
Isn't that some kind of gambling if you offload random experts onto the CPU?
Or is it only layers but that would affect all Experts?
2 comments
dragonwriter
60 days ago
Pretty sure all partial offload systems I’ve seen work by layers, but there might be something else out there.
link
namibj
59 days ago
Speculative decoding is already gambling.
link