|
|
|
|
|
by zozbot234
94 days ago
|
|
> maybe even learned prefetching for what the next experts will be Experts are predicted by layer and the individual layer reads are quite small, so this is not really feasible. There's just not enough information to guide a prefetch. |
|