Y
Hacker News
new
|
ask
|
show
|
jobs
by
pastage
15 days ago
A quick search say that this is a standard feature you cache the prefill and load it at PCIe bandwidth so it should be about 0.2s