Hacker News
by chad1n 1169 days ago
Why would Facebook care about running LLaMA on a CPU and optimizing for a 1-2% latency improvement when it has a lot of A100s lying around?