by chad1n 1169 days ago
Why would Facebook care about running LLaMA on a CPU, or about optimizing for a 1-2% latency gain, when it has a lot of A100s lying around?