by chessgecko 1398 days ago
Maybe it’s to speed up multi-GPU matrix multiplies. They’re useful for serving and training GPT-3-size models.
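
For anyone wondering what a multi-GPU matrix multiply looks like, here is a rough sketch. It uses plain NumPy arrays standing in for GPUs, and `sharded_matmul` and `num_devices` are names I made up for illustration, not any real library's API. The weight matrix is split column-wise, each "device" multiplies the input by its own shard, and the pieces are gathered at the end. On real hardware that gather is the cross-device traffic a faster interconnect would help with.

```python
import numpy as np

def sharded_matmul(x, w, num_devices):
    """Column-sharded matmul, with NumPy arrays simulating separate devices."""
    # Split the weight matrix column-wise, one shard per simulated device.
    shards = np.array_split(w, num_devices, axis=1)
    # Each device multiplies the full input by its own shard of the weights.
    partials = [x @ shard for shard in shards]
    # Gather the partial outputs; on real GPUs this is the communication step.
    return np.concatenate(partials, axis=1)

rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))   # activations
w = rng.standard_normal((8, 6))   # weights, too large for one device in the real case
y = sharded_matmul(x, w, num_devices=2)
```

The result matches the unsharded `x @ w`; the only thing that changes is where the work and the memory live.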