Y
Hacker News
new
|
ask
|
show
|
jobs
by
mezark
268 days ago
We look at how comparative advantage from economics applies to LLM inference - some GPUs are relatively better at FLOPs, others at memory bandwidth. What happens if you let each do what it’s best at?