Hacker News new | ask | show | jobs
by npilk 9 days ago
They make custom chips with a model's weights and parameters "hard-coded" which allows for much, much faster inference.