Hacker News new | ask | show | jobs
by 1ilit 105 days ago
On-device CPU inference is the real flex here. Optimization probably mattered as much as modeling.