Hacker News
by kevinlu1248 150 days ago
Honestly, I think we can improve our training throughput drastically via a few more optimizations, but we've been spending most of our time on model quality improvements instead.