by gdiamos 353 days ago
Nice work anton et al.

I hope you continue the 50-100M parameter models.

I think there is a case for models that finish fast on CPUs in solve-by-LLM test cases.