Hacker News new | ask | show | jobs
by nikolayasdf123 606 days ago
+1 1B and 3B models perform so poorly, it is bellow any acceptance for us. and we have fairly simple natural language understanding.