Hacker News
by moffkalast 1144 days ago
> They specifically note they are training a smaller 3B model in the future.

They're kidding, right? There's no way that thing will be more useful than one of those Flan models.