Hacker News new | ask | show | jobs
by Lalabadie 120 days ago
It's an 8B parameter model from a good while ago, what were your expectations?