Hacker News new | ask | show | jobs
by hootz 17 days ago
I'd love to see a tokens per second metric. I always prioritize speed over raw intelligence for flash models.
1 comments

> I always prioritize speed over raw intelligence for flash models.

This model might have a perfect speed:

    for i in range(100):
      print(random.choices(words))
Leave it long enough, and it'll print the work of Shakespear!