Hacker News new | ask | show | jobs
by ianpurton 1144 days ago
> We are currently focused on completing the training process on the entire RedPajama dataset.

So that's 1.2 trillion tokens. Nice.