by timtom123 557 days ago
I'm excited to test the final model. This could be a major breakthrough for open-source LLMs.
1 comment

This specific model was trained on only 100 billion tokens, so it's not SOTA by any means, but we've got designs on larger training runs later :)