Y
Hacker News
new
|
ask
|
show
|
jobs
by
timtom123
557 days ago
I'm excited to test the final model. This could be a major breakthrough for open-source LLMs.
1 comments
arilotter
557 days ago
This specific model is only trained on 100 billion tokens, so it's not SOTA by any means, but we've got designs on larger training runs later :)
link