by
rafaelmn
489 days ago
You can say R1-604b to disambiguate, just as we have Llama 3 8B/70B, etc.
pythux
489 days ago
These models are not of the same nature either: they were trained in different ways. A uniform naming scheme (even one with an explicit parameter count) would still be misleading.