Hacker News
by
rasbt
846 days ago
Yes, it's definitely unfair to count it as a 7B model. In that case, we could call Llama 2, which is 6.6B parameters, a 6B (or even 5B) parameter model.
neodymiumphish
838 days ago
Except 6.6 rounds to 7. That’s completely reasonable. Arguing otherwise is pedantic.