|
|
|
|
|
by WarmWash
34 days ago
|
|
Opus is estimated to be around 4T parameters, and 5.5 around 9T. [1] And while 3.5 at least qualifies to be in the same neighborhood, which is stunning if these numbers are all true, it may be that closing that last ~10% difference needs 50x more parameters. [1]https://arxiv.org/pdf/2604.24827 |
|
https://x.com/justanotherlaw/status/2050399317782155726 https://www.lesswrong.com/posts/veFMEzDDyWaer2Sms/sanity-che...