|
|
|
|
|
by nullbio
10 days ago
|
|
I know the big labs like to pretend that their models are trillion parameter. But how likely is that really to be the case when Qwen 3.6 35B A3B gets so close to their performance? Seems that with the best research applied, best training data, they'd be able to top the charts with a 60B model quite easily. |
|
Because if they don't imply that size is needed for every task, they'll end up tanking their valuations.
https://blog.nilesh.io/post/ai-profit-race