> Since DeepSeek-V3 is worse than those US frontier models — let’s say by ~2x on the scaling curve, which I think is quite generous to DeepSeek-V3
He says it's ~2x worse on the scaling curve. So if it has the ~same quality [1], that would imply it's 2x larger. Unless I've misunderstood what he meant there, of course.
[1]: "DeepSeek produced a model close to the performance of US models" -- in his own words.