|
|
|
|
|
by threeseed
371 days ago
|
|
> bigger models trained on bigger data with bigger reasoning posttraining and better distillation will push the horizons further and further There is no evidence this is the case. We could be in an era of diminishing returns where bigger models do not yield substantial improvements in quality but instead they become faster, cheaper and more resource efficient. |
|