Y
Hacker News
new
|
ask
|
show
|
jobs
by
kyboren
79 days ago
Yes, but bigger models are still more capable. Models shrinking (iso-performance) just means that people will train and use more capable models with a longer context.
1 comments
sipjca
79 days ago
Of course they are! Both are important and will be around and used for different reasons
link