Hacker News new | ask | show | jobs
by raverbashing 1094 days ago
Variations of specializations I guess

For writing code you don't care about feeding world history to your model. So a smaller model might be better at a specialized task

Sure, having a big multi-modal-model is great, but by having specialized models you can spread tasks better

1 comments

But I am sure prompt understanding improves with more text data. Same with reasoning ability.