|
|
|
|
|
by curioussquirrel
54 days ago
|
|
There are architectural changes (such as reasoning or mixture of experts) that measurably improve how well models perform. So the improvements are definitely not just from data. I can speak for my area of expertise - multilingual capabilities. Some SOTA models are making huge strides in their support of various languages, and increasingly they understand and can produce text in languages where GPT-4 era models were absolutely lost. These are probably from a combination of richer training dataset and architectural improvements (more parameters?). I posted about this here if you're interested: https://news.ycombinator.com/item?id=47847282 Now that doesn't necessarily mean that models are also getting substantially better at English or other major languages. They likely are to some degree, but we've reached a point with major languages where core linguistic proficiencies are covered, and what's left is the more squishy part: style, tone of voice, ability to use different registers naturally, or what some people would call linguistic taste. But that's much harder to measure and therefore trickier to provide evidence for. Hope this helps. Edit: typo, clarification |
|
MoE: I assume some people just specialize in working with routing as with that, as by reducing the amount of params and just using a subset, you end up making it less costly. So, AI researchers are only working on optimizations on getting this better?
Same question on Reasoning, so AI researchers are working mostly on optimizations on top of it, like CoT and so on, like mini-optimizations.
So basically, they work on those micro-optimizations, put them together and see a % improvement in a benchmark?
I'm sure this is probably awesome for languages, which if I'm not mistaken, it was the use-case initially used on "All you need is attention" and the entire LLM revolution.
But this seems to be a very clear path to be "taking the car to the carwash by foot" for a long time, isn't it?
It feels like we'll keep "taking the car to the carwash by foot" until somebody optimizes for that prompt, or some pre-training done, and then there'll be another prompt that will show that the AI has real trouble with very basic real-world reasoning and imagination.
Isn't it the case, or do you see any kind of research that could take us from that plateau full of micro-optimizations that get us a few cm higher to the peak?