|
|
|
|
|
by n0id34
510 days ago
|
|
Is AI fizzing out or just me? I feel like they're trying to smash out new models as fast as they can but in reality they're barely any different, it's turning into the smartphone market. New iPhone with a slightly better camera and slightly differently bevelled edges, get it NOW! But doesn't actually do anything better than the iPhone 6. Claude, GPT 4 onwards, and DeepSeek all feel the same to me. Okay to a point, then kinda useless. More like a more convenient specialised Google that you need to double check the results of. |
|
Compare LLMs from a year or two ago with the ones out today on practically any task. It's night and day difference.
This is specially so when you start taking into account these "reasoning" models. It's mind blowing how much better they are than "non-reasoning" models for tasks like planning and coding.
https://aider.chat/docs/leaderboards/#aider-polyglot-benchma...