|
|
|
|
|
by fullstackchris
1226 days ago
|
|
It's really just math though. Any "LLM" isn't really "thinking" in any spoken or written language, but rather in a massive series of weighted matrices (numbers). I've commented a few times here and there about this AI hype, but might as well repeat myself: I think people largely misunderstand the technology and I see major missing aspects that are non-trivial to solve before we really get to anything looking like iRobot (or insert here any other scifi of your choice). These input / output models can only go so far, even if they are ever increasing in size. We don't just need 2 or 3 prompt memory, but full dynamic memory that the model can access throughout it's lifetime as well as the ability for the model to reflect and introspect on itself (much like human thought and communication). Without these things, an LLM will just remain an LLM, albiet larger and larger. Unfortunately I don't think size for sizes sake will bring much more improvement to such models. Aside from any of the aforementioned breakthroughs being incorporated, I see this type of chat GPT stuff plateauing in ~1-2 years. |
|