by davedx 652 days ago
> Modern ANN architectures are not actually capable of long-term learning

What do you think training (and fine-tuning) does?

1 comment

That's not how we (today) practically interact with LLMs, though.

No LLM currently adapts to the tasks it's given on an iteration cycle shorter than months (and that assumes your conversations serve as future training data; otherwise it doesn't adapt at all).

No current LLM can digest its "experiences", form hypotheses (at least outside of being queried), run thought experiments, then actual experiments, and then update based on the outcome.

Not because it's fundamentally impossible (it might or might not be), but because in practice we haven't built anything even remotely approaching that type of architecture.