Hacker News new | ask | show | jobs
by jballanc 62 days ago
We need benchmarks that can distinguish between continuous learning and long-context extrapolation.
1 comments

oh that's easy: continuous learning is not something current architectures can do. So the benchmark for that can be done mentally