by jmmcd
207 days ago
"Pelican on bicycle" is one special case, but the problem (and the interesting point) is that LLMs are always generalising. If a lab focussed specifically on pelicans on bicycles, it would, as a by-product, also improve performance on, say, tigers on rollercoasters. This is new and counter-intuitive to most ML/AI people.

Like replacing named concepts with nonsense words in reasoning benchmarks.
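
For what it's worth, here is a rough sketch of what that kind of substitution could look like. Everything in it is made up for illustration: the `replace_concepts` helper, the sample question, and the nonsense words are assumptions, not taken from any particular benchmark.

```python
import re


def replace_concepts(text, mapping):
    """Swap each named concept for its nonsense stand-in, whole words only.

    `mapping` is a hypothetical dict of concept -> nonsense word.
    Matching is case-sensitive.
    """
    if not mapping:
        return text
    # Try longer names first so "pelicans" is matched before "pelican".
    names = sorted(mapping, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(n) for n in names) + r")\b")
    return pattern.sub(lambda m: mapping[m.group(1)], text)


# Illustrative reasoning item; the logical structure survives the swap.
question = "All pelicans are birds. Pip is a pelican. Is Pip a bird?"
mapping = {"pelicans": "wugs", "pelican": "wug", "birds": "daxes", "bird": "dax"}

print(replace_concepts(question, mapping))
# prints: All wugs are daxes. Pip is a wug. Is Pip a dax?
```

The idea is that the answer depends only on the structure of the question, so a model that was merely pattern-matching on familiar nouns should do worse on the rewritten version.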