Hacker News new | ask | show | jobs
by gistscience 58 days ago
Yeah I can imagine these popular benchmarks get special treatment in the training of new models. I wonder how they would perform for "Elephant riding a car" or "Lion sleeping in a bed"