Hacker News new | ask | show | jobs
by AIsore 769 days ago
These experiments seem pretty large already though, no? How are you so sure they messed up benchmarking? Is the code out already?