by redskyluan 1044 days ago
The inclusion of the OpenAI dataset in this benchmark adds a layer of realism that's often missing from standardized tests built on datasets like SIFT and DEEP.