Hacker News new | ask | show | jobs
by seba_dos1 313 days ago
How would asking this kind of question without providing the model with access to Wikipedia be a valid benchmark for anything useful?