Hacker News
by mirekrusin 310 days ago
Exactly. They should at least compare against judges that are the best models from other providers, ideally with the results verified against human review, ground truth, or tests.