Hacker News new | ask | show | jobs
by jahala 14 hours ago
There is an answer- these tools should benchmark by cost per correct answer - not just tokens saved.