Hacker News new | ask | show | jobs
by dbreunig 39 days ago
Among benchmarkers its a frequent topic. Qwen BURNS reasoning to get its scores.