Hacker News
by whimsicalism 516 days ago
google absolutely games the lmsys benchmarks with markdown styling. r1 is better than google's flash thinking; you are putting way too much faith in lmsys.