Hacker News
by jfaganel99, 1 day ago
For the sceptics: the benchmark is research-based, with a published arXiv paper on the methodology:
https://arxiv.org/abs/2604.13764