Hacker News
by suchintan 523 days ago
I wonder if more companies should open-source the model outputs from their evals alongside the eval results.

We tried doing that here at Skyvern (eval.skyvern.com).