Hacker News new | ask | show | jobs
by xiphias2 77 days ago
Another project without running real benchmarks. It's very easy to generate tokens, it's much harder to solve tasks locally.
1 comments

Here is a reference https://www.sharpai.org/benchmark/ For specific tasks, local model could achieve workable level.