Hacker News new | ask | show | jobs
by CastFX 872 days ago
I was wondering how it performs in a more complex (and realistic) benchmark like Bird?

https://bird-bench.github.io/