Hacker News new | ask | show | jobs
by nl 3 hours ago
Not yet that I've tried, and I'm pretty systematic about test driving them.

I keep https://sql-benchmark.nicklothian.com/#all-data up-to-date with latest releases and try out most that score 24+.

GPT 5.5+ or Opus 4.6+ are the only things I find useful like this. Notably Gemini isn't useful in this way.