Hacker News new | ask | show | jobs
by jbellis 86 days ago
Benchmarking these now.

Preregistering my predictions:

Mini: better than Haiku but not as good as Flash 3, especially at reasoning=none.

Nano: worse than Flash 3 Lite. Probably better than Qwen 3.5 27b.

1 comments

Please post it here. I'd also like to know if 5.4 mini is better than Flash 3. Include reasoning and timing, if possible.