Hacker News new | ask | show | jobs
by mips_avatar 266 days ago
IMO the race for Latency/TPS/cost is entirely between grok and gemini flash. No model can touch them (especially for image to text related tasks), openai/anthropic seem entirely uninterested in competing for this.
1 comments

grok-4-fast is a phenomenal agentic model, and gemini flash is great for deep research leaf nodes since it's so cheap, you can segment your context a lot more than you would for pro to ensure it surfaces anything that might be valuable.
why use grok? It seems like it's constantly being throttled in order to appear more right-wing
It’s actually not. Most of the time if you ask it about a contentious political issue it will either give you a balanced view or a left-leaning one. Try it and see for yourself.
I just saw elon's tweet saying they'll fix it whenever the response is not rightwing enough