| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by dpoloncsak 151 days ago
	Yeah, one of my first projects one of my buddies asked "Why aren't you using [ChatGPT 4.0] nano? It's 99% the effectiveness with 10% the price." I've been using the smaller models ever since. Nano/mini, flash, etc.

3 comments

sixtyj 151 days ago

Yup.

I have found out recently that Grok-4.1-fast has similar pricing (in cents) but 10x larger context window (2M tokens instead of ~128-200k of gpt-4-1-nano). And ~4% hallucination, lowest in blind tests in LLM arena.

link

verdverm 151 days ago

You use stuff from xAi and Elmo?

I'm unwilling to look past Musk's politics, immorality, and manipulation on a global scale

link

rudhdb773b 151 days ago

Grok is the best general purpose LLM in my experience. Only Gemini is comparable. It would be silly to ignore it, and xAI is less evil than Google these days.

link

naught0 149 days ago

When's the last time Sundar Pichai did a Hitler salute or had his creation calling itself "Mecha Hitler"?

link

rudhdb773b 149 days ago

In the big picture, those events are insignificant compared to the negative impacts on society from Google's trillion dollar advertising business and the associated destruction of privacy.

link

naught0 148 days ago

fair points, but we'll have to see now that grok is in the pentagon. sky's the limit

link

phainopepla2 151 days ago

I have been benchmarking many of my use cases, and the GPT Nano models have fallen completely flat one every single except for very short summaries. I would call them 25% effectiveness at best.

link

verdverm 151 days ago

Flash is not a small model, it's still over 1T parameters. It's a hyper MoE aiui

I have yet to go back to small models, waiting for the upstream feature / GPU provider has been seeing capacity issues, so I am sticking with the gemini family for now

link

walthamstow 151 days ago

Flash Lite 2.5 is an unbelievably good model for the price

link