| HN Mirror


	by dur-randir 772 days ago
	Based on their own numbers, the 8B seems decent, but the 34B isn't worth it compared to general-purpose models, even on the specific tasks. That's an interesting result.