| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by eyeswideopen 699 days ago
	As written here: https://huggingface.co/nvidia/Mistral-NeMo-12B-Instruct "It significantly outperforms existing models smaller or similar in size." is a statement that goes in that direction and would allow the comparison of a 1.7T param model with a 7b one