| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by int_19h 83 days ago
	It goes both ways though. All that extra stuff is also a part of our "training set" when growing up. And we have already seen that training models on vision etc improves their text outputs as well, even in tasks that aren't directly connected to visual things. That might account for a lot of our advantages. But yes, of course it's not just a scale issue. Note though that a "finished model" can still be fine-tuned, and you can in fact allow it to fine-tune itself even. It's just that this is prohibitively expensive in practice (once again, the hardware is lagging behind the wetware here).