| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by numba888 492 days ago
	o1 is more than just math solver. And you cannot possibly train that much in a small model. However smaller specialized models looks to be the right way to handle world's complexity. Sort of mixture of experts on one level above. Orchestrating them will be another problem. Possible solution is generalists model "to rule them all".

1 comments

mdp2021 492 days ago

Have you considered the very practical importance of running specialized models for specialized tasks on common hardware (maybe a couple of CPU cores in a couple GB of RAM)?

link

numba888 491 days ago

Small models are just tools. Even many of them will make only a toolset. They don't evolve in AGI by themselves. But putting them together in a structure (brain) may result in something close. Like big smart calculator. It takes more to create a 'character' similar to, say, terminator.

link