| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by buster 323 days ago
	must say I am suspicious in this regard, as they don't show applications other than a Sudoku solver and don't discuss downsides.

1 comments

Oras 323 days ago

and the training was only on Sudoku. Which means they need to train a small model for every problem that currently exists.

Back to ML models?

link

JBits 323 days ago

I would assuming that training a LLM would be unfeasible for a small research lab, so isn't tackling small problems like this unavoidable? Given that current LLMs have clear limitations, I can't think of anything better than developing beter architectures on small test cases, then a company can try scaling it later.

link

lispitillo 323 days ago

Not only on Sudoku, there is also maze solving and ARC-AGI.

link