| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by zozbot234 537 days ago
	The really nice thing about this is that the AI can now acquire these newly-decoded texts as part of its training set, and begin learning at a geometric rate.

4 comments

nitwit005 536 days ago

With our current methods, feeding back even fairly small amounts of outputs back in as training data leads to declining performance.

Just think of it abstractly. The AI will be trained on the errors the previous generation made. As long as it keeps making new errors each generation, they will tend to multiply.

link

red75prime 536 days ago

Degradation of autoregressive models being fed their own unfiltered output is pretty obvious: it's, basically, noise being injected into the ground truth probability distribution.

But. "Our current methods" include reinforcement learning. So long as there's a signal indicating better solutions, performance tends to improve.

link

zeofig 536 days ago

Why not just feed it random data? It's so smart that it will figure out which parts are random, so eventually you will generate some good data randomly, and it will feed on it, and become exponentially smarter exponentially fast.

link

Validark 536 days ago

This is actually hilarious and I'm sad you are getting downvoted for it.

link

mistrial9 537 days ago

errors in => errors out

link

WhereIsTheTruth 536 days ago

Don't forget to spice it up with some bias!

https://x.com/i/grok/share/uMwJwGkl2XVUep0N4ZPV1QUx6

link

rzzzt 536 days ago

But do I want to see ancient programming advice written in Linear B?

link