Hacker News new | ask | show | jobs
by Mockapapella 1294 days ago
> Create a list of test cases by which you can benchmark yourself against

> Create an architecture for an LLM that passes 99%+ of those test cases

Then use an evolutionary algorithm based on those <1% of cases to create the next batch of tests. Keep a running record of all created tests and make sure the new model can still pass all of them. Add some randomness/branching into those tests and I think you’d have a recipe for an effective AI. I think Deepmind did something like that with AlphaStar and their tournament system.

1 comments

Wouldn't that just result in a hyper efficient AI trained against that list of test cases only?
https://www.deepmind.com/blog/alphastar-mastering-the-real-t...

I haven't looked too deeply into it, but as I understand it you would basically create branching tests (like an evolutionary tree) where the AI would need to solve all of those tests in order to move on to the next level of tests