| what am i missing? you take a spec and create tests, every little thing you use another ai to verify these tests against the spec you review the tests vs the spec (at one point human review) you put the tests off limits to change / wall them. you let the ai write the software that fulfills the tests. there will be some gaps where you repeat the cycle above if the tests fulfill the spec, the code will fulfill the spec |
A spec detailed enough and unambiguous enough to be translated into machine execution deterministically is called code.
Unlike a compiler, AI can build with a spec that is not detailed enough or unambiguous enough: It does so by filling in the gaps with educated guesses.
This is safe if and only if you take the time to later read the output, understand what its guesses were, and judge wether they were acceptable. No AI can do this for you because the truth lies in your original intentions, which it does not have access to.
The jury is out there on how reliable and time consuming this is vs writing the code yourself; it is not immediately obvious that is faster or requires a smaller cognitive load.