|
|
|
|
|
by satisfice
358 days ago
|
|
This reads like a collection of ad hoc advice overfitted to experience that is probably obsolete or will be tomorrow. And we don’t even know if it does fit the author’s experience. I am looking for solid evidence of the efficacy of folk theories about how to make AI perform evaluation. Seems to me a bunch of people are hoping that AI can test AI, and that it can to some degree. But in the end AI cannot be accountable for such testing, and we can never know all the holes in its judgment, nor can we expect that fixing a hole will not tear open other holes. |
|