|
|
|
|
|
by GardenLetter27
509 days ago
|
|
Almost all deployed ML systems work like this. I.e. for classification you can judge "certainty" by the soft-max outputs of the classifier, then in the less certain cases can refuse to classify and send it to humans. And also do random sampling of outputs by humans to verify accuracy over time. It's just that humans are really expensive and slow though, so it can be hard to maintain. But if humans have to review everything anyway (like with the EU's AI act for many applications) then you don't really gain much - even though the humans would likely just do a cursory rubber-stamp review anyway, as anyone who has seen Pull Request reviews can attest to. |
|