|
|
|
Show HN: Good AI Task – a tool for asking AI what it can and can't do
(goodaitask.com)
|
|
6 points
by jmt710
51 days ago
|
|
Describe a task, and AI will give you a breakdown of whether it can do your task well, poorly, or somewhere in between. I built it mostly because I kept getting asked "what is AI even good for" and fumbling the answer. The most fun use is testing it on things you already know it can't do and seeing how it explains why it can't be done. |
|
There a model gets a prediction market scenario (that in reality has already closed, but not from the model's POV), and it is tasked to predict the outcome AND give its confidence in the prediction.
Conclusion turns out to be "systematic overconfidence across all models". Probably worth keeping an eye on such research, might enable you to make the product better over time as new research comes out etc.