by WarmWash 111 days ago
You would pass those hypothetical scenarios to doctors too, and then the results would be analyzed by doctors who don't know whether a given result came from an AI or a doctor.

From the paper

> Three physicians independently assigned gold-standard triage levels based on cited clinical guidelines and clinical expertise, with high inter-rater agreement

You're misunderstanding. What this paper did: those three physicians set a ground truth to compare the AI responses against.

What people in this thread are asking for: evaluate a set of doctors on those same cases as well, and compare doctor vs. AI accuracy.