by acchow
814 days ago
Agreed. Those are different evaluations (that's what I meant by "Instead of comparing against"). The paper cannot conclude that "doctors are better/more correct". It assumes "here are 5 doctors who are always correct", then measures GPT's correctness against them.