

	by acchow 814 days ago
	Agreed. Those are different evaluations (that is what I meant by "Instead of comparing against"). The paper cannot conclude that "doctors are better/more correct". It assumes that "here are 5 doctors who are always correct", then measures GPT's correctness against them.