Hacker News
by
xnx
298 days ago
Have you looked at comparing to Google's foundation models or specialty medical models like MedGemma (https://developers.google.com/health-ai-developer-foundation...)?
fertrevino
298 days ago
That would be an interesting extension. MedGemma isn't part of the original benchmark either [1]. Since Gemini 2.0 Flash is in 6th place, I'd expect MedGemma to score higher than that :)
[1] https://crfm.stanford.edu/helm/medhelm/latest/#/leaderboard