Y
Hacker News
new
|
ask
|
show
|
jobs
by
cheviethai123
851 days ago
Consider how low the score of Gemini here compared to the other LLM test. And I'm impressed by the evaluation method's ability to assess performance without relying on tailored prompts.
1 comments
hoamatcuoi
851 days ago
But the benchmark only scoring Gemini-Pro 1, I'm curious how the Gemini Ultra performance here but guessed we couldn't know yet.
link