Y
Hacker News
new
|
ask
|
show
|
jobs
by
vitaflo
185 days ago
I also have my own tricky benchmark that up til now only Deepseek has been able to answer. Gemini 3 Pro was the second. Every other LLM fail horribly. This is the main reason I started looking at G3pro more seriously.