Y
Hacker News
new
|
ask
|
show
|
jobs
by
meame2010
496 days ago
We use gpt4o as the backward model. But I’m excited to try deepseek r1 as it has explicit reasoning available.
We are continuously adding more benchmarks to the paper with UTAustin.