Hacker News new | ask | show | jobs
by meame2010 496 days ago
We use gpt4o as the backward model. But I’m excited to try deepseek r1 as it has explicit reasoning available.

We are continuously adding more benchmarks to the paper with UTAustin.