|
|
|
|
|
by yiyingzhang
1 day ago
|
|
"Safety evals are an exception
I believe eval startups can work when they're targeting safety benchmarks specifically. Researchers who want to work on safety evals tend to be ideologically opposed to working on capabilities, which means they don't migrate to post-training or applications due to monetary incentives." This is quite interesting. Seems more relevant in 2026. |
|