Y
Hacker News
new
|
ask
|
show
|
jobs
by
cgorlla
194 days ago
I checked with the team and it may have been some temporary rate-limiting issue. We've rectified the results, it seems to be an isolated case.
https://www.ctgt.ai/benchmarks
2 comments
rancar2
194 days ago
Thanks for the thoroughness! I look forward to the next steps as you all apply this approach in other unique ways to have even better results.
link
SomaticPirate
194 days ago
Are these benchmarks correct that adding Anthropic's Constitutional AI system prompt lowered results across all the models?
link