HN Mirror

	by cgorlla 194 days ago
	I checked with the team, and it may have been a temporary rate-limiting issue. We've corrected the results; it seems to have been an isolated case. https://www.ctgt.ai/benchmarks

2 comments

Thanks for the thoroughness! I look forward to the next steps as you all apply this approach in other ways to get even better results.

Are these benchmarks correct that adding Anthropic's Constitutional AI system prompt lowered results across all the models?