Show HN: A/B test Gemini, ChatGPT and Claude

Y	Hacker News new \| ask \| show \| jobs

	Show HN: A/B test Gemini, ChatGPT and Claude (news.ycombinator.com)
	2 points by eclair99 792 days ago
	I built this app for macOS to A/B test LLMs side-by-side. Also, I feel that having your prompt answered by two LLMs is a quick and easy way to confirm you are not falling for hallucinated information.

1 comments

Any insights into which is better from the using the app?

Thanks for checking out the tool!

I personally feel Claude3 outperforms Gemini and ChatGPT. But lack of web-browsing is a slight dis-adv of claude.