Hacker News
by toxik 1173 days ago
The part where they use GPT-4 to decide whose answer is better really highlights how poor GPT-4's judgement is.