Hacker News new | ask | show | jobs
by refulgentis 860 days ago
No, it benchmarked around the original release of GPT-4 given 32 attempts versus GPT-4's 5.