Hacker News new | ask | show | jobs
by NLPaep 1015 days ago
Benchmark results in publications show it being confused in chat—like settings and answering questions incorrectly