Y
Hacker News
new
|
ask
|
show
|
jobs
by
NLPaep
1015 days ago
Benchmark results in publications show it being confused in chat—like settings and answering questions incorrectly