Y
Hacker News
new
|
ask
|
show
|
jobs
by
quantadev
630 days ago
They simply ask the AI a question about a large document (or set of docs). It either gets the answer right or wrong. They count the number of hits and misses.