Hacker News new | ask | show | jobs
by quantadev 630 days ago
They simply ask the AI a question about a large document (or set of docs). It either gets the answer right or wrong. They count the number of hits and misses.