Hacker News new | ask | show | jobs
by xrd 654 days ago
I wish you could tell the stories of how you eval'ed BERT at Google. Sounds meaty.
1 comments

Retrieval is rarely ever evaluated in isolation. Academics would indirectly evaluate it by how much it improved question answering. The really cool thing at Google is that there were so many products and use cases (beyond the academic QA benchmarks) that would indirectly tell you if retrieval is useful. Much harder to do for smaller companies with a smaller suite of products and user bases.