Hacker News new | ask | show | jobs
by moojacob 425 days ago
I messed up, I apologize.

I looked at the NDCG and thought that was the dataset.since voyage and cohere both used NDCG. I now realize it was separate benchmarks with the same evaluation metric.