Hacker News new | ask | show | jobs
by smpanaro 856 days ago
Has perplexity fallen out of favor? I didn't see it mentioned anywhere. I tried using lm-eval for the 2B model but the results seem wrong (46.1288).