| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by babhishek21 17 days ago

Looking at the results, I have some thoughts:

1. I understand the need to have the ceiling model be at a big enough factor to make for a good headline. But is it really fair to compare between two different family of models (phi3:mini vs mixtral:8x7b)?

2. The corpora is really small. Are the results here statistically significant?