|
|
|
|
|
by oersted
793 days ago
|
|
It's easy to miss: select English in the dropdown.
The scores are quite different in Overall and in English for LMSYS. As I've stated in other comments, yeah... Agreed, I'm stretching it a bit. It's just that any indication of a 3.8B model being in the vicinity of GPT-4 is huge. I'm sure that when things are properly measured by third-parties it will show a more sober picture. But still, with good fine-tunes, we'll probably get close. It's a very significant demonstration of what could be possible soon. |
|
Secondly, Llama 3 usually adds first sentences like ‘What a unique question!’ or ‘What an insightful thought’, which might make people like it more than the competition because of the pandering.
While Llama 3 is singular in terms of size to quality ratio, calling the 8B model close to GPT4 would be an overstretch.