|
|
|
|
|
by CuriouslyC
787 days ago
|
|
They didn't compare against the best models because they were trying to do "in class" comparisons, and the 70B model is in the same class as Sonnet (which they do compare against) and GPT3.5 (which is much worse than sonnet). If they're beating sonnet that means they're going to be within stabbing distance of opus and gpt4 for most tasks, with the only major difference probably arising in extremely difficult reasoning benchmarks. Since llama is open source, we're going to see fine tunes and LoRAs though, unlike opus. |
|