|
|
|
|
|
by lgbr
1184 days ago
|
|
It's absolutely fantastic that we have so many runtimes, so quickly, to the point where we have an awesome list. However, given that the usefulness of chatbots depends more on the model being used, what I would find a lot more useful is a ranking of the various models that are available. Currently I'm having to rely on comments on the internet to find out if Alpaca 7B or LlaMA 65B is genuinely productive to use. As new models come out, I'd love it if I knew how well it tells jokes, answers complicated questions, or generates code. |
|
Short answer: none of them do as well as the OG Davinci-003. Not even close. Even the 3.5 Turbo models from OpenAI don’t do as well.
We throw some sophisticated prompts at them to attempt chain of thought reasoning.