|
|
|
|
|
by blueblimp
841 days ago
|
|
I wonder what's happening to all that money. Back when they originally released Claude, they were second only to OpenAI as far as chatbot models were concerned. Although Claude wasn't as smart as GPT-4, it had a more pleasant writing style, and Anthropic later released 100k context. At the time, I expected Anthropic to be the next company to release a GPT-4-level model. But since then Claude has been passed by Mistral's mistral-medium and Google's Gemini Ultra. More concerningly for Anthropic, each subsequent release of Claude has actually performed _worse_ on the Chatbot Arena Leaderboard. (Claude-1 outranks Claude-2.0, which outranks Claude-2.1.) The reason for the decline in ranking is seemingly that the most noticeable update is to make the model refuse more requests. In an additional blow, the needle-in-a-haystack independent benchmark revealed that Claude's long context is not actually used effectively by the model. All-in-all, Anthropic is not looking in a good spot, despite the massive investment. They need to start releasing legitimately better models, or risk irrelevance. |
|
Part of me really wants to get into the game somehow, as it looks very invigorating and motivating from outside. Although not entirely sure where I would need to start as I'm not at some cutting edge AI/ML company.