Hacker News
by sdesol 563 days ago
> Different models have different strengths and weaknesses

I would add that they make different errors as well. Here are two examples where GPT-4o and Claude 3.5 Sonnet cannot tell that "GitHub" is spelled like "GitHub".

GPT-4o: https://app.gitsense.com/?doc=6c9bada92&model=GPT-4o&samples...

Claude 3.5 Sonnet: https://app.gitsense.com/?doc=905f4a9af74c25f&model=Claude+3...

I don't think there will be one model to rule them all unless there is a breakthrough. If things continue on the same path, I think Amazon, Microsoft, and Google will be the last ones standing, since they can provide models from all the major LLM players.