Hacker News new | ask | show | jobs
by gremlinsinc 841 days ago
according to one of the ai youtubers, the mistral large llm actually scored a perfect score on their logic benchmarks, which is pretty good. All LLM's are prone to some suggestion or confusion. I wouldn't base whether it's logical or not based off an assumption from one response.