Hacker News new | ask | show | jobs
by zacksiri 96 days ago
I tested this model in an agentic workflow, it failed at some very basic tasks:

https://upmaru.com/llm-tests/simple-tama-agentic-workflow-q1...