Hacker News new | ask | show | jobs
by vitally3643 6 hours ago
As per usual, the current Claude model's performance took a sharp nosedive the moment the new model was announced. Compared to the now-handicapped Sonnet model, Fable seems pretty smart I guess.

But it also really, really wants to burn tokens. I asked it to look into a fairly straightforward database bug in my RN app, and while I was off getting coffee it decided to spin up an android emulator unprompted and started navigating the app by reading screenshots and injecting touch events. There went my entire week's tokens. There was no reason to even start the emulator, the bug wasn't graphical, so I have no clue what it was doing.