Hacker News
by simianwords 85 days ago
Can someone produce a single example under 20 characters that fails with the latest thinking model? I can't seem to reproduce it.