Hacker News new | ask | show | jobs
by varispeed 41 days ago
These things are not going to be reliable if you don't know when your session will be routed to inferior model. I stopped using Opus because of that. I had to always create verification task first (a non trivial problem) for the model to prove itself it is "Opus grade" before giving it actual task, but then I found performance often was suddenly severely degraded (model suddenly being dumb as sack of potatoes). This tells me this is not ready for any serious work.