Hacker News new | ask | show | jobs
by kovek 1 day ago
I thought it was known since a few years now that if you train models to NOT do certain things, then they start behaving in weird ways…
1 comments

It seems like they run a classifier model before going to Fable (or falling back to Opus), so it should be fine