Hacker News new | ask | show | jobs
by aprilthird2021 351 days ago
> train a model to detect prompt injections (a simple classifier would work) and reject user inputs that trigger the detector above a certain threshold

What are we doing here, guys?