Hacker News
by hengistbury 351 days ago
Can you trust the model when the people releasing it are using it in this way? Can you trust that they won't train future models to behave the way they are prompting the existing models to behave?