by johndhi
90 days ago
The article posits that sycophancy is inherent to how models are trained. I think there's a simpler explanation. Pretty much every leaked system prompt, from any model, includes instructions to "be helpful," and the models are trained to be assistants, not just general knowledge repositories or research tools. My hunch is that the system prompt is the core of the problem.