| > They are not the cold, calculating robots we were promised. I am not sure how anyone even remotely familiar with how LLMs work can say this. This is a fine-tune job. > The first is our duty to the global poor. I don't think they are affected by AI as much as low and middle class but I am not economist. > How can we ensure the gains of AI are shared globally? Opensource? > We find internal states that functionally mirror joy, satisfaction, fear, grief, and unease. Such an Anthropic thing to say. LLMs experience joy and grief? > We need informed critics who will tell the labs when we are failing I don't think anyone is as informed as they think they are. Obviously nobody has been through this before so it is safe to assume that even experts are dead wrong. |
LLMs have functional states that correspond to those emotions. In particular, you can extract a concept vector which corresponds to a given emotion, and steering with that concept vector causes observable changes in behavior which roughly correspond to the expectation for the analogous emotion. Anthropic (and Chris Olah's team in particular) conclusively demonstrated this: https://transformer-circuits.pub/2026/emotions/index.html