| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by hgoel 59 days ago
	Maybe referring to it as welfare is odd, but these points are important. It isn't a good look to have a model that tends to get into self-deprecating loops like one of Google's older models, it's an even worse look and potential legal liability if your model becomes associated with a suicide. An overly negative chat model would also just be unpleasant to use. With the weights being mostly opaque, these kinds of evaluations are an important piece of reducing the harm an AI model can cause.

1 comments

deflator 59 days ago

I feel that anthropomorphizing the model is also potentially very harmful. We've seen that in the LLM interactions that end in tragedy. It's the wording that bothers me.

link