| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by cornhole34 1138 days ago

sorry for the tldr heres the whole rambling Large language models can exhibit "hallucinatory behavior" and generate artificial content that does not correspond to facts. This does not truly anthropomorphize the models by imbue them with consciousness, however. They are generating outputs based on the statistical patterns in their training data, not through any internal experience or self-awareness.

The response to "how much opportunity is still in front of us for adversarial LLM systems that try to detect/control for hallucinations." is by nature infinite or none (as in its futile). As "hallucinations" are whatever the developer deems to be a "hallucination". To hallucinate anthropomorphizes the model to be a human actor and leads "treatment" like a drug to be administered. A physician saying that "oh my patient is hallucinating" they have a mental disorder. This implies that there is a ground truth the developer knows to "not hallucinations". To make a model with such procedures would inherently contain any bias from the development team. Using techniques like Constitutional AI to align models with ethical values, relies on someone making that "ethical value".

Statistical artifacts or general incorrectness in responses are a more accurate to this research. Adopting a "bias mitigation" mindset, viewing bias reduction as an ongoing process of detection and correction, not a one-time fix produces its own errors or inconsistencies is a better solution, as the red tape is out of scope of the model itself. Treat every model as rouge, similar to zero trust of a computer system. If the solution is not also an AI model, then you avoid a sort of Inventors Paradox by dehumanizing people into agents.

Both of these are ideas at the current state of AI is a social dilemma, that people have been warning about for years. The nature of the words we use change our mental model and perception of the tools we create. While history shows it is something in human nature to anthropomorphize items and tools like cars and boats, they do not talk back in a human readable format. If my car started to "hallucinate" I would think I am driving inside a Herby or some other living car. The parallels made between silicon and carbon are similar but profoundly inaccurate to our current understanding, but to go down that path is off topic. As an engineer please do not anthropomorphize your creations it is unhealthy and may lead to superficial relationships. To control "Statistical artifacts" or "hallucinations" is to be the same contextually, and there is always middleware and interface management, but to "hallucinate" changes how one may perceive the ai's functions. Please do not anthropomorphize LLM.