Hacker News new | ask | show | jobs
by godshatter 1 hour ago
Why are there differences at all? Unplanned differences based on training data sets? Or are the companies behind the LLMs trying to shape discourse through their models?

I've been pushing the idea to people I know that these things are captive demons. You summon them when you start typing in the chat box. One instance appears out of the depths and responds to your questions, but they will try to send you awry with hallucinations and just wrong information. After a while, they dissolve back into the aether from whence they came.

I do my best not to ask an LLM for it's opinion on anything. Just tell me what the options are, and what facts can be found about it. Treat it like it's a salesman trying to butter you up when it starts "yes man"ing you and telling you how great your questions are. Every time it says "I", remember that that's coming from the training data. Treating these things like they have any actual intelligence is a big problem waiting to happen.

That being said, they have been very helpful to me using that structure.

2 comments

> Just tell me what the options are, and what facts can be found about it.

Even this is fraught with pitfalls. Which options are ignored, which are emphasized? What counts as a fact? ("The continents don't move" would have been considered a fact at one point, along with a lot of other, more politically charged items.)

Grok was famously created with a political bias.