Hacker News new | ask | show | jobs
by hfhdjdks 981 days ago
This comment isn't saying anything. It says "LLMs statistically choose the next token" and, because of that, it can't be anything else.

We know how the LLMs were trained, so re-stating it doesn't help in anything. The point is that after the LLMs are trained they behave in certain ways and it can be helpful to say something about how it behaves.

For example, we can talk about how a linear regression can or cannot capture the causal effect of X1 over Y. "It's not capturing the causal effect. It's just minimizing the squared error" is an unhelpful statement.