Hacker News new | ask | show | jobs
by freejazz 1227 days ago
> letting the models acquire knowledge from these copyrighted works

I'm gagging at the nonsensical anthropomorphizing being done to end-run the fact that what the LLMs are doing is copying.

1 comments

You make a lot of condescending or toxic remarks on HN. You might want to consider how that affects your ability to sway others with your comments.

Please chill

> the fact that what the LLMs are doing is copying.

I disagree, the training process creates token representations and weighted connections between them. The models later produce probabilistic token sequences, not so unlike what our meat bodies do, though by very different mechanisms. The fact that certain sequences can be reproduced verbatim is likely a consequence of overfitting. They certainly cannot reproduce all training data verbatim. It would be interesting to know the features around what can and cannot be, and how.

> The models later produce probabilistic token sequences, not so unlike what our meat bodies do, though by very different mechanisms.

Your response to me calling out your baseless anthropomorphizing was to double down on it? It's amusing to me that you don't think you are condescending.

It seems you don't know what anthropomorphizing actually means based on your over application of the word.

ChatGPT can help you with that ;]