| No. That is NOT what it does at all. Here's a quick explainable in this thread for what it does: https://news.ycombinator.com/item?id=31489463 If you gave 100 well read, English speaking humans the following (very commonly found) sentence and asked them to predict the next character what would they do? "four score and s" Most would predict "e", then "v" etc until you get "seven". That's what a language model does. So if you give it (or humans!) a prompt that is less common or has more variation: "that is no" and ask for the most likely next character you will get a lot of variation from both humans and machines. The heat parameter in language models tune how random it will be. Both will produce English, because English words are more common than gibberish, but which works will be produced is randomish. In neither case is it doing something that one would really characterise as "reposting partially disguised sentences combining the writtings of several human authors". It is creating sentences that have never been seen before. |
Now, there is the philosophical question of whether human creators simply do the same. (Which the don't; we have other mental processes for creating ideas than predicting the next letter we are going to utter next). But that doesn't change the fact that the likeliness of each emitted word is determined by what the model has seen more often in relation to the current context and therefore it considers most "valid".