Hacker News new | ask | show | jobs
by xcv123 1135 days ago
No it is not Markov-like. GPT models are not Markov processes by definition. They take into account all previous words in the sequence when generating the next word. They have a type of memory in the form of an attention mechanism that refers to multiple previous states when generating tokens.

They are not human-like and they are not Markov-like. GPT is a separate category.