Y
Hacker News
new
|
ask
|
show
|
jobs
by
pigscantfly
1303 days ago
No, an autoregressive language model is conditioned on all prior states, not the previous one.
1 comments
blueflow
1303 days ago
Multiply out the states, "all prior states" is then the "previous one". Easy to model as Markov chain.
link
The_Amp_Walrus
1303 days ago
Also 'easy' to model as a lookup table containing all possible solutions.
link
adgjlsfhk1
1303 days ago
this is technically true but the Markov chain would be too big to store even with petabytes of storage.
link
tgv
1303 days ago
Indeed. The argument boils down to: since it's finite, I can turn it into a FSA. Not only is that unhelpful, it doesn't tell you how to construct it, i.e. the learning process.
link