|
|
|
|
|
by IIAOPSW
1189 days ago
|
|
>I was saying that we only need to model sentences that are short enough that nobody will notice that the plot is lost with longer ones. Thats one of the things on my short list of unsolved probs. People remember oddly specific and arbitrarily old details. Clearly not a lossless memory, but also not an agnostic token window that starts dropping stuff after n tokens. I think we agree then that a plain superficial model gets you surprisingly far, but does lose the plot. It is certainly enough for things that are definable purely as and within text (the examples I gave). Beyond that who knows. |
|
Yes, I agree with you. I just tend to go on :P