|
|
|
|
|
by bheadmaster
261 days ago
|
|
I don't disagree with the conclusion, I disagree with the reasoning. There's no reason to assume that models trained to predict a plausible next sequence of tokens wouldn't eventually develop "understanding" if it was the most efficient way to predict them. |
|