by dkdcio
177 days ago
…except we know what every neuron in a neural network is doing. I ask again: what criteria do we need to meet for you to claim we know how LLMs work? We know the equations, we know the numbers going through a network, we know the universal approximation theorem. What are you looking for, exactly? I've answered the "what have they learnt" bit: a function that predicts the next token based on data. What more do you need?
edit: We know the data that their function outputs; it's a "blurry jpeg of the internet" because that's what they're trained on. But we do not know what the function is, and being able to blurrily compress the internet into a TB or whatever is utterly beyond any other compression algorithm known to man.