| Those terms sound similar to biological concepts but they’re very different. Neural networks are not like brains. They don’t grow new neurons. A “neuron” in an artificial neural net is represented with a single floating point number. Sometimes even quantized down to a 4 bit int. Their degrees of freedom are highly limited compared to a brain. Most importantly, the brain does not do back propagation like an ANN does. LSTMs have about as much to do with brain memory as RAM does. Attention is a specific mathematical operation applied to matrices. Activation functions are interesting because originally they were more biologically inspired and people used sigmoid. Now people tend to use simpler ones like ReLU or its leaky cousin. Turns out what’s important is creating nonlinearities. Hallucinations in LLMs have to do with the fact that they’re statistical models not grounded in reality. Evolutionary algorithms, I will give you that one although they’re way less common than backprop. |
> the brain does not do back propagation
Do we know this? Ruling this out is tantamount to claiming that we know how brains do learn. My suspicion is that we don't currently know, and that it will turn out that, e.g., sleep does something that is a coarse approximation of backprop.