|
|
|
|
|
by drdeca
942 days ago
|
|
Well, at all but a finite number of points (specifically all but one point), there is a neighborhood of that point at which ReLU matches a linear function... In one sense, that seems rather close to being linear. If you take a random point (according to a continuous probability distribution) , then with probability 1, if look in a small enough neighborhood of the selected point, it will be indistinguishable from linear within that neighborhood. And, for a network made of ReLU gates and affine maps, still get that it looks indistinguishable from affine on any small enough region around any point outside of a set of measure zero. So...
Depends what we mean by “almost linear” I think. I think one can make a reasonable case for saying that, in a sense it is “almost linear”. But yes, of course I agree that in another important sense, it is far from linear. (E.g. it is not well approximated by any linear function) |
|