|
|
|
|
|
by wrs
618 days ago
|
|
With a ReLU activation function, rather than a simple linear function of the inputs, you get a piecewise linear approximation of a nonlinear function. ReLU enables this by being nonlinear in a simple way, specifically by outputting zero for negative inputs, so each linear unit can then limit its contribution to a portion of the output curve. (This is a lot easier to see on a whiteboard!) |
|