by karpathy
3725 days ago
This is very nice! I think the reason the swiss roll doesn't work as easily might be initialization. In 2 dimensions you have to be very careful when initializing the weights and biases, because small networks get stuck in bad local minima more easily.
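One way to check how much initialization matters is to train the same tiny network several times and compare where it ends up. The following is only a minimal sketch under stated assumptions, not the code behind the demo being discussed: it uses a two-spiral toy dataset as a stand-in for the swiss roll, a one-hidden-layer tanh network written in NumPy, and made-up names and hyperparameters (`make_two_spirals`, `train_tiny_mlp`, `weight_scale`, `lr`, and so on).

```python
import numpy as np


def make_two_spirals(n_per_class=100, turns=1.5, noise=0.02, seed=0):
    """Hypothetical 2-D stand-in for the swiss roll: two interleaved spirals."""
    rng = np.random.default_rng(seed)
    t = np.linspace(0.1, 1.0, n_per_class)        # radius grows along the arm
    angle = 2 * np.pi * turns * t
    arm = np.stack([t * np.cos(angle), t * np.sin(angle)], axis=1)
    # The second class is the first arm rotated by 180 degrees.
    X = np.concatenate([arm, -arm])
    X = X + noise * rng.standard_normal(X.shape)
    y = np.concatenate([np.zeros(n_per_class), np.ones(n_per_class)])
    return X, y


def train_tiny_mlp(X, y, hidden=8, weight_scale=1.0, seed=0, steps=2000, lr=0.5):
    """Train a 2 -> hidden -> 1 tanh network with full-batch gradient descent.

    Returns the final mean binary cross-entropy. All hyperparameters here are
    illustrative assumptions, not tuned values.
    """
    rng = np.random.default_rng(seed)
    # Only the initialization depends on `seed` and `weight_scale`.
    W1 = weight_scale * rng.standard_normal((2, hidden))
    b1 = np.zeros(hidden)
    W2 = weight_scale * rng.standard_normal((hidden, 1))
    b2 = np.zeros(1)
    targets = y.reshape(-1, 1)
    n = len(X)

    for _ in range(steps):
        # Forward pass.
        h = np.tanh(X @ W1 + b1)
        logits = h @ W2 + b2
        p = 1.0 / (1.0 + np.exp(-logits))
        # Backward pass for mean binary cross-entropy.
        dlogits = (p - targets) / n
        dW2 = h.T @ dlogits
        db2 = dlogits.sum(axis=0)
        dpre = (dlogits @ W2.T) * (1.0 - h ** 2)
        dW1 = X.T @ dpre
        db1 = dpre.sum(axis=0)
        # Plain gradient descent step.
        W1 -= lr * dW1
        b1 -= lr * db1
        W2 -= lr * dW2
        b2 -= lr * db2

    # Numerically stable cross-entropy: log(1 + e^z) - y * z.
    logits = np.tanh(X @ W1 + b1) @ W2 + b2
    return float(np.mean(np.logaddexp(0.0, logits) - targets * logits))
```

Running something like `for seed in range(5): print(seed, train_tiny_mlp(X, y, seed=seed))` after `X, y = make_two_spirals()`, and repeating it for a few values of `weight_scale`, shows how much the final loss varies when only the starting point changes. A wide spread across seeds would be consistent with the network settling into different local minima.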
But that technique would not work when you cannot see that the data is a "swiss roll", or when it has more than two dimensions.