Hacker News
by oteytaud 2744 days ago
Sure, GA can be great for weights as well, but mainly when the gradient is unreliable. For example, I would not use Nevergrad to train the weights of a convolutional network for image classification, whereas I do use Nevergrad for WorldModels.
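
To make the "unreliable gradient" case concrete, here is a minimal sketch of a gradient-free search on a loss that is flat almost everywhere, so its gradient carries no signal. This is plain Python, not Nevergrad's API: the names `staircase_loss` and `one_plus_one_es`, the target vector, and the fixed step size are all assumptions made for illustration.

```python
import math
import random

# Hypothetical target "weights" for the toy problem (an assumption for illustration).
TARGET = [1.0, -2.0, 0.5]

def staircase_loss(weights):
    """Piecewise-constant loss: its gradient is zero wherever it is defined,
    so gradient descent gets no useful direction from it."""
    return sum(math.floor(abs(w - t) * 10) for w, t in zip(weights, TARGET))

def one_plus_one_es(loss, start, sigma=0.3, budget=2000, seed=0):
    """Simple (1+1) evolution strategy with a fixed mutation step size.

    Keeps a single parent, mutates it with Gaussian noise, and keeps the
    child whenever it is at least as good as the parent.
    """
    rng = random.Random(seed)
    best = list(start)
    best_loss = loss(best)
    for _ in range(budget):
        child = [w + rng.gauss(0.0, sigma) for w in best]
        child_loss = loss(child)
        # Accepting ties lets the search drift across flat plateaus.
        if child_loss <= best_loss:
            best, best_loss = child, child_loss
    return best, best_loss

if __name__ == "__main__":
    start = [0.0, 0.0, 0.0]
    best, best_loss = one_plus_one_es(staircase_loss, start)
    print("start loss:", staircase_loss(start))
    print("final loss:", best_loss, "at", [round(w, 2) for w in best])
```

The search only ever compares loss values, which is why it still makes progress here. On a smooth loss with millions of parameters, such as a convolutional classifier, backpropagation would normally be far more efficient.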
1 comment

Doesn't the model Uber used begin with a stack of convolutional layers, since it processes raw images?