Hacker News new | ask | show | jobs
by ajtulloch 4254 days ago
Caffe (and other frameworks) provide exactly this. It's basically:

1) To setup, load a pre-trained AlexNet/Overfeat/other architecture model (e.g. trained on ILSVRC2012)

2) To get a vector from an image, run a forward pass on the images, and extract the activations at a given layer (e.g. fc7) as the output vector.

http://caffe.berkeleyvision.org/gathered/examples/feature_ex... is a step-by-step walkthrough.

There's a lot of mystique around deep learning and these kind of problems, but it's not _that_ difficult to use these tools.