Hacker News new | ask | show | jobs
by stan_btd 2129 days ago
The paper that presents this algorithm has the best known accuracy on the widerface dataset, which is why it is called state of the art. The authors of the paper published an implementation of this algorithm based on mxnet, but a lot people and companies use tensorflow instead of mxnet in their work, so just using the mxnet implemenation is not an option. Thats why I converted it to TF, with a slight decrease in accuracy on widerface. Then what is "acceptable" depends on your judgement, but the widerface dataset is extremely challenging, with many pictures having hundreds of small faces. In some of these pictures my implementation will miss a few faces, or find them but with lower probability. Overall for the vast majority of face detection applications, the two implementations will yield extremely similar results. I havent yet found a picture with a few dozen faces where the TF implementation performed not as well as the original one !
1 comments

Got it. I understand now. Don't be offended at my use of "acceptable", it was used in ignorance.
no offense taken at all !
Any thoughts on distilling it or pruning to make it smaller ?
would you be interested in a smaller model file size or in a model that runs faster ?

For model file size I have a few ideas of things that could be tried easily.

For a faster model, this model is based on resnet50 architecture. The original authors also published a lighter mobileNet architecture. I havent converted it to tensorflow but I could try to take a look