by briggers
2481 days ago
I see from the MTCNN code that this repo (like every other one I've seen) still bounces tensors between GPU and CPU when passing data between the P-Net, R-Net, and O-Net stages. So many ML repos make this mistake in pre/post-processing and end up bottlenecked on the CPU. Does anyone know of an MTCNN port that runs more or less fully on the GPU? (Or even one that batches instead of processing image by image?)
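To make the batching idea concrete, here is a rough sketch of the bookkeeping it would need: candidates from every image are flattened into one list with an owner index, scored in a single call, then regrouped per image. This is plain Python and not taken from any MTCNN repo. `run_stage_batched`, `score_fn`, and the `(x1, y1, x2, y2)` box layout are hypothetical names chosen for illustration. A GPU version would presumably keep `flat` and `owner` as device tensors so nothing has to return to the CPU between stages.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (x1, y1, x2, y2); hypothetical layout


def run_stage_batched(
    boxes_per_image: Sequence[Sequence[Box]],
    score_fn: Callable[[List[Box]], List[float]],
    threshold: float,
) -> List[List[Box]]:
    """Score candidates from all images in one call, then regroup per image."""
    flat: List[Box] = []
    owner: List[int] = []  # owner[i] is the index of the image flat[i] came from
    for img_idx, boxes in enumerate(boxes_per_image):
        flat.extend(boxes)
        owner.extend([img_idx] * len(boxes))

    # One call for the whole batch; score_fn stands in for a network stage
    # (an assumption for this sketch, not a real MTCNN API).
    scores = score_fn(flat) if flat else []

    kept: List[List[Box]] = [[] for _ in boxes_per_image]
    for box, img_idx, score in zip(flat, owner, scores):
        if score >= threshold:
            kept[img_idx].append(box)
    return kept
```

The owner index is what lets one forward pass serve many images: the per-image split only happens after scoring, so the stage itself never loops over images.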
Example in rust: https://cetra3.github.io/blog/face-detection-with-tensorflow...