Hacker News
by briggers 2096 days ago
Nice one.

More useful to me would be something similar that operates on tensors on the GPU.

Doing image annotations on host/CPU often becomes a bottleneck.

Out of curiosity, what throughput do you have that would require GPU labelling?

The most resource-heavy bit is text rendering, I guess, but that could be cached per class name and reduced to a memcpy. Otherwise, drawing rectangles is quick enough on a CPU that I'd imagine the memory transfer to the GPU is comparable to the draw ops?
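
For what it's worth, here is a rough sketch of the caching idea, not something I have benchmarked. Each class name is rasterized once into a small patch, and every later use is just a slice copy into the frame. `render_label`, `LabelCache` and `blit` are names made up for this example, and the font, scale and padding are arbitrary assumptions.

```python
import numpy as np


def render_label(text: str) -> np.ndarray:
    """Rasterize text into a small BGR patch with OpenCV (the slow step)."""
    import cv2  # imported lazily so the cache/blit logic works without OpenCV

    font = cv2.FONT_HERSHEY_SIMPLEX
    (w, h), baseline = cv2.getTextSize(text, font, 0.5, 1)
    # 2 px of padding on every side (arbitrary choice for this sketch).
    patch = np.zeros((h + baseline + 4, w + 4, 3), dtype=np.uint8)
    cv2.putText(patch, text, (2, h + 2), font, 0.5, (255, 255, 255), 1, cv2.LINE_AA)
    return patch


class LabelCache:
    """Hypothetical helper: renders each class name at most once."""

    def __init__(self, render=render_label):
        self._render = render
        self._patches = {}

    def get(self, name: str) -> np.ndarray:
        patch = self._patches.get(name)
        if patch is None:
            patch = self._render(name)
            self._patches[name] = patch
        return patch


def blit(image: np.ndarray, patch: np.ndarray, x: int, y: int) -> None:
    """Copy patch into image with its top-left corner at (x, y), clipped to the frame."""
    h, w = patch.shape[:2]
    H, W = image.shape[:2]
    x0, y0 = max(x, 0), max(y, 0)
    x1, y1 = min(x + w, W), min(y + h, H)
    if x0 >= x1 or y0 >= y1:
        return  # patch lies entirely outside the image
    image[y0:y1, x0:x1] = patch[y0 - y:y1 - y, x0 - x:x1 - x]
```

Per frame that would be `blit(frame, cache.get(class_name), x, y)` for each box, so the font rasterizer only runs the first time a class name shows up.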

I've got OpenCV down to around 10 ms per image (single thread, Python) without the caching idea I mentioned above.

It could be nice to avoid the overhead of moving the GPU box tensors to the CPU, potentially.
Interesting idea! I'll have to do some studying to work out how to make that happen, so I doubt I'll be adding that option anytime soon, but I'll put it in the icebox.
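
One way this might look, as a sketch only: build the outlines from broadcast comparisons so the box coordinates never have to be read back one by one. It is written with NumPy so it runs anywhere. The assumption is that the same arange/compare/mask pattern carries over to a GPU tensor library, which I have not verified here. `draw_boxes` is a made-up name, and boxes are assumed to be integer `(x0, y0, x1, y1)` with exclusive `x1`/`y1`.

```python
import numpy as np


def draw_boxes(image, boxes, color, thickness=2):
    """Paint box outlines into an (H, W, 3) image in place, without a Python loop.

    boxes: (N, 4) integer array of x0, y0, x1, y1 (x1 and y1 exclusive).
    """
    H, W = image.shape[:2]
    ys = np.arange(H).reshape(1, H, 1)
    xs = np.arange(W).reshape(1, 1, W)
    # Each coordinate becomes (N, 1, 1) so it broadcasts against the pixel grid.
    x0, y0, x1, y1 = (boxes[:, i].reshape(-1, 1, 1) for i in range(4))

    t = thickness
    outer = (xs >= x0) & (xs < x1) & (ys >= y0) & (ys < y1)
    inner = (xs >= x0 + t) & (xs < x1 - t) & (ys >= y0 + t) & (ys < y1 - t)

    # A pixel is on an outline if it is inside some box but not in that box's interior.
    border = (outer & ~inner).any(axis=0)  # shape (H, W)
    image[border] = color
    return image
```

The trade-off is memory: the intermediate masks are N x H x W booleans, so for many boxes on large frames it would probably need to be chunked.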

Thank you for the comment!