Hacker News
by dangoodmanUT 205 days ago
I was trying to figure this out from their examples, but couldn't: how are you breaking up the different "things" that you can detect in the image? Are you just running the model once per prompt?
1 comment

The model supports batch inference, so all prompts are sent to the model in a single batch rather than one call per prompt, and we then parse the results for each prompt.
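
Roughly, that pattern could look like the sketch below. This is not the actual code: the thread doesn't name the model's API, so `run_batch`, its output shape (one list of `(x0, y0, x1, y1, score)` rows per prompt), and the `min_score` cutoff are all assumptions made up for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Sequence

# Hypothetical raw output for one prompt: rows of (x0, y0, x1, y1, score).
RawRows = list[tuple[float, float, float, float, float]]


@dataclass
class Detection:
    label: str  # the prompt that produced this box
    box: tuple[float, float, float, float]
    score: float


def detect_all(
    image: object,
    prompts: Sequence[str],
    run_batch: Callable[[list[object], list[str]], list[RawRows]],
    min_score: float = 0.5,
) -> list[Detection]:
    """Send every prompt in one batched call, then parse the per-prompt output."""
    # One batch entry per prompt, all pointing at the same image.
    raw = run_batch([image] * len(prompts), list(prompts))
    if len(raw) != len(prompts):
        raise ValueError("expected one result list per prompt")

    detections: list[Detection] = []
    for prompt, rows in zip(prompts, raw):
        for x0, y0, x1, y1, score in rows:
            if score >= min_score:  # drop low-confidence boxes
                detections.append(Detection(prompt, (x0, y0, x1, y1), score))
    return detections


def fake_run_batch(images: list[object], prompts: list[str]) -> list[RawRows]:
    """Stand-in for a real model call (assumption): returns canned boxes."""
    canned: dict[str, RawRows] = {
        "cat": [(0.0, 0.0, 10.0, 10.0, 0.9), (1.0, 1.0, 2.0, 2.0, 0.2)],
        "dog": [(5.0, 5.0, 20.0, 20.0, 0.7)],
    }
    return [canned.get(p, []) for p in prompts]


if __name__ == "__main__":
    for det in detect_all("image.png", ["cat", "dog", "bird"], fake_run_batch):
        print(det)
```

The point is just that the loop over prompts happens when parsing the output, not when calling the model, so each "thing" is still tied back to the prompt that found it.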