Hacker News new | ask | show | jobs
by llama_person 768 days ago
https://huggingface.co/spaces/google/paligemma/blob/main/pal...

the blog post details it but essentially to convert from PaliGemma tokens to bbox:

y0 / 1024 * h

x0 / 1024 * w

y1 / 1024 * h

x1 / 1024 * w

have not played with segmentation yet.