by blandcoffee 1167 days ago
Can you expand further on what you mean by this?
1 comment

The SAM model itself is small (4M params), but it requires an image embedding to be computed by what I think is a 600M-param model. Right now the demo uploads the image to get the embedding, then runs the actual segmentation locally.
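
To make the split concrete, here's a toy sketch of the pattern (not the demo's actual code): an expensive encoder runs once per image, and a cheap decoder runs on every click against the cached embedding. Everything in it is a made-up stand-in: `fake_remote_encoder`, `local_decoder`, and `SegmentationSession` are hypothetical names, the "embedding" is just normalized pixel values, and the "segmentation" is a simple threshold around the clicked cell.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

Image = List[List[int]]
Embedding = List[List[float]]  # toy stand-in for the real embedding tensor
Mask = List[List[bool]]


def fake_remote_encoder(image: Image) -> Embedding:
    """Hypothetical stand-in for the large server-side encoder.

    Here it only scales pixel values into [0, 1]; the real model is far heavier.
    """
    peak = max(max(row) for row in image) or 1  # avoid dividing by zero
    return [[px / peak for px in row] for row in image]


def local_decoder(embedding: Embedding, point: Tuple[int, int], tol: float = 0.1) -> Mask:
    """Hypothetical stand-in for the small local model.

    Marks every cell whose value is within `tol` of the clicked cell's value.
    """
    row, col = point
    ref = embedding[row][col]
    return [[abs(v - ref) <= tol for v in r] for r in embedding]


@dataclass
class SegmentationSession:
    encoder: Callable[[Image], Embedding]
    encoder_calls: int = 0
    _cache: Dict[str, Embedding] = field(default_factory=dict)

    def embedding_for(self, image_id: str, image: Image) -> Embedding:
        # The expensive step: done once per image, then reused.
        if image_id not in self._cache:
            self.encoder_calls += 1
            self._cache[image_id] = self.encoder(image)
        return self._cache[image_id]

    def segment(self, image_id: str, image: Image, point: Tuple[int, int]) -> Mask:
        # The cheap step: runs locally for every prompt.
        return local_decoder(self.embedding_for(image_id, image), point)


image = [[0, 0, 200], [0, 200, 200]]
session = SegmentationSession(encoder=fake_remote_encoder)
dark_mask = session.segment("img-1", image, (0, 0))
bright_mask = session.segment("img-1", image, (1, 2))
```

Two clicks on the same image hit the encoder only once, which is why doing the upload up front and keeping the per-click work in the browser is a reasonable trade.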