Y
Hacker News
new
|
ask
|
show
|
jobs
by
tim-tday
100 days ago
How specifically would that work? I’ve seen no framework for that happening.
1 comments
killbot5000
99 days ago
The output of the transformation layers are a collection of embeddings in the latent concept space. Those can be fed into an addition model to extract semantic segments, bounding boxes etc. IIUC this is how dinov3 works.
link