Hacker News new | ask | show | jobs
by frakt0x90 95 days ago
They created this in service of their video generation model which "clusters and reorders tokens based on semantic similarity using k-means.":

http://arxiv.org/pdf/2505.18875

1 comments