Hacker News new | ask | show | jobs
by yarg 894 days ago
Probably not the best in terms of efficiency.

Easier just to deliberately overshoot (with a too high k) and then merge any clusters with too much overlap.