Hacker News new | ask | show | jobs
by is_true 712 days ago
I had to do this with sport teams but I used levenshtein for the names. I ended up creating a vector with other data (country, gender) and using that vector to calculate the distance (similarity).