by madsravn 4117 days ago
Very exciting stuff. I love how you can take simple building blocks and create something elegant and fun with them.

However, why are there words more similar to "vacation" than "vacation"?

Thanks! The word 'vacation' is just removed from the list since it's exactly what we're looking for.
It's not removed from the list -- it is second from the bottom. madsravn's question is a good one.
The one in the list includes a period after it, so I believe it is just a case of slightly dirty data.
Good observation -- I missed that (obviously). They seem to be using data from the word2vec project, so I would guess that it is intentional rather than a lack of cleaning.
But how come "vacation." ranks as less similar to "vacation" than other words do?
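
To make the point about the trailing period concrete, here is a minimal sketch, assuming the demo ranks words by cosine similarity and that "vacation" and "vacation." are stored as two separate tokens. The vectors below are made-up toy values, not the actual word2vec data, and `most_similar` is a hypothetical helper, not the demo's code.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Toy embeddings, invented for illustration only.
# "vacation" and "vacation." are distinct keys with distinct vectors.
vectors = {
    "vacation": [0.9, 0.1, 0.3],
    "vacation.": [0.6, 0.5, 0.2],
    "holiday": [0.85, 0.15, 0.35],
    "spreadsheet": [-0.2, 0.9, 0.1],
}


def most_similar(query, vectors):
    """Rank every word except the query itself by similarity to the query."""
    q = vectors[query]
    scored = [
        (word, cosine_similarity(q, vec))
        for word, vec in vectors.items()
        if word != query  # the exact query token is dropped from the results
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for word, score in most_similar("vacation", vectors):
        print(f"{word!r}: {score:.3f}")
```

Under these assumptions, a word compared with itself always scores 1, so the exact token "vacation" would top the list if it were not filtered out. "vacation." is a different token with its own vector, so it is not filtered and can land anywhere in the ranking, including below unrelated-looking words.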