|
|
|
|
|
by peteforde
643 days ago
|
|
Really cool to see my favs show up, but I honestly don't understand what we're actually looking at; the groupings seem very opaque beyond very general themes like sci-fi, startups, biographies, math, physics. In other words, what are the clustering shapes telling us? Can we dig in based on geography, publishing date, key terms or themes? Either way, I can't keep the site open for more than 30-40 seconds before it crashes. I suspect that's not the goal! Is Cryptonomicon the best fiction book, or is the data wrong? |
|
IMHO it's a category error that results from tutorials using the king + female = queen example (which, funnily enough, wasn't even true for the original word2vec, if commentary I've read previously here is correct).
Working with them a lot has me picture them more as "a multivariate function that outputs 768 numbers, and was learned by brute force" than "something that sees in 768 dimensions" --- of course, they're both true, but the second interpretation shades more than it illuminates once you're past the very first interrogatory of "so what is this calculating, exactly?"