|
|
|
|
|
by microtonal
4022 days ago
|
|
Mikolov, et al. 2013 [1] do a proper evaluation of this. E.g. they found that the skip-ngram model has a 50.0% accuracy for semantic analogy queries and 55.9% accuracy for syntactic queries. word2vec comes with a data set that you can use to evaluate language models. [1] http://arxiv.org/pdf/1301.3781.pdf |
|
The only semantics that it tests are "can you flip a gendered word to the other gender", which is so embedded in language that it's nearly syntax; and "can you remember factoids from Wikipedia infoboxes", a problem that you could solve exactly using DBPedia. Every single semantic analogy in the dataset is one of those two types.
The syntactic analogies are quite solid, though.