by ipsi
893 days ago
FWIW, English Wiktionary appears to have fewer words than German Wiktionary. I've run into this while trying to extract words from eBooks and then convert them to their "base" form, essentially to de-duplicate them. I think it's mostly compounds or more niche words, but I imagine you'd still run into them at least occasionally in most written works. There's a nice project for extracting the data from English Wiktionary and converting it to JSON, but it doesn't support any other languages, AFAIK, which is a bit of a shame but also not very surprising - Wiktionary is a lot more complex, technically, than I expected!
|
- the English Wiktionary has fewer English words than the German Wiktionary has German words, or
- the English Wiktionary has fewer German words than the German Wiktionary does?