|
|
|
|
|
by GolDDranks
912 days ago
|
|
I'm attempting to create a frequency list of words for language learners. (In Japanese.) Commonly, these lists are based in just what word appears in the text at "surface" level. However, words commonly have multiple "senses" or nuances of meaning in which they are used. Dictionaries list these senses, but it has been traditionally hard to disambiguate which sense the word is used in, given an usage in text. LLM's make this feasible, so I'm attempting to create a word sense/usage frequency list. |
|
https://fasttext.cc/docs/en/crawl-vectors.html
https://news.ycombinator.com/item?id=13771292 (6 years ago)
Aligning the fastText vectors of 78 languages
https://github.com/babylonhealth/fastText_multilingual/blob/...