|
|
|
|
|
by wooorm
4272 days ago
|
|
You are completely right, franc doesn’t state how language are detected. The detection is based on (1) unicode-script usage and (2) trigram-counts. Some scripts are only used by one language. Other scripts, such as Cyrillic, come with many more: those are detected by the top 300 trigrams per their corresponding UDHR (Universal Declaration of Human rights, the most translated document). Shameless plug, the page clearly stated you can fork franc to support 300+ languages ;) |
|