|
|
|
|
|
by riffraff
4273 days ago
|
|
the question would be where he got the language data If the original language data is available I'd suggest classifying the trigrams as "high" and "low" frequency, which should improve performance without needing to keep full frequency data. |
|