by nemothekid 1446 days ago
Arabic is the 5th or 6th most spoken language. I think the concern for low-resource languages is that nuances like that won't get picked up.

That's fair. I was mostly just responding to the parent comment's point that language models could run into difficulties with languages where men and women speak differently. I don't speak Dakota, though, so the gender-specific differences there may be more pronounced than in Arabic, where there's also a "default"/neutral option: just pick the masculine version of verbs unless you know the subject(s) are female.