Good to know. We're trying to attempt something similar[1] but for Tamil. I'm also surprised how well the OSS language model & library AI4Bharat [2] performs for NLP tasks against SoTA systems.
Is there a way to contact you?
[1] https://vpt.ai/posts/about-us/
[2] https://ai4bharat.org/projects/
I don't see how I can be of help.
But I can talk. Leave me something through which I can reach you. And I will reach you within a week.