Hacker News new | ask | show | jobs
Veena – open-source TTS for Indian Languages (huggingface.co)
2 points by Dheemanthreddy 356 days ago
1 comments

Veena is a 3B parameter autoregressive transformer model based on the Llama architecture. It is designed to synthesize high-quality speech from text in Hindi and English, including code-mixed scenarios. The model outputs audio at a 24kHz sampling rate using the SNAC neural codec.