by th-ai 3748 days ago
Does anyone know of an open TTS (speech synthesis) engine that provides delay (duration) timings for each word spoken? Thanks.
1 comment

Most of them make this information available in some form, for example OpenMary and Festival. In Festival, you can access the markup of a synthesized utterance through the utt.* functions. For example, this synthesizes an utterance and saves its segment labels, with their timings, to a file: (utt.save.segs (utt.synth (Utterance Text "Hello world")) "out.seg")
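
To turn a label file like out.seg into durations, something along these lines might work. It is only a sketch, and it assumes the file uses the xlabel-style layout: a header terminated by a line containing only "#", then one line per label of the form "end_time color label". The function name parse_labels and the sample values are made up for illustration, so check them against a real file from your Festival install.

```python
from typing import List, Tuple


def parse_labels(text: str) -> List[Tuple[str, float, float]]:
    """Return (label, start, duration) tuples from xlabel-style text.

    Assumed layout (not verified against every Festival version):
    header lines, then a line containing only '#', then lines of
    'end_time color label'.
    """
    entries = []
    in_body = False
    prev_end = 0.0
    for raw in text.splitlines():
        line = raw.strip()
        if not in_body:
            # Skip everything up to and including the '#' header terminator.
            if line == "#":
                in_body = True
            continue
        parts = line.split()
        if len(parts) < 3:
            continue  # ignore blank or malformed lines
        end = float(parts[0])
        label = parts[2]
        # Each label starts where the previous one ended.
        entries.append((label, prev_end, end - prev_end))
        prev_end = end
    return entries


# Illustrative values only, not real Festival output.
sample = "#\n0.2200 100 pau\n0.3100 100 h\n0.3650 100 ax\n"
for label, start, duration in parse_labels(sample):
    print(f"{label}\t{start:.3f}\t{duration:.3f}")
```

This gives per-segment (phone) durations. For per-word timings you would need word-level labels instead; if they are saved in the same layout, the same parsing should apply.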