|
|
|
|
|
by tsumnia
5105 days ago
|
|
How does this compare to Microsoft's Old HTK (HMM Toolkit)? The language used on the website seems to point to a lot of the same things. Is this breaking it down to actual IPA phonemes? I'm mostly curious because I used the HTK for my thesis and would like to know how they compare (besides, one being just 'newer'). |
|
The main difference here is hooking DNN output to an HMM decoder, replacing GMMs, and possibly even more important the training process they use to get the DNN fairly efficiently. That's the biggest thing -- GMMs, at least the last time I've looked, can be trained and adapted much quicker than a DNN.