|
|
|
|
|
by rawnlq
3447 days ago
|
|
If you look at cantonese, an older chinese dialect, you will find 9 tones: https://en.wikipedia.org/wiki/Cantonese_phonology#Tones Particularly useful in that section is this graph: https://en.wikipedia.org/wiki/Cantonese_phonology#/media/Fil... And I think that's a great way to visualize it. Each word has it's own tone contour and it just so happens that a lot of them could be clustered and curve-fitted to match a small number of tones. But the number of curves you choose is arbitrary just like how choosing the number of clusters in k-means is not well defined. So if you treat the pinyin as authoritative you will sound robotic compared to a native speaker who learns each word's actual tone curve naturally. Just like how if you believe english is a phonetic language you will pronounce lots of things wrong. (e.g., as an English learner once I learned how to spell "doubt" I kept trying to enunciate the "b" since I thought the pronunciation I learned from hearing, "dout", was wrong) Disclaimer: I'm a native cantonese speaker so my knowledge about tones is bullshit. Just hypothesizing why text-to-speech and people learning to speak it always sound funny. |
|