|
|
|
|
|
by e-dt
1694 days ago
|
|
Syllables should always have only one vowel (technically nucleus) in them, which means that the bogus "syllables" the website is producing can't be real syllables - that might be the rule that you can't articulate. However, the website actually does not use GPT-2 to produce the syllables, it uses the library "pyhyphen". This library provides (misleadingly) a function claiming to create a list of syllables from a word, which does not actually do that and instead splits the word up by all possible hyphenation points - and not every syllable break gets or indeed should get a hyphenation point. (For example, you do not want to break up prefixes or suffixes during hyphenation.) [0] https://github.com/turtlesoupy/this-word-does-not-exist/blob... [1] https://github.com/dr-leo/PyHyphen/blob/master/src/hyphen/hy... |
|