|
|
|
|
|
by JonathanFly
626 days ago
|
|
Bark can sound as good, but Google is using SoundStorm which was specifically trained on dialogs. Surprisingly Bark can even sort of match it without being trained to do so, but not reliably. (https://x.com/jonathanfly/status/1675987073893904386) And SoundStorm has more than twice the context window of Bark so dialogs are a tight fit. |
|
When I tried my own text with it, it went completely off the rails... skipping completely over random words, and also switching to different voices in the middle of a sentence. Trying to run the large model also crashed entirely.