I wish there was an open/local tts model with voice cloning as good as 11l (for non-english languages even)