Hacker News new | ask | show | jobs
by artninja1988 385 days ago
Sounds absolutely amazing, like 99% indistinguishable from real professional voice actors to me. I couldn't find any pricing though. Anyone know what they charge for it?
3 comments

> Public API for Eleven v3 (alpha) is coming soon. For early access, please contact sales.

I suspect they themselves don't know the exact pricing yet and want to assess demand first.

Ouch. Professional Voice Actor here.
As a user of audible, I do follow some authors but I've found better luck following certain voice actors. It's almost like the voice actor is the critic, and by narrating a story they are recommending it to me. Anybody can take a robot voice and apply it to anything, meaning that just because my favorite robot voice "Robot McRobot" read book XYZ doesn't mean I'll enjoy book XYZ. But because your voice is inherently scarce, you are only likely to read books that "work" for you.

I don't know what the process is for matching voice actor to book, but that process is inherently constrained because the voice belongs to a real human, and I enjoy the output of that process.

That said, while Audible is kind of expensive, I'm afraid that they'll reduce their price and move to robot voices and I'll lose interest entirely despite the cheaper price.

Just here to say the oposite. It is astonshing how far away it still is from a professional voice actor while being really good. Emotion is completely missing. Instead it seems to try to hard to express exactly that. I cant really put my finger on it. It feels predictable, flat and the timing is strange.
Better by a mile than most anime voice work, but lacks the detail that a good voice narrator has on an audio book.
Yes, I couldn't bear this for an entire audio book.
Wait until you hear Burt Reynolds and Richard Feynman narrate the Fifty Shades of Gray.
Fifty Shades of Butthead
I think the voices are impressive, yet still uncanny and awkward. I don't want to hear them ever outside of the passing fascination of witnessing technological progress.

Frankly I like the arts strictly because they're expressed by humans. The human at the core of all of it makes it relatable and beautiful. With that removed I can't help wondering why we're doing it. For stimulation? Stimulation without connection? I like to actually know who voice actors are and follow their work. The day machines are doing it, I don't know. I don't think I'll listen.

Is only good if you are doing any type of quick AI slop like TikTok
Not for long. Sorry
Time to license your voice to Elevenlabs and sit back and enjoy the good life!
But it's not an actual person. It's an "AI". Do you want a future where you don't hear actual people anymore? I want to listen to music, audiobooks, poetry, novels, plays, with actual humans talking, that's the whole fucking point.
I feel like you're conflating the act of creation (writing a book) versus the act of performance (narrating the book). For the former I agree with you, but for the latter? Shrug.

Personally I have hundreds of old texts that simply do not have an audio book equivalent and using realistic sounding TTS has been perfectly adequate.

What difference does it make?
Are you seriously even asking that question?

It’s like having a robot that can give you a hand-job and someone saying, “well it’s a robot…” and you saying “what difference does it make?”

You tell me? What difference does it make talking with an old friend versus an ai simulation of an old friend?

What difference does it make seeing the artist who actually painted something talking about why they painted it, versus get sent an image an ai made in stable diffusion?

The difference is we are human and live in a society with other humans and we make connections with them because of their personalities, experiences, life story, emotions etc.

Perhaps you’re ok with staying alone at home with ai friends and ai generated everything but it seems quite strange to me.

>It’s like having a robot that can give you a hand-job and someone saying, “well it’s a robot…” and you saying “what difference does it make?”

What difference does it make? Also, where can I buy one?

I know a man who was pissed off after realizing the personalized-looking emails from the bank was machine generated. What do you think about those?
Are you suggesting that you can compare a formulaic bank email to your mom reading you a bedtime story? I'm not sure you can connect those two things.

Of course, when I go and check my balance at an ATM machine, I don't mind that an actual person isn't reading me the balance. But this isn't an area where we appreciate or want another human being involved.

If you're a "normal", "well adjusted" human being, you appreciate other people, being around them, having friends, lovers, companions, talking to other humans, hearing their actual voices, getting advice and giving advice, hearing someone say "I love you" or "I appreciate you" etc. If you're a "normal", "well adjusted" human being, you will probably feel much less from having an AI voice tell you "I love you".

Of course, if you don't mind never hearing actual human voices again, and prefer just AI talking to you, then sure, go live in your shack and listen to ElevenLabs voices for the rest of your life.

I promise this comment will circle back to Elevenlabs:

When my cat died after a few months of cancer treatment, the staff of the animal hospital sent me a condolence card with comments by staff members.

On the one hand, this was a very touching, very human thing to do. On the other hand, this was presumably a work assignment that had to be passed around and completed for staff members to meet their employer's goals, while juggling the other medical and administrative duties at the animal hospital.

So whether this was a good thing or bad thing might depend on how taxing you view it from the staff member's POV.

With the audio book market: it's kind of a similar dichotomy. There's undoubtedly more human touch in the style an audio book is read by an actual human. (Though if that human touch is "stuttering awkwardly because I'm very self aware as I read, you probably wouldn't want to buy my audio book...)

However, for a human to make an audio book, you are asking someone to sit in a room for many hours, being careful not to stutter as they work through a book. If there's joy in that, maybe you see Elevenlabs as an evil company eliminating the human touch in audiobooks. If it's soulless labor, why not replace it with a machine?

I believe the op's comment was along the lines of "what difference does it make - if you can't tell the difference how can you say it makes a difference?"

To be followed up with the questions of "how will you be able to tell?" and "what are you going to do about it?"