Hacker News new | ask | show | jobs
by syrrim 2802 days ago
>listen at 2x speed

I'm trying 1.5x, but then when he is speaking it is too fast

>skip ahead

I skipped the introduction, but the standstill periods are too short, and you have no way of knowing when something interesting will happen, so skipping ahead entails missing things. This is where a copy of the slides is particularly useful: you can identify those slides you don't comprehend, or of particular interest, and sit through the speaking just for these portions.

1 comments

Listening to things fast is a skill - you can do it but it requires practice. If you know any blind/vision impaired people who use screen readers you should ask them to use speaker for a bit while they’re browsing/navigating etc.

It is absolutely incredible how fast some of them have the audio going at. (It also makes any pausing or lag in your application excrutiating. It turns out 10ms being “noticeable” is more a factor of how you interact with an application.

They've practiced with their specific text-to-speech program. The more familiar you are with a voice the faster you can understand it. The skill won't completely transfer to different speakers.
For their standard reading speed that true (but it’s still an amazing thing to hear, and I think more devs should communicate with blind people just so they can grasp the sort of latencies involved).

I would expect that the skill does translate to them being able to understand arbitrary speakers sped up more than a regular person could manage. I’d be curious if any HN readers using screen readers could confirm/tell me I’m talking nonsense :)

>I would expect that the skill does translate to them being able to understand arbitrary speakers sped up more than a regular person could manage. I’d be curious if any HN readers using screen readers could confirm/tell me I’m talking nonsense :)

I use a text to speech program at ~850wpm and I also watch lectures sped up to x4 regular speed. So yes, it does generalize.

I think it may also be the case that (old-school, non-concatenated but generated) synthesized speech, with practice, is more intelligible at higher speeds than natural speech. The sounds are more distinct and consistent.
The speech synthesis for screen readers is heavily tailored for that job (I remember the emphasis put on the new voiceover voices on Mac many years ago, and they apparently were noticeably superior to people who used them as screen readers at the high speed settings)