| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by krisroadruck 3146 days ago
	Assuming its sending it as audio, and not as transcribed text which is both smaller and also much more compressible.

2 comments

Analog24 3146 days ago

ASR is a hugely complex process that is handled by ML algorithms on Amazon's servers. The echo simply does not have the hardware to handle this on it's own.

link

krisroadruck 3146 days ago

Is it though? Not trying to be argumentative but I remember using dragon naturally speaking to do voice dictation way back in like 98 on a processor that makes today's average smartphone look like a supercomputer. I thought all the ML stuff was for figuring out context and the like, but straight transcription?

link

nerpderp83 3146 days ago

Modern voice codecs are extremely compact. An annotated text representation of voice will take up equivalent space.

link