|
|
|
|
|
by haarts
4749 days ago
|
|
Storing voice (audio) data is not what the article says. I'd imagine you transcribe the audio to text and search in that. Storing text is incredibly easy. Besides you can throw away 99.9% of the data almost immediately. I'm actually curious how much text data this would be per day; number of call minutes * average number of words per minute. I'd be surprised if that wouldn't fit in a reasonable cluster. |
|
Transcribing all voice calls to text in the Netherlands computationally could easily be two orders of magnitude more difficult than Google voice.