by mrjaeger 3590 days ago
There's a really great episode of Linear Digressions[0] (a Data Science podcast) that goes into the different scales of data that exist in the world. Everyone thinks of Google and Facebook as Big Data, but the Australian Square Kilometer Array Pathfinder collects 7.5B TB of data per second!

[0]: https://soundcloud.com/linear-digressions/whats-the-biggest-...

And it's going to end up quite a bit more than that.

Check out slides 21 and 22 from [1]. There are parts that will process 4 PB/s (!)

[1] http://www.slideshare.net/SparkSummit/distributed-data-proce...

It's no secret that automated manufacturing produces terabytes of data in a matter of minutes. Of course, how much of that data is of any use is another story, but it still requires storage and processing power capable of handling those volumes.