Hacker News new | ask | show | jobs
by haddr 3590 days ago
This is one of those few moments when saying Big Data is actually a well-used term.
1 comments

There's a really great episode of Linear Digressions[0] (a Data Science podcast) that goes into the different scales of data that exist in the world. Everyone thinks of Google and Facebook as Big Data, but the Australian Square Kilometer Array Pathfinder collects 7.5B TB of data per second!

[0]: https://soundcloud.com/linear-digressions/whats-the-biggest-...

And it's going to end up quite a bit more than that.

Check out slides 21 and 22 from [1]. There are parts that will process 4 PB/s (!)

[1] http://www.slideshare.net/SparkSummit/distributed-data-proce...

It's not a secret that in the process of automatic manufacturing there are terabytes of data produced in a matter of minutes. OF course how many of such data is of any use is another story, but anyway it still requires storage and processing power capable of handling that volumes.