Hacker News new | ask | show | jobs
by lifestil 3590 days ago
LHC data = EPIC
1 comments

This is one of those few moments when saying Big Data is actually a well-used term.
There's a really great episode of Linear Digressions[0] (a Data Science podcast) that goes into the different scales of data that exist in the world. Everyone thinks of Google and Facebook as Big Data, but the Australian Square Kilometer Array Pathfinder collects 7.5B TB of data per second!

[0]: https://soundcloud.com/linear-digressions/whats-the-biggest-...

And it's going to end up quite a bit more than that.

Check out slides 21 and 22 from [1]. There are parts that will process 4 PB/s (!)

[1] http://www.slideshare.net/SparkSummit/distributed-data-proce...

It's not a secret that in the process of automatic manufacturing there are terabytes of data produced in a matter of minutes. OF course how many of such data is of any use is another story, but anyway it still requires storage and processing power capable of handling that volumes.