Hacker News new | ask | show | jobs
by mrcactu5 3501 days ago
massive is an understatement. I have only dealt with puny GB sized data sets. They deal with vectors which cannot fit into main memory.
1 comments

Yes, in general what they refer to are things like the IRS Tax records (250 TB), Yahoo Ad data (900 TB). You just can't use a single machine to work with such data.