Hacker News new | ask | show | jobs
by glxc 3806 days ago
13.5 TB - that is pretty huge!

Great to get some truly "Big Data" sets out there. I consider "Big Data" to be data that can't be conventionally processed on a commodity machine, else it's just analytics

Yahoo must be applauded for supplying various data sets and helping progress machine learning research

1 comments

I saw a course advertised in my email yesterday. Big data with MySQL. The description talked about queries and aggregate functions. That isn't big data - that's just "using a database" before the term "big data" appeared in the mainstream.
https://www.coursera.org/learn/analytics-mysql?utm_medium=em...

Ok, they are comparing MySQL to Excel, so in relative terms the data could be bigger...

:(

Ugh.. I had a boss years back who insisted on using "big data" to refer to our analytics and reporting work (which was nowhere near big data in terms of data size - we had maybe a million rows in our database across all our tables), and I fruitlessly tried for months to explain to him that anyone who really knows what "big data" means would immediately see through his bullshit..
I really wish we could get rid of the hipster / buzzword / fashionista aspect of our industry. Way too much churn as a result. I would far rather spend time honing SQL skills to perfection rather than having to learn another NoSQL database. Unfortunately job descriptions prefer the latter.