|
|
|
|
|
by threeseed
1850 days ago
|
|
Would add that HDFS was a particular nightmare to manage. You had to worry about the size of files since the NameNode would be overloaded. Being a Java app running on the older JVMs it would do a full GC under heavy load and cause failovers. And it was impossible to get data in/out from outside the cluster using third party tools. I remember many companies seeing S3 and just being in shock that it was so cheap, limitless and that someone else was going to manage it all. |
|
I think there are still a couple use-cases where HDFS dominates S3 (I think some HBase workloads?). But yeah, I scaled up and maintained a 2000+ Hadoop cluster for years, and I would never choose it over object storage if given any plausible alternative.