|
|
|
|
|
by tempnow987
1523 days ago
|
|
Is this true? I thought dropbox moved their 30PB+ data lake ONTO aws to get off of Hadoop or something because trying to do this on-prem, even with tons of tech talent and money, was not working. They complained about onprem requiring 3 YEAR forecasts for capacity planning given their scale. Here is what they said in 2020 for benefits of AWS: --------------------- Hosts 40 PB of analytics data and supports 1 PB of data growth a month
Optimizes costs by moving cold data to Amazon S3 Glacier Deep Archive
Uses Amazon EC2 Spot Instances for 15–50% of compute capacity
Doubles compute footprint using Amazon EC2 Spot Instances
Enables the testing of new technologies without damaging data or affecting users
Improved performance by six times for some job types
Deletes hundreds of files in a few seconds compared to 30–40 minutes
Runs more than 100,000 analytics jobs and tens of thousands of one-time jobs daily --- https://www.youtube.com/watch?v=6x-XGJQwk2M Maybe this has changed since 2020 |
|