Hacker News new | ask | show | jobs
by manigandham 2309 days ago
S3 is just storage. It doesn't provide any querying, crawling, metadata, provenance, or other details required for data at scale.

That's why AWS has entire product suites from Athena, Redshift Spectrum, Data Lake Formation, Glue, etc to help companies actually do something with the files stored in S3. And it's often a mess compared to just fixing their processes and ingesting it properly into a SQL data warehouse first.