by eevmanu
86 days ago
This reminded me of the article "Command-line Tools can be 235x Faster than your Hadoop Cluster" (2014): <https://adamdrake.com/command-line-tools-can-be-235x-faster-...>
Conclusion: Hopefully this has illustrated some points about using and abusing tools like Hadoop for data processing tasks that can better be accomplished on a single machine with simple shell commands and tools.
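To make the idea concrete, here is a minimal Python sketch of the single-machine, single-pass style of processing the article argues for. It is only an illustration, not the article's own pipeline: the function name `count_results` and the record format (lines such as `[Result "1-0"]`) are assumptions for the example.

```python
from collections import Counter
from typing import Iterable


def count_results(lines: Iterable[str]) -> Counter:
    """Tally the quoted value of each 'Result' line in one streaming pass."""
    counts: Counter = Counter()
    for line in lines:
        # Assumed record format: lines such as [Result "1-0"]
        if line.startswith("[Result "):
            parts = line.split('"')
            # Skip malformed lines that lack a quoted value
            if len(parts) >= 3:
                counts[parts[1]] += 1
    return counts
```

Passing an open file or `sys.stdin` to `count_results` reads one line at a time, so memory use depends on the number of distinct result values rather than on the size of the input.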
|
|
The discussion here involved terabytes of data, so I'm curious how CLI tools would be faster than parallel processing at that scale...