|
|
|
|
|
by bsg75
3364 days ago
|
|
> It seems to now be more of a pointer referencing a basket of distributed processing technologies that run on YARN/HDFS You reference the MapR distribution for their training material, and its interesting that their version of HDFS is a reimplentation in C++ (MapR-FS). Its part of the reason I settled on MapR to use tools like Apache Drill, because the filesystem becomes usable to non-Hadoop tools via NFS (i.e. Awk). Given a shift in some categories away from map-reduce to other approaches, could Hadoop eventually just become a collection of distributed filesystems and job schedulers? |
|