by rxin
4554 days ago
Going from Hive/Pig to Spark substantially improves developer productivity (for non-reporting/BI workloads). You can properly unit test your program, use a debugger, and keep all your code in one place and in one language (rather than, as with Pig, writing UDFs in Java and then using a pseudo-scripting language for workflow specification). And those are just the productivity gains, not to mention the performance gains you get when you go from MapReduce to Spark.