	by rxin 4554 days ago
	Going from Hive/Pig to Spark substantially improves developer productivity (for non-reporting/BI workloads). You can properly unit test your program, use a debugger, and keep all your code in one place and one language (rather than, as with Pig, writing UDFs in Java and then specifying the workflow in a pseudo-scripting language). And those are just the productivity gains, not to mention the performance gains you get when you go from MapReduce to Spark.
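
To illustrate the unit-testing point, here is a rough sketch (not from the original comment) of a word count whose logic lives in plain Python functions. The names `tokenize`, `to_pair`, `word_count_local`, and `word_count_spark` are made up for illustration, and `word_count_spark` assumes you already have a PySpark `SparkContext` to pass in as `sc`.

```python
from collections import Counter
from operator import add


def tokenize(line):
    """Split a line into lowercase words; plain Python, no Spark needed."""
    return line.lower().split()


def to_pair(word):
    """Map a word to a (word, 1) pair for counting."""
    return (word, 1)


def word_count_local(lines):
    """Run the same logic on an in-memory list, which is handy in unit tests."""
    counts = Counter()
    for line in lines:
        for word, n in map(to_pair, tokenize(line)):
            counts[word] += n
    return dict(counts)


def word_count_spark(sc, path):
    """Wire the same functions into a Spark job.

    Assumption: `sc` is an existing pyspark SparkContext and `path` is a
    text file it can read. This function is not exercised by the test below.
    """
    return (sc.textFile(path)
              .flatMap(tokenize)
              .map(to_pair)
              .reduceByKey(add)
              .collect())


def test_word_count():
    """A unit test that needs neither a cluster nor a Spark install."""
    result = word_count_local(["Spark is fast", "spark is testable"])
    assert result == {"spark": 2, "is": 2, "fast": 1, "testable": 1}
```

Because `tokenize` and `to_pair` are ordinary functions, you can step through them in a debugger or test them directly, and the Spark job reuses them unchanged.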