HN Mirror

	by jakosz 2956 days ago
	You can get very good improvements over Spark too. I've been using GNU Parallel + Redis + Cython workers to calculate distance pairs for a disambiguation problem. But then again, if it fits into a few X1 instances, it's not big data!
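
A rough sketch of the general pattern described here, not the commenter's actual code: split the set of index pairs into chunks, put the chunks on a work queue, and let workers compute distances for one chunk at a time. Everything in it is an assumption made for illustration: the names `pair_chunks`, `worker`, and `run` are hypothetical, a `deque` stands in for a Redis list, a plain loop stands in for processes launched by GNU Parallel, and `math.dist` (Python 3.8+) stands in for a compiled Cython distance routine.

```python
import math
from collections import deque
from itertools import combinations, islice


def pair_chunks(n, chunk_size):
    """Yield lists of (i, j) index pairs with i < j, chunk_size pairs at a time."""
    pairs = combinations(range(n), 2)
    while True:
        chunk = list(islice(pairs, chunk_size))
        if not chunk:
            return
        yield chunk


def worker(points, chunk):
    """Compute Euclidean distances for one chunk of index pairs."""
    # Assumption: a real worker would do this in compiled (e.g. Cython) code.
    return {(i, j): math.dist(points[i], points[j]) for i, j in chunk}


def run(points, chunk_size=2):
    """Drain the work queue and merge the per-chunk results."""
    # Assumption: the deque stands in for a shared queue such as a Redis list,
    # and this loop stands in for many independent worker processes.
    queue = deque(pair_chunks(len(points), chunk_size))
    results = {}
    while queue:
        results.update(worker(points, queue.popleft()))
    return results


if __name__ == "__main__":
    print(run([(0, 0), (3, 4), (6, 8)]))
```

Because each chunk is independent, the workers need no coordination beyond popping from the queue, which is presumably why a simple process launcher plus a shared queue can be enough for this kind of job.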