| HN Mirror



	by dekhn 2592 days ago
	I feel like ADAM (https://github.com/bigdatagenomics/adam) is a huge step in the right direction. You convert from the standard genomics formats to Parquet and then work with the resulting data in Spark using genomics-specific libraries. In my experience, translating domain data into Spark has given roughly a 100X improvement in data analysis.
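	To make the "convert once, then query columns" idea concrete, here is a rough toy sketch in plain Python. It is not ADAM's API, it does not write real Parquet, and it does not use Spark; the records, the column subset, and the helper names `to_columns` and `count_high_quality` are all made up for illustration. It only shows the shape of the workflow: parse the text format once into typed columns, then answer questions by touching just the columns you need.

```python
# Toy illustration only: NOT ADAM's API, not real Parquet, no Spark.
# The records below are invented, VCF-like sample data.

VCF_TEXT = """\
#CHROM\tPOS\tID\tREF\tALT\tQUAL
chr1\t100\trs1\tA\tG\t50
chr1\t200\trs2\tC\tT\t10
chr2\t150\trs3\tG\tA\t99
"""


def to_columns(text):
    """Convert tab-separated, VCF-like text into a dict of column name -> list of values."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    header = lines[0].lstrip("#").split("\t")
    columns = {name: [] for name in header}
    for line in lines[1:]:
        for name, value in zip(header, line.split("\t")):
            columns[name].append(value)
    # Parse numeric fields once, at conversion time, instead of on every query.
    columns["POS"] = [int(v) for v in columns["POS"]]
    columns["QUAL"] = [float(v) for v in columns["QUAL"]]
    return columns


def count_high_quality(columns, chrom, min_qual):
    """Count variants on one chromosome with QUAL >= min_qual, reading only two columns."""
    return sum(
        1
        for c, q in zip(columns["CHROM"], columns["QUAL"])
        if c == chrom and q >= min_qual
    )


columns = to_columns(VCF_TEXT)
print(count_high_quality(columns, "chr1", 30.0))
```

	In a real pipeline the conversion step would write Parquet and the query would run in Spark across many machines; the sketch above only mirrors the structure of that approach on three rows.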