by dekhn
2592 days ago
I feel like ADAM (https://github.com/bigdatagenomics/adam) is a huge step in the right direction. You convert from standard genomics formats to Parquet and then work with the resulting data in Spark using genomics-specific libraries. In my experience, translating domain data into Spark has given roughly a 100x improvement in data analysis.
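
To make the "convert, then query" idea concrete, here is a rough stdlib-only sketch. It is not ADAM's API and it does not use Parquet or Spark; it only illustrates the column-oriented layout that the conversion gives you. The function names (`sam_to_columns`, `reads_per_reference`), the `min_mapq` threshold, and the example records are all made up for illustration. The field list is the 11 mandatory columns of the SAM format.

```python
from collections import Counter

# The 11 mandatory SAM fields, in order.
SAM_FIELDS = ["QNAME", "FLAG", "RNAME", "POS", "MAPQ", "CIGAR",
              "RNEXT", "PNEXT", "TLEN", "SEQ", "QUAL"]
INT_FIELDS = {"FLAG", "POS", "MAPQ", "PNEXT", "TLEN"}


def sam_to_columns(lines):
    """Turn SAM alignment lines into one list per field (column-oriented)."""
    columns = {name: [] for name in SAM_FIELDS}
    for line in lines:
        if line.startswith("@"):  # skip header lines
            continue
        # Keep only the mandatory fields; optional tags are ignored here.
        values = line.rstrip("\n").split("\t")[:len(SAM_FIELDS)]
        for name, value in zip(SAM_FIELDS, values):
            columns[name].append(int(value) if name in INT_FIELDS else value)
    return columns


def reads_per_reference(columns, min_mapq=30):
    """Count well-mapped reads per reference, touching only two columns."""
    return Counter(
        rname
        for rname, mapq in zip(columns["RNAME"], columns["MAPQ"])
        if mapq >= min_mapq
    )


# Made-up example records, not real data.
EXAMPLE_SAM = [
    "@HD\tVN:1.6\tSO:coordinate",
    "r1\t0\tchr1\t100\t60\t4M\t*\t0\t0\tACGT\tIIII",
    "r2\t0\tchr1\t150\t10\t4M\t*\t0\t0\tTTGA\tIIII",
    "r3\t16\tchr2\t200\t42\t4M\t*\t0\t0\tGGCC\tIIII",
]

columns = sam_to_columns(EXAMPLE_SAM)
counts = reads_per_reference(columns)
```

The query only reads `RNAME` and `MAPQ` and never looks at the sequence or quality strings. A columnar file format such as Parquet lets an engine skip the unused columns on disk in the same way, which is presumably a large part of where the speedup comes from.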