by rch
4606 days ago
First, I should note that my needs are fairly specific and not typical of the rest of the NGS world, though the datasets are essentially the same. The rate at which we are acquiring new data has been accelerating, but each of our Illumina datasets is only 30GB or so, and the total accumulated data is still just a few TB. The real imperative for using MR is the processing of that data, not its size. Integrating HMMER into Postgres, for instance, wouldn't be impossible, but I don't know of anything that's available now. Edit: An FDW for PostgreSQL around HMMER just made my to-do list.