|
|
|
|
|
by vineetg
3979 days ago
|
|
Thanks for bringing up all of these alternatives. We definitely would have preferred to use an existing solution to building our own. Unfortunately, a lot of the existing software is not intended for the search we're trying to do, or does not perform well under these conditions. We did in fact experiment with some of them before building our own. Bowtie, for example, doesn't allow more than 3 mismatches, and is also intended for alignments where there are very few matches (close to 1). Since we need to be able to support multiple genomes (see Josh's comment), the amount of RAM we need to run a particular set of alignments is relatively important. Things like BLAT (which seems also intended for > 25 bases) need to keep the entire genome index in memory. This means that we would need to spin up a lot more servers to handle parallel requests, especially with different genomes. FWIW our search is only a couple hundred lines of C++, and does the search with very little memory requirements. |
|