|
|
|
|
|
by jpeloquin
1334 days ago
|
|
I realized why this being based on simulation bothered me: this is a machine learning classifier that classifies viral genomes as synthetic or natural. The training set n = 72 (all negative, which is justifiable if you're ok with null hypothesis significance testing) the validation set n = 6 (only synthetic examples, which is less fine), and there's no test set. No effort was made to estimate true positive rate, false positive rate, etc. If this was published as a machine learning paper instead of a biology paper it would probably be held to a higher standard. |
|