by nonbel
3080 days ago
>"We found that adding these same features from the CFD model further boosted performance and so also included these. The final deployed model was trained only on the Avana data (combining with Gecko did not increase cross-validation performance)."
https://www.biorxiv.org/content/early/2016/10/05/078253

Sounds like information leaked from the validation/test data into your model and feature choices, which will make you overfit and thus overstate the accuracy. I may have missed it, but did you evaluate the skill of this model on a holdout dataset?

EDIT: This link doesn't appear to work:

>"All source code and a front-end website for the cloud service will be made available from http://research.microsoft.com/en-us/projects/crispr upon publication."
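
To make the holdout question concrete, here is a minimal sketch of the kind of split I have in mind. It is not the paper's code: the function names, the 20% holdout fraction, and the 5 folds are all assumptions chosen for illustration. The idea is only that choices such as "do these extra features help?" are compared on folds of a development set, while a separate holdout is scored once at the end.

```python
import random

def split_holdout(n_samples, holdout_frac=0.2, seed=0):
    """Return (dev_idx, holdout_idx); the holdout is set aside before any tuning."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    n_holdout = max(1, int(n_samples * holdout_frac))
    return idx[n_holdout:], idx[:n_holdout]

def k_folds(dev_idx, k=5):
    """Yield (train_idx, valid_idx) pairs drawn only from the development set."""
    for i in range(k):
        valid = dev_idx[i::k]
        valid_set = set(valid)
        train = [j for j in dev_idx if j not in valid_set]
        yield train, valid

# Hypothetical dataset of 100 samples.
dev_idx, holdout_idx = split_holdout(100)

# Feature and dataset choices are compared using only these folds;
# holdout_idx is never looked at until the final model is fixed.
folds = list(k_folds(dev_idx))
```

If the cross-validation score used to pick the features is also the score that gets reported, it is likely to be optimistic; the holdout number is the one I would want to see.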
EDIT: we will update the link, thanks. The correct link is https://www.microsoft.com/en-us/research/project/crispr/