| I think you're talking about their model in "Inferring the MRCA of SARS-CoV-2", and I'm talking about their model in "Separate introductions of lineages A and B"? So you're saying they don't use the epi simulations to root and build the phylogenetic tree of real sampled genomes, which is true. I'm saying they do use the epi simulations to build a phylogenetic tree for each simulated pandemic, whose shape (polytomy structure) they then compare against the real tree: > We simulated SARS-CoV-2–like epidemics (22, 23) with a doubling time of 3.47 days [95% highest density interval (HDI) across simulations, 1.35 to 5.44] (24–26) to account for the rapid spread of SARS-CoV-2 before it was identified as the etiological agent of COVID-19 (figs. S21 and S22, tables S3 and S4, and supplementary text). We then simulated coalescent processes and viral genome evolution across these epidemics to determine how frequently we recapitulated the observed SARS-CoV-2 phylogeny. Coverage of this paper in the popular press usually said something like "study finds that SARS-CoV-2 arose from two introductions into humans", so I thought the latter was the more important result and started there. Like in your second link, Worobey says: > [...] We then go on the explain, point by point, that it is not a two-mutation difference that is unexpected. It is a two mutation difference between two large clades like lineage A and lineage B, each displaying a MASSIVE polytomy at their root. This is something that [sic] DO NOT see in ~99.5% of simulations. That is the crux of the paper. Not the idea that two mutations can't happen in a single transmission event. Are those "simulations" not the SIR-type epi simulations (followed by simulation of the mutations and sampling, then construction of the tree)? I believe his 99.5% is 100% minus the 0.5% from Figure 2C. Their former model is of course independent of their SIR stuff, and indeed purports to independently establish tMRCA in humans too recent for significant cryptic spread. It carries a different set of plausible but arbitrary assumptions though, again about the stochasticity/overdispersion and sampling rate of early spread, just less directly. |
> It carries a different set of plausible but arbitrary assumptions though, again about the stochasticity/overdispersion and sampling rate of early spread, just less directly.
So, you don't only have problems with the modeling of the authors, but their base phylogeny too? Do you reject their tMRCA? Good grief.
I'm still looking forward to discussing the molecular phylogenetics of this paper sometime.