Hacker News new | ask | show | jobs
by forgotpwd16 1387 days ago
>It's about a separate personal desire to regulate supposedly risky research.

Referring to: https://www.nature.com/articles/nature.2015.18787

So a paper on a lab-made coronavirus related to SARS that can infect human cells from Wuhan on topic about the origins of a coronavirus related to SARS that can infect human cells that caused the pandemic from Wuhan, is a separate personal desire?

>It's about a completely different study that didn't result in any pandemic.

Because after the article was published, Wuhan stopped working on this, right? Or is expected China (or any other country) publishes everything related to something delicate as this?

Seriously this "rebuttal" is anything but excellent. The ironic answers to good remarks makes it even worse. Sounds more like trying to evade the ~~points made~~ info given rather answering them.

As for the papers, origins of pandemic and origins of virus are distinct things. So, yes, we can all agree that evidence shows that pandemic started in the market, and as virus can infect animals sold there, this points towards that virus came from animals. But genetically is there anything that can distinguish a natural from a lab-made virus?

1 comments

What is often missed is that chimeric viruses are easy to detect. The viral genome will show clear evidence of manipulation from random base insertions and clear homology with all the ancestral viruses. Hiding the signs of manipulation would either require vast amounts of time and resources (the expense and man power would make it very difficult to hide) or straight up science fiction technology. The chimeric origin hypothesis is not a plausible explanation for the origin of sars-cov2, which means the nature link is not relevant.

The other lab leak hypothesis is that a specimen collected and cultured by scientists, infected a lab employee and this patient zero then transmitted the virus to others. This is a plausible option, and it is being researched. However it is less plausible than wild transmission based on a simple numbers game. What is more likely, a breakout infection cause by a dozen scientists specifically trained and equipped against this possibility, or a transmission to one of the millions of other people who routinely interact with these bat populations? Both are possible, but one is much more likely. Before covid19, WIV had published research indicating that novel coronaviruses routinely jump from bats to humans in that part of the world. Most of these viruses aren't don't last in human hosts, but it's clear that it was only a matter time before something nasty got through. After all, it's already happened once before.

The real nail in the coffin is that research[0] has shown that there were at least two, independent transmissions of sars-cov2 to humans. For this to happen as part of a lab leak it would require WIV to have found and cultivated 2 different strains of sars-cov2, and then each of those strains would have to escape the lab.

[0] https://www.science.org/doi/10.1126/science.abp8715

That's a much better counter argument to the lab-origin. The

>The chimeric origin hypothesis is not a plausible explanation for the origin of sars-cov2, which means the nature link is not relevant.

seems to be incorrect. By a simple search: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7744920/

Now the two distinct genomic lineages seem to indeed present a challenge to lab-leak hypothesis. It's explained in the original study[0] that the second lineage B came from A by intra-host evolution. Due to the molecular clock of the virus the single-introduction origin of the pandemic from a lineage A can be ruled out.

[0]: https://www.science.org/doi/10.1126/science.abp8337

Have you looked at Pekar's full model, as described mostly in the supplementary materials? A typical molecular clock approach wouldn't give anywhere near the accuracy necessary to exclude evolution of lineage B (just two SNPs away) in humans. Pekar instead builds layer upon layer of complexity, with dozens of reasonable but somewhat arbitrary judgment calls, in the same general direction as econometrics. From the shape of the resulting modeled phylogenetic tree, he purports to exclude a single introduction into humans.

I'm not aware of any case where any similar model has been shown to have predictive power, and there's inherently no way to validate this one against any physical data. So I believe this result has been grossly oversold, per my comments and links at

https://news.ycombinator.com/item?id=32740568

> A typical molecular clock approach wouldn't give anywhere near the accuracy necessary to exclude evolution of lineage B (just two SNPs away) in humans

You're ignoring other data which is counter to the idea of B evolving from A in humans. Pekar's models are not the only evidence.

- Early cases were predominantly B - A shows less generic divergence than B, this is what Pekar is talking about with regards to the discontinuity in the early clock.

When we first started discussing this - I spoke up because I was annoyed by you trashing peer-reviewed papers when it was obvious you weren't even attempting to grok the phylogenetics involved. Still annoyed.

It's been genuinely interesting watching the scientific debate to root the SC2 tree over the past few years because of the involved paradoxes.

"Just a few SNPs" is just such a silly argument when stacked against peer-reviewed phylogenies in high-impact publications.

Have you looked at Pekar's full numerical stack yourself, as described in their supplemental materials? If yes, then why are you confident that their choice of the Barabasi-Albert algorithm to generate a fixed infection network correctly models the earliest spread of SARS-CoV-2 in humans? In particular, why choose to study robustness against doubling time (which seems intuitively like it wouldn't affect the shape of the tree much), but not robustness against that connectivity (which seems intuitively like it would)?

The rest of their arguments depend fundamentally on the polytomy thing, because nothing else excludes an earlier (even September) first introduction into humans. With an earlier introduction and thus more extensive unsampled spread, it's much harder to insist that A and B would be first sampled in the same order in which they evolved in humans, or make any similar early claims with confidence.

You are correct that I hadn't fully understood their polytomy argument before you brought it up, and I appreciate you bringing it to my attention. I still don't think it's very good, though. I later found Erik van Nimwegen's criticisms, which roughly followed my own; so I don't think I'm taking a fringe position here. Indeed, I've never seen anyone citing or defending Pekar engage in any way with the numerical complexity of that model. It seems like anyone who's looked inside the box becomes a critic, thus my hope that you'll do so.

High-impact publications have shown unfortunate willingness to publish low-quality work that would exclude research-related origin of SARS-CoV-2. For example, I assume you followed Nature's publication, editor's note, and ultimate extensive correction of their pangolin paper, and that you agree pangolins aren't the proximal host. This makes me less inclined to trust in their reviewers here, and more inclined to trust my own judgment (or that of the two Twitter threads I've linked elsewhere).

> In particular, why choose to study robustness against doubling time (which seems intuitively like it wouldn't affect the shape of the tree much)

As I understand it, the doubling times observed in the simulations were primarily the result of the ascertainment and transmission rate parameters.

Care to elaborate why you think the robustness of the model with respect to transmission rate should be assumed? I don't share your intuition here, and note that the authors observe, "that sensitivity analyses with longer doubling times increase the support for multiple introductions."

You really fault them for robustness analysis here?

Yes, I've reviewed the supplemental materials.

> because nothing else excludes an earlier (even September) first introduction into humans. With an earlier introduction and thus more extensive unsampled spread, it's much harder to insist that A and B would be first sampled in the same order in which they evolved in humans

The tMRCA clearly excludes an earlier introduction. Because the tMRCA is based on genetic diversity, you cannot calculate a tMRCA based on all the known samples, get a date, and then say "oh, geez- well, there was also wide cryptic spread before that." It just doesn't make sense. Pekar addresses this point directly.

A race between the first A and the first B is a strawman. Rather, it's the predominance of lineage B over A in the early pandemic which is interesting. It would be unexpected for lineage B to dominate if A came first. Much of the modeling is to get a handle on how unlikely that situation would be. It shouldn't be surprising that the models don't support it as being likely. (But, that's not the only evidence.)

If you're willing to actually think about and engage on the phylogeny - stop with the "just a few SNPs" nonsense, and ask yourself what you really think the early origins looked like. If it really was a single introduction - Was lineage A ancestral? Was B ancestral? A C/C ancestor? A T/T ancestor? All these have interesting problems being supported by the data.

Finally, after reading some of your earlier comments, I'm realizing that you're conflating several techniques from Pekar's paper, eg:

> Have you looked at Pekar's full model, as set out mostly in the supplementary materials? This isn't any standard molecular clock approach. It's a byzantine stack of plausible but somewhat arbitrary assumptions, ending in a simulated phylogenetic tree.

His epi simulations are separate from the tree-building, with the possible exception of rooting, which he was using the output of the models to inform. Otherwise, the epi modeling which everyone is hand wringing over is really separate and doesn't end "in a simulated phylogenetic tree."

There /are/ novel methods used in the tree building (eg, non-reversibility of base substitutions), but that's a whole separate technique.

> Essentially Pekar's argument is a "two introductions of the gaps"--that if their model of a single introduction doesn't conform to reality, then it must have been two introductions.

BS. Again - understanding the paradoxes and debate involved in rooting the tree is basically required to understand the importance of this paper. The existing data is confounding and didn't conform to a logical understanding of viral evolution. A separate introduction elegantly explains the existing evidence.

If their modeling isn't strong enough evidence for you, fine. But that's different than throwing everything out because you don't understand how "just a couple SNPs" can still provide sufficient resolution to make phylogenetic inferences possible. If you think that "just a couple SNPs" /don't/ provide enough for experts in the field to inform their phylogenies, at least get to that argument directly instead of throwing ignorant shade at an unrelated portion of the paper.

Thanks for the links to those other threads. Nod's was interesting, but AFAICT, way off-base, starting around "Needless to say, early winter in Wuhan is not the Mardi Gras."

Here's Pekar's earlier thread which I recently reread and found helpful for understanding the significance of the phylogeny (#20 is where he gets into how lineage A breaks the clock):

https://twitter.com/jepekar/status/1499840335349911553

and Worobey re-emphasizing that we're not just talking about a few SNPs, it's the shape of the tree which matters:

https://twitter.com/michaelworobey/status/157050467474223923...