Hacker News new | ask | show | jobs
by hammock 1958 days ago
I see the ANALYSIS you provide does not use my DNA but I'm unclear- are you collecting my DNA? Are you saving it or storing it, in any form, in either physical or digital form? What assurances do I have that any of the labs or other partners you use won't do this either?
3 comments

Bristle CSO here. To elaborate on Esyir's reply, through saliva collection, we inevitably will have your DNA in the sample. However, our analysis selectively removes host DNA from the sample, enabling targeted sequencing of the oral microbiome. During the processing of the sample, we may capture some signal from the host genome. Our sequencing, however, captures such a low level of the host genome that we cannot derive any meaningful signal from host genetic data, and actually selectively remove that signal from the analysis to enrich for microbial signals.

We keep your saliva sample and its molecular derivatives in physical form completely DEIDENTIFIED such that it cannot be mapped back to YOU without an encrypted key.

TLDR, we do NOT use host genetic data, but cannot guarantee that none of that data will be stored somewhere in this analysis. We do not share any identifiable data with anyone. We do not analyze host genetic data.

Who has the encrypted key?

The statement is meaningless if you have the key.

We maintain the key and will never share it without your consent. The key is critical to reporting microbiome data back to the correct person. Additionally, maintaining these identifiers will be crucial when we plan to add additional health features to our platform such as dental consults, tracking the outcomes of therapeutic/preventative interventions, and improvement tracking.

Of course, there are a few tricky circumstances around the future of the data. What happens if we go bankrupt/sell the company? I can't say for sure what will happen, but I am fully committed to the destruction of the key at any point when we realize that the identifiers do not serve a purpose.

but sharing the key would be sharing (access to) the DNA
Could we have an option of not storing biological materials after processing?

Assuming you don't store (notable) digital host data, it seems much safer, in case policy / ownership were to change, a breach happened and/or someone was compelled to produce samples/data.

Absolutely, you may request that your physical sample be destroyed after receiving your results. At present, we biobank samples in the event that the sample needs to be re-run.

Please send us an email at info@bristlehealth.com and we can ensure your physical sample is destroyed post-sequencing.

Will you be using samples to identify NEW microbiome bacteria? Or are you only identifying KNOWN bacteria?
This is a great question! In short, there is a network effect with our platform. Each sample we sequence contributes to a database of novel bacteria, viruses, and fungi that we can use to discover new microbes.
Thanks to y'all for the elaboration
I know it is not good HN policy to doubt your intentions but by emphasizing the NOT's, DEIDENTIFIED and YOU, you make me highly suspicious. Anonymization of data is difficult at best and sometimes nearly impossible so I would advise to publish the entire protocol if you want to give people assurances. The encryption key, as already stated, is pointless without an explanation how you use it and why it is employed and is otherwise just smoke and mirrors and no real security.
I completely understand your hesitation around the anonymization of data, and will have more detail around how your data will be anonymizes and used on our website shortly.
There's no way to get the oral/gun microbiome using shotgun sequencing without also picking up human data along the way. The raw sequence files will contain reads mapping to the human genome, but those are often filtered out in subsequent steps of the data processing.
From another reply -

  We also will NOT be selling or using any of your personal genetic data without consent.
"Genetic data" seems to imply that they do look up your DNA too.
Genetic data is €€€.
See Esyir's response. In short, we do NOT look up your DNA, and only look into microbial DNA. In fact, we do a host depletion step to enrich for microbial DNA prior to sequencing, then a following host depletion step prior to microbial mapping. Sequencing will inevitably pick up some signal from the host genome, but at such low levels, we cannot derive meaningful signal from the host DNA.

TLDR, we do NOT use host genetic data, but cannot guarantee that none of that data will be stored somewhere in this analysis