| I would be eternally grateful to a bio-chem person to explain two facets of DNA sequencing I've never understood. To motivate the question a short quote from Wikipedia: "For longer targets such as chromosomes, common approaches consist of cutting (with restriction enzymes) or shearing (with mechanical forces) large DNA fragments into shorter DNA fragments. The fragmented DNA may then be cloned into a DNA vector and amplified in a bacterial host such as Escherichia coli. " Two questions: 1) OK, we cut up the DNA and replicate it. How in the heck do we assemble the pieces together to restore the original ordering? I've got three pieces A,B,C. How do I know it was originally ABC or CBA or BAC etc? I can correctly sequence a piece ... but what I want is the sequence of all base-pairs in their original order. How on earth do they do that? 2) DNA spends most of its time wrapped up around histones. How do does one get it off the histones for access to sequencing? 3) Less interesting: unless we sequence the whole chromosome researches must pick off a piece ... how do we know which piece to pick off and how do we know where it's crammed around the histones? |
The key to this is understanding that the reads or sequences are long, and the slicing happens at somewhat random locations, such that your reads all overlap with each other (of course in reality it's more complicated than that, due to a bunch of long non-coding sequences and repeating sequences, but this will be mostly true for interesting parts of the genome).
Therefore your mental model of A, B, C is not particularly useful. I would replace it with the following example:
QWERTYUIOP ---[gets chopped up into]----> QWERT ERTY UIOP WE QWER TYU QW
Note that I used the terms "read" and "sequence" interchangeably here because the general notion is the same. But these two words refer to two different things.
2.
Histones are proteins. You can get rid of them simply by adding proteases to the mix. Proteases specifically break down proteins and leave nucleic acid sequences intact.
A commonly used protease is Proteinase K, which you can just order online here: https://www.sigmaaldrich.com/life-science/metabolomics/enzym...
You can also buy kits for specific applications: https://www.sigmaaldrich.com/life-science/molecular-biology/...
Once the histones have been digested, the DNA can be extracted, purified, amplified, etc, at your leisure. A full treatment of this topic would fill an entire textbook, so I will spare you the details.
3.
I'm not sure what this question means, but if you can rephrase I might be able to answer.