Hacker News new | ask | show | jobs
by robert-boehnke 1262 days ago
Wouldn't you only need two bits to encode C, G, A, and T?
1 comments

Yes, two bits (assuming you only need to encode those four bases; unfortunately, biological reality requires FASTQ to encode other nucleotide values). There's even a format from UCSC called "2bit" (https://genome.ucsc.edu/goldenPath/help/twoBit.html)