“Truly finishing the human genome sequence was like putting on a new pair of glasses.” Critical for Understanding Human Genomic Variation and Genetic Contributions to Certain Diseases
April 12, 2022
At a Glance
- Researchers finished sequencing the roughly 3 billion bases (or “letters”) of DNA that make up a human genome.
- Having a complete, gap-free sequence of our DNA is critical for understanding human genomic variation and the genetic contributions to certain diseases.
The work was done by the Telomere to Telomere (T2T) consortium. T2T is led by researchers at NIH’s National Human Genome Research Institute (NHGRI), the University of California, Santa Cruz, and the University of Washington, Seattle. NHGRI was the primary funder.
“Short-read” technologies were originally used to sequence the human genome. These provide several hundred bases of DNA sequence at a time, which computers then stitch together. Such methods still leave gaps in genome sequences, particularly in long repetitive regions where short reads cannot be placed unambiguously.
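To make the idea of stitching reads together concrete, here is a minimal Python sketch of a toy greedy overlap merge. It is an illustration only, not the method used by the T2T consortium or by any production assembler. The function names (`overlap`, `greedy_assemble`), the `min_len` threshold, and the example reads are all hypothetical, and the sketch assumes error-free reads from a single DNA strand.

```python
def overlap(a: str, b: str, min_len: int = 3) -> int:
    """Return the length of the longest suffix of `a` that equals a prefix of `b`.

    Overlaps shorter than `min_len` are ignored and reported as 0.
    """
    for n in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:n]):
            return n
    return 0


def greedy_assemble(reads: list[str], min_len: int = 3) -> list[str]:
    """Repeatedly merge the pair of sequences with the largest overlap.

    Returns the remaining pieces. More than one piece means the reads
    could not be joined into a single sequence, i.e. there is a gap.
    """
    contigs = list(reads)
    while len(contigs) > 1:
        best_len, best_i, best_j = 0, -1, -1
        for i, a in enumerate(contigs):
            for j, b in enumerate(contigs):
                if i != j:
                    n = overlap(a, b, min_len)
                    if n > best_len:
                        best_len, best_i, best_j = n, i, j
        if best_len == 0:
            break  # nothing left overlaps: the pieces stay separate
        merged = contigs[best_i] + contigs[best_j][best_len:]
        contigs = [c for k, c in enumerate(contigs) if k not in (best_i, best_j)]
        contigs.append(merged)
    return contigs


# Hypothetical reads taken from the made-up sequence ATGCGTACGTTAGC.
joined = greedy_assemble(["ATGCGTAC", "GTACGTTA", "GTTAGC"])

# Two hypothetical reads that share no overlap remain as separate pieces.
gapped = greedy_assemble(["ATGCGTAC", "TTAGGCCA"])
```

In this toy case the three overlapping reads collapse into one sequence, while the two non-overlapping reads stay apart, which is the kind of gap the article describes. Real genomes add sequencing errors, both DNA strands, and long repeats, so actual assembly software is far more involved.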
Over the past decade, two new DNA sequencing technologies emerged that can read longer sequences without compromising accuracy. The PacBio HiFi DNA sequencing method can read about 20,000 letters with nearly perfect accuracy. The Oxford Nanopore DNA sequencing method can read even more — up to 1 million DNA letters at a time — with modest accuracy. Both were used to generate the complete human genome sequence.
In total, the new project added nearly 200 million letters of the genetic code. This last 8% of the genome includes numerous genes as well as repetitive DNA sequences, which may influence how cells function. Most of the newly added sequences were in the centromeres, the dense middle sections of chromosomes, and near the repetitive ends of each chromosome.
The complete genome sequence will be particularly valuable for studies that aim to understand how DNA differs from person to person. For example, T2T researchers used the sequence as a reference to discover more than 2 million previously unknown sequence variants in the human genome. These included variants within many medically relevant genes.
“This complete human genome sequence has already provided new insight into genome biology, and I look forward to the next decade of discoveries about these newly revealed regions,” says Dr. Karen Miga, a co-chair of the T2T consortium at the University of California, Santa Cruz.
“Truly finishing the human genome sequence was like putting on a new pair of glasses,” says consortium co-chair Dr. Adam Phillippy, whose group at NHGRI led the effort. “Now that we can clearly see everything, we are one step closer to understanding what it all means.”
This accomplishment can now serve as a model for sequencing genomes from globally diverse people — a goal researchers are pursuing. Further work is also needed to finish the complete sequence of the Y chromosome, which was not contained in the cells used for this study.
“This foundational information will strengthen the many ongoing efforts to understand all the functional nuances of the human genome, which in turn will empower genetic studies of human disease,” says Dr. Eric Green, director of NHGRI.