Building Complete and Inclusive Human Genomes: How National and Global Efforts Are Addressing Long-Standing Reference Gaps
- Dec 31, 2025
- 3 min read
TAIPEI, TAIWAN, Dec. 31th, 2025- Despite decades of progress since the completion of the Human Genome Project, fundamental limitations remain in how the human genome is represented and interpreted. Large repetitive regions, structural variants, and population-specific sequences have historically been underrepresented or entirely missing from reference genomes. Recently, two major initiatives — South Korea’s National Pangenome Project and the international Telomere-to-Telomere (T2T) Consortium — are addressing these gaps by rethinking both the scope of reference genomes and the technologies used to build them.
South Korea’s National Pangenome Project was launched to address a key limitation of existing genomic references: their limited representation of Asian populations. Most widely used reference genomes are derived from a small number of individuals, primarily of European ancestry, which can introduce bias in variant interpretation and disease research. To resolve this, South Korean researchers plan to generate high-quality, telomere-to-telomere-level genome assemblies from more than 1,000 Korean individuals. These assemblies will form a population-specific pangenome while also contributing to global reference efforts.

Achieving this goal requires the ability to accurately assemble complex regions such as centromeres, segmental duplications, and structural variants — features that are often fragmented or misrepresented using short-read sequencing alone. The project therefore adopted long-read, high-accuracy sequencing to generate contiguous assemblies that preserve haplotype structure and capture variants across a wide size range. This approach allows researchers to move beyond single linear references and instead represent multiple genomic paths that better reflect real population diversity.
A similar challenge motivated the Telomere-to-Telomere Consortium, which set out to complete the first truly gapless human genome assembly. Earlier references left unresolved regions at chromosome ends and across highly repetitive sequences, limiting biological interpretation and masking functional elements. The T2T team addressed this by combining long and ultra-long reads with improved assembly algorithms to span previously inaccessible regions. The result was a complete human genome that added nearly 200 million bases of newly resolved sequence and corrected numerous errors in older references.
The success of the T2T assembly has had broader implications, extending beyond the production of a single complete genome. It demonstrated that long-standing “dark regions” of the genome are technically solvable and biologically meaningful. Many of the newly resolved regions contain genes, regulatory elements, and structural variation relevant to evolution and disease. These insights reinforced the need for pangenome strategies that represent multiple high-quality assemblies rather than relying on a single reference.
In both projects, the selection of long-read, high-fidelity sequencing reflects a shared objective: resolving genomic complexity with sufficient accuracy for population-scale analysis. High-accuracy long reads enable direct detection of structural variants, repeat expansions, and haplotype-specific sequences, reducing reliance on inference or imputation. This is especially important for integrating genomic data into clinical research, where false positives or missing variants can directly affect interpretation.
Together, South Korea’s National Pangenome Project and the T2T effort illustrate a broader transition in genomics. By addressing technical blind spots and population bias simultaneously, these initiatives are redefining what a reference genome should be — not a static consensus sequence, but a dynamic framework that reflects human diversity and biological complexity.
[References]
About WASAI Technology Inc.
WASAI Technology's mission is to deliver acceleration technologies of High-Performance Data Analysis (HPDA) in future data centers for targeted vertical applications with massive volumes and high velocities of scientific data. To strengthen and advance scientific discovery and technological research via big data-intensive acceleration in high-performance computing, WASAI Technology aims to improve commercialization and commoditization of scientific and technological applications.
###


It’s amazing to see how genome research is moving toward truly inclusive references, capturing diversity that was long overlooked. Reading this, I thought about how in my own academic life I sometimes got tempted to pay someone to take my online accounting exam, thinking it was a shortcut. But projects like these show the value of doing the work properly careful, precise effort uncovers real insights and prevents errors. It’s a good reminder that thoroughness pays off, whether in science or studies. Your post makes me smile.
I read the post about building complete and inclusive human genomes and it really made me think about how science can open doors to better health for everyone by filling in missing pieces of our DNA map. I even remembered when I used Law project editing service to clean up a tough write up while learning about genetics and it helped me understand my own work better. Getting the details right really matters.