top of page

Genome in a Bottle Consortium Expands into Cancer Genomics with Publicly Accessible Tumor-Normal Datasets

TAIPEI, TAIWAN, Nov. 30, 2024- The Genome in a Bottle Consortium (GIAB), a project hosted by the National Institute of Standards and Technology (NIST), is renowned for providing high-quality reference materials that serve as benchmarks for sequencing technologies and bioinformatics tools. In a groundbreaking move, GIAB is now venturing into cancer genomics by developing matched tumor-normal genomic datasets explicitly consented for public use. These resources aim to accelerate advancements in detecting somatic mutations and refining sequencing methods.


In recent update, GIAB is taking an exciting step forward by developing genomic benchmarking datasets from matched tumor-normal samples, explicitly consented for public use and dissemination. Aiming to create an open and accessible resource that empowers researchers to advance cancer research. By offering datasets tailored to the complexities of tumor biology, this project provides a foundation for developing new methods to detect somatic mutations—one of the most pressing challenges in cancer genomics. Unlike earlier GIAB datasets, which focused on healthy individuals, this effort delves into the intricate genetic landscape of tumors, offering researchers a valuable tool to innovate and refine their approaches.



To construct the tumor-normal datasets, GIAB researchers utilized an array of 13 advanced whole-genome measurement technologies. These included short-read and long-read PCR-free whole-genome sequencing, single-cell whole-genome sequencing (WGS), Hi-C short-read sequencing, optical genome mapping (OGM), and cytogenetic analysis. These state-of-the-art techniques were applied to samples from a pancreatic ductal adenocarcinoma (PDAC) patient identified as HG008, a 61-year-old woman of European descent recruited at Massachusetts General Hospital.


The focus of this work is not to compare or evaluate the various technologies but to ensure comprehensive and reliable data for future use. To achieve this, GIAB collaborated closely with technology developers, several of whom provided data for the project.


The result of this collaboration is one of the most detailed and accessible cancer genomic datasets currently available. Researchers worldwide can now access the data on the National Center for Biotechnology Information (NCBI) FTP site and its Sequence Read Archive (SRA). This open access ensures that scientists have reliable reference points for testing new approaches in cancer genomics.


The work doesn’t stop here. GIAB plans to further analyze the genomic data from the HG008 samples to identify and characterize somatic variants. These findings will be made publicly available as they are completed. The first step in this process is curating a set of somatic structural variants to serve as a foundational benchmark for future studies.


GIAB’s efforts mark a significant milestone in cancer research. By providing a standardized, openly available resource, the consortium is equipping scientists with tools to advance somatic mutation detection and improve bioinformatics methodologies. This pioneering initiative underscores the importance of collaboration and open data sharing in tackling the complexities of cancer genomics, paving the way for transformative advancements in the field.


Reference:

 






About WASAI Technology Inc.

WASAI Technology's mission is to deliver acceleration technologies of High-Performance Data Analysis (HPDA) in future data centers for targeted vertical applications with massive volumes and high velocities of scientific data. To strengthen and advance scientific discovery and technological research via big data-intensive acceleration in high-performance computing, WASAI Technology aims to improve commercialization and commoditization of scientific and technological applications.

​​###

Comments


WASAI Tecnology Inc.

4F, No. 6, Zhiyuan 3rd Rd., Beitou Dist., Taipei 112025, Taiwan 

wasai@wasaitech.com

bottom of page