Mount Sinai Scientists Develop New Approach for More Accurate and Comprehensive Whole Genome Assembly, Variant Discovery and Interpretation
A new strategy for uncovering difficult-to-detect, complex forms of genomic variation associated with human disease
Scientists from the Icahn School of Medicine at Mount Sinai have developed a new approach to build nearly complete genomes by combining high-throughput DNA sequencing with genome mapping. The methodology enabled researchers to detect complex forms of genomic variation, critically important for their association with human disease, but previously difficult to detect. The study was published today in Nature Methods, and is a collaboration with scientists at European Molecular Biology Lab, Weill Cornell Medical College, Cold Spring Harbor Laboratory, Rockefeller University, University of California, San Francisco, Pacific Biosciences, and BioNano Genomics.
Conventional next-generation sequencing (NGS) techniques are able to accurately detect certain types of variation, such as single nucleotide variants and small insertions or deletions, but miss many large or complex forms of genomic variation that are associated with human disease. Further, these previous approaches are poorly suited for completely de novo analysis of genomes and for phasing the maternal and paternal haplotypes of an individual.
“We created a high-throughput strategy that builds highly contiguous de novo genomes without the need for complex jumping libraries or targeted approaches. This strategy, in some cases, automatically resolved complete arms of chromosomes,” said Ali Bashir, PhD, Assistant Professor of Genetics and Genomics at the Icahn School of Medicine and senior author of the study. “While we focused this study on a human genome, the method can be applied to any new genome, including those with high genomic complexity, such as plants, that have been extremely challenging to study.”
To overcome limitations with existing NGS methods, the study authors combined two single molecule approaches: long read sequencing from Pacific Biosciences and Nanochannel Array technology from BioNano Genomics. Pacific Biosciences sequencing enables reads exceeding 10kb in length, which can directly resolve and phase complex forms of variation. The NanoChannel Array from BioNano confines and linearizes DNA molecules up to megabases in length to provide high-resolution sequence motif physical maps, termed ‘genome maps’.
The researchers studied the NA12878 diploid genome, a well-sequenced sample that is part of the 1000 Genomes project and often used for benchmarking new techniques. The study authors mapped variation and built assemblies with both technologies, then combined the two to create a “hybrid” assembly that dramatically improved the contiguity of each. The resulting hybrid assembly N50s, the length such that 50% of all base pairs are contained in scaffolds of the given length or longer, approach 30Mb - on par with the best assemblies to date at a fraction of the cost and labor.
“The study revealed an unprecedented view of genomic complexity, in many cases identifying regions overlooked by conventional sequencing or further refining previously known genetic variant classes,” said study co-author Jan Korbel, PhD, Group Leader at the European Molecular Biology Laboratory. “We had notable success in challenging regions such as inversions and tandem repeats,” added co-author Robert Sebra, PhD, Assistant Professor of Genetics and Genomic Sciences at the Icahn School of Medicine. “For example, a systematic underrepresentation of tandem repeat sizes was observed in the human reference genomes. Such expansions, as we observed within the LPA gene which has been associated with plasmid lipid levels, are increasingly being identified as important markers for disease.”
“By using a powerful combination of new technologies, we can finally begin to circumvent biases induced by overreliance on a single reference genome” said co-author Eric Schadt, PhD, Founding Director of the Icahn Institute, and Professor of Genomics at the Icahn School of Medicine. “Fully de novo approaches will increasingly become standard practice to enable direct and comprehensive characterization of genome variation. This will accelerate our understanding of the links to human diseases that such variations induce.”
Matthew Pendleton, Robert Sebra, Andy Wing Chun Pang, Ajay Ummat, Oscar Franzen,Tobias Rausch, Adrian M Stütz, William Stedman, Thomas Anantharaman, Alex Hastie, Heng Dai,
Markus Hsi-Yang Fritz, Han Cao, Ariella Cohain, Gintaras Deikus, Russell E Durrett, Scott C Blanchard,Roger Altman, Chen-Shan Chin, Yan Guo, Ellen E Paxinos, Jan O Korbel, Robert B Darnell,
W Richard McCombie, Pui-Yan Kwok, Christopher E Mason, Eric E Schadt & Ali Bashir. “Assembly and Diploid Architecture of an Individual Human Genome via Single Molecule Technologies." Nature Methods. DOI: 10.1038/nmeth.3454
About the Mount Sinai Health System
Mount Sinai Health System is one of the largest academic medical systems in the New York metro area, with more than 43,000 employees working across eight hospitals, over 400 outpatient practices, nearly 300 labs, a school of nursing, and a leading school of medicine and graduate education. Mount Sinai advances health for all people, everywhere, by taking on the most complex health care challenges of our time — discovering and applying new scientific learning and knowledge; developing safer, more effective treatments; educating the next generation of medical leaders and innovators; and supporting local communities by delivering high-quality care to all who need it.
Through the integration of its hospitals, labs, and schools, Mount Sinai offers comprehensive health care solutions from birth through geriatrics, leveraging innovative approaches such as artificial intelligence and informatics while keeping patients’ medical and emotional needs at the center of all treatment. The Health System includes approximately 7,300 primary and specialty care physicians; 13 joint-venture outpatient surgery centers throughout the five boroughs of New York City, Westchester, Long Island, and Florida; and more than 30 affiliated community health centers. We are consistently ranked by U.S. News & World Report's Best Hospitals, receiving high "Honor Roll" status, and are highly ranked: No. 1 in Geriatrics and top 20 in Cardiology/Heart Surgery, Diabetes/Endocrinology, Gastroenterology/GI Surgery, Neurology/Neurosurgery, Orthopedics, Pulmonology/Lung Surgery, Rehabilitation, and Urology. New York Eye and Ear Infirmary of Mount Sinai is ranked No. 12 in Ophthalmology. U.S. News & World Report’s “Best Children’s Hospitals” ranks Mount Sinai Kravis Children's Hospital among the country’s best in several pediatric specialties. The Icahn School of Medicine at Mount Sinai is one of three medical schools that have earned distinction by multiple indicators: It is consistently ranked in the top 20 by U.S. News & World Report's "Best Medical Schools," aligned with a U.S. News & World Report "Honor Roll" Hospital, and top 20 in the nation for National Institutes of Health funding and top 5 in the nation for numerous basic and clinical research areas. Newsweek’s “The World’s Best Smart Hospitals” ranks The Mount Sinai Hospital as No. 1 in New York and in the top five globally, and Mount Sinai Morningside in the top 20 globally.