Landscape of multi-nucleotide variants in 125,748 human exomes and 15,708 genomes

Show full item record



Genome Aggregation Database Prod T , Genome Aggregation Database Consor , Wang , Q , Pierce-Hoffman , E , Cummings , B B , MacArthur , D G , Groop , L , Färkkilä , M , Palotie , A , Remes , A M , Ripatti , S , Salomaa , V , Soininen , H , Suvisaari , J , Tuomi , T , Vartiainen , E & Wessman , M 2020 , ' Landscape of multi-nucleotide variants in 125,748 human exomes and 15,708 genomes ' , Nature Communications , vol. 11 , no. 1 , 2539 .

Title: Landscape of multi-nucleotide variants in 125,748 human exomes and 15,708 genomes
Author: Genome Aggregation Database Prod T; Genome Aggregation Database Consor; Wang, Qingbo; Pierce-Hoffman, Emma; Cummings, Beryl B.; MacArthur, Daniel G.; Groop, Leif; Färkkilä, Martti; Palotie, Aarno; Remes, Anne M.; Ripatti, Samuli; Salomaa, Veikko; Soininen, Hilkka; Suvisaari, Jaana; Tuomi, Tiinamaija; Vartiainen, Erkki; Wessman, Maija
Contributor organization: Centre of Excellence in Complex Disease Genetics
HUS Abdominal Center
Institute for Molecular Medicine Finland
Helsinki Institute of Life Science HiLIFE
Department of Medicine
Gastroenterologian yksikkö
Helsinki University Hospital Area
Research Programs Unit
Aarno Palotie / Principal Investigator
Genomics of Neurological and Neuropsychiatric Disorders
Department of Public Health
Samuli Olli Ripatti / Principal Investigator
Complex Disease Genetics
Biostatistics Helsinki
Date: 2020-05-27
Language: eng
Number of pages: 13
Belongs to series: Nature Communications
ISSN: 2041-1723
Abstract: Multi-nucleotide variants (MNVs), defined as two or more nearby variants existing on the same haplotype in an individual, are a clinically and biologically important class of genetic variation. However, existing tools typically do not accurately classify MNVs, and understanding of their mutational origins remains limited. Here, we systematically survey MNVs in 125,748 whole exomes and 15,708 whole genomes from the Genome Aggregation Database (gnomAD). We identify 1,792,248 MNVs across the genome with constituent variants falling within 2bp distance of one another, including 18,756 variants with a novel combined effect on protein sequence. Finally, we estimate the relative impact of known mutational mechanisms - CpG deamination, replication error by polymerase zeta, and polymerase slippage at repeat junctions - on the generation of MNVs. Our results demonstrate the value of haplotype-aware variant annotation, and refine our understanding of genome-wide mutational mechanisms of MNVs. Multi-nucleotide variants (MNV) are genetic variants in close proximity of each other on the same haplotype whose functional impact is difficult to predict if they reside in the same codon. Here, Wang et al. use the gnomAD dataset to assemble a catalogue of MNVs and estimate their global mutation rate.
1184 Genetics, developmental biology, physiology
Peer reviewed: Yes
Rights: cc_by
Usage restriction: openAccess
Self-archived version: publishedVersion

Files in this item

Total number of downloads: Loading...

Files Size Format View
s41467_019_12438_5.pdf 1.593Mb PDF View/Open

This item appears in the following Collection(s)

Show full item record