Computational framework for targeted high-coverage sequencing based NIPT

Show full item record



Permalink

http://hdl.handle.net/10138/306598

Citation

Teder , H , Paluoja , P , Rekker , K , Salumets , A , Krjutškov , K & Palta , P 2019 , ' Computational framework for targeted high-coverage sequencing based NIPT ' , PLoS One , vol. 14 , no. 7 , 0209139 . https://doi.org/10.1371/journal.pone.0209139

Title: Computational framework for targeted high-coverage sequencing based NIPT
Author: Teder, Hindrek; Paluoja, Priit; Rekker, Kadri; Salumets, Andres; Krjutškov, Kaarel; Palta, Priit
Contributor organization: HUS Gynecology and Obstetrics
Clinicum
Department of Obstetrics and Gynecology
University of Helsinki
CAN-PRO - Translational Cancer Medicine Program
Research Programs Unit
Institute for Molecular Medicine Finland
Genomics of Neurological and Neuropsychiatric Disorders
Date: 2019-07-08
Language: eng
Number of pages: 19
Belongs to series: PLoS One
ISSN: 1932-6203
DOI: https://doi.org/10.1371/journal.pone.0209139
URI: http://hdl.handle.net/10138/306598
Abstract: Non-invasive prenatal testing (NIPT) enables accurate detection of fetal chromosomal trisomies. The majority of publicly available computational methods for sequencing-based NIPT analyses rely on low-coverage whole-genome sequencing (WGS) data and are not applicable for targeted high-coverage sequencing data from cell-free DNA samples. Here, we present a novel computational framework for a targeted high-coverage sequencing-based NIPT analysis. The developed framework uses a hidden Markov model (HMM) in conjunction with a supplemental machine learning model, such as decision tree (DT) or support vector machine (SVM), to detect fetal trisomy and parental origin of additional fetal chromosomes. These models were developed using simulated datasets covering a wide range of biologically relevant scenarios with various chromosomal quantities, parental origins of extra chromosomes, fetal DNA fractions, and sequencing read depths. Developed models were tested on simulated and experimental targeted sequencing datasets. Consequently, we determined the functional feasibility and limitations of each proposed approach and demonstrated that read count-based HMM achieved the best overall classification accuracy of 0.89 for detecting fetal euploidies and trisomies on simulated dataset. Furthermore, we show that by using the DT and SVM on the HMM classification results, it was possible to increase the final trisomy classification accuracy to 0.98 and 0.99, respectively. We demonstrate that read count and allelic ratio-based models can achieve a high accuracy (up to 0.98) for detecting fetal trisomy even if the fetal fraction is as low as 2%. Currently, existing commercial NIPT analysis requires at least 4% of fetal fraction, which can be possibly a challenge in case of early gestational age (35 kg/m2). More accurate detection can be achieved at higher sequencing depth using HMM in conjunction with supplemental models, which significantly improve the trisomy detection especially in borderline scenarios (e.g., very low fetal fraction) and enables to perform NIPT even earlier than 10 weeks of pregnancy.
Subject: 1182 Biochemistry, cell and molecular biology
113 Computer and information sciences
CELL-FREE DNA
DOWN-SYNDROME
MEIOTIC STAGE
MATERNAL AGE
ANEUPLOIDY
PLASMA
NONDISJUNCTION
TRISOMY-21
FRACTION
GENESIS
Peer reviewed: Yes
Rights: cc_by
Usage restriction: openAccess
Self-archived version: publishedVersion


Files in this item

Total number of downloads: Loading...

Files Size Format View
journal.pone.0209139.pdf 1.861Mb PDF View/Open

This item appears in the following Collection(s)

Show full item record