Lep-MAP3 : robust linkage mapping even for low-coverage whole genome sequencing data

Show full item record




Rastas , P 2017 , ' Lep-MAP3 : robust linkage mapping even for low-coverage whole genome sequencing data ' , Bioinformatics , vol. 33 , no. 23 , pp. 3726-3732 . https://doi.org/10.1093/bioinformatics/btx494

Title: Lep-MAP3 : robust linkage mapping even for low-coverage whole genome sequencing data
Author: Rastas, Pasi
Contributor organization: Biosciences
Date: 2017-12-01
Language: eng
Number of pages: 7
Belongs to series: Bioinformatics
ISSN: 1367-4803
DOI: https://doi.org/10.1093/bioinformatics/btx494
URI: http://hdl.handle.net/10138/312091
Abstract: Motivation: Accurate and dense linkage maps are useful in family-based linkage and association studies, quantitative trait locus mapping, analysis of genome synteny and other genomic data analyses. Moreover, linkage mapping is one of the best ways to detect errors in de novo genome assemblies, as well as to orient and place assembly contigs within chromosomes. A small mapping cross of tens of individuals will detect many errors where distant parts of the genome are erroneously joined together. With more individuals and markers, even more local errors can be detected and more contigs can be oriented. However, the tools that are currently available for constructing linkage maps are not well suited for large, possible low-coverage, whole genome sequencing datasets. Results: Here we present a linkage mapping software Lep-MAP3, capable of mapping high-throughput whole genome sequencing datasets. Such data allows cost-efficient genotyping of millions of single nucleotide polymorphisms (SNPs) for thousands of individual samples, enabling, among other analyses, comprehensive validation and refinement of de novo genome assemblies. The algorithms of Lep-MAP3 can analyse low-coverage datasets and reduce data filtering and curation on any data. This yields more markers in the final maps with less manual work even on problematic datasets. We demonstrate that Lep-MAP3 obtains very good performance already on 5x sequencing coverage and outperforms the fastest available software on simulated data on accuracy and often on speed. We also construct de novo linkage maps on 7-12x whole-genome data on the Red postman butterfly (Heliconius erato) with almost 3 million markers.
1182 Biochemistry, cell and molecular biology
111 Mathematics
113 Computer and information sciences
Peer reviewed: Yes
Usage restriction: openAccess
Self-archived version: acceptedVersion

Files in this item

Total number of downloads: Loading...

Files Size Format View
lepmap3.pdf 439.4Kb PDF View/Open

This item appears in the following Collection(s)

Show full item record