Lep-MAP3: robust linkage mapping even for low-coverage whole genome sequencing data





Rastas, P 2017, 'Lep-MAP3: robust linkage mapping even for low-coverage whole genome sequencing data', Bioinformatics, vol. 33, no. 23, pp. 3726-3732. https://doi.org/10.1093/bioinformatics/btx494

Title: Lep-MAP3: robust linkage mapping even for low-coverage whole genome sequencing data
Author: Rastas, Pasi
Author's organization: Biosciences
Date: 2017-12-01
Language: eng
Number of pages: 7
Belongs to series: Bioinformatics
ISSN: 1367-4803
DOI: https://doi.org/10.1093/bioinformatics/btx494
Permanent link (URI): http://hdl.handle.net/10138/312091
Abstract: Motivation: Accurate and dense linkage maps are useful in family-based linkage and association studies, quantitative trait locus mapping, analysis of genome synteny and other genomic data analyses. Moreover, linkage mapping is one of the best ways to detect errors in de novo genome assemblies, as well as to orient and place assembly contigs within chromosomes. A small mapping cross of tens of individuals will detect many errors where distant parts of the genome are erroneously joined together. With more individuals and markers, even more local errors can be detected and more contigs can be oriented. However, the tools that are currently available for constructing linkage maps are not well suited for large, possibly low-coverage, whole genome sequencing datasets. Results: Here we present Lep-MAP3, linkage mapping software capable of mapping high-throughput whole genome sequencing datasets. Such data allow cost-efficient genotyping of millions of single nucleotide polymorphisms (SNPs) for thousands of individual samples, enabling, among other analyses, comprehensive validation and refinement of de novo genome assemblies. The algorithms of Lep-MAP3 can analyse low-coverage datasets and reduce the need for data filtering and curation on any data. This yields more markers in the final maps with less manual work, even on problematic datasets. We demonstrate that Lep-MAP3 already performs very well at 5x sequencing coverage and outperforms the fastest available software on simulated data in accuracy and often in speed. We also construct de novo linkage maps from 7-12x whole-genome data on the red postman butterfly (Heliconius erato) with almost 3 million markers.
1182 Biochemistry, cell and molecular biology
111 Mathematics
113 Computer and information sciences
Peer reviewed: Yes
Usage restriction: openAccess
Parallel-published version: acceptedVersion

Files in this item


Files Size Format View
lepmap3.pdf 439.4Kb PDF View/Open
