Assessing the power of principal components and wright's fixation index analyzes applied to reveal the genome-wide genetic differences between herds of Holstein cows

Show full item record



Permalink

http://hdl.handle.net/10138/315936

Citation

Smaragdov , M G & Kudinov , A A 2020 , ' Assessing the power of principal components and wright's fixation index analyzes applied to reveal the genome-wide genetic differences between herds of Holstein cows ' , BMC Genetics , vol. 21 , no. 1 , 47 . https://doi.org/10.1186/s12863-020-00848-0

Title: Assessing the power of principal components and wright's fixation index analyzes applied to reveal the genome-wide genetic differences between herds of Holstein cows
Author: Smaragdov, M. G.; Kudinov, A. A.
Contributor: University of Helsinki, Animal Science Research
Date: 2020-04-28
Language: eng
Number of pages: 15
Belongs to series: BMC Genetics
ISSN: 1471-2156
URI: http://hdl.handle.net/10138/315936
Abstract: Background Due to the advent of SNP array technology, a genome-wide analysis of genetic differences between populations and breeds has become possible at a previously unattainable level. The Wright's fixation index (F-st) and the principal component analysis (PCA) are widely used methods in animal genetics studies. In paper we compared the power of these methods, their complementing each other and which of them is the most powerful. Results Comparative analysis of the power Principal Components Analysis (PCA) and F-st were carried out to reveal genetic differences between herds of Holsteinized cows. Totally, 803 BovineSNP50 genotypes of cows from 13 herds were used in current study. Obtained F-st values were in the range of 0.002-0.012 (mean 0.0049) while for rare SNPs with MAF 0.0001-0.005 they were even smaller in the range of 0.001-0.01 (mean 0.0027). Genetic relatedness of the cows in the herds was the cause of such small F-st values. The contribution of rare alleles with MAF 0.0001-0.01 to the F-st values was much less than common alleles and this effect depends on linkage disequilibrium (LD). Despite of substantial change in the MAF spectrum and the number of SNPs we observed small effect size of LD - based pruning on F-st data. PCA analysis confirmed the mutual admixture and small genetic difference between herds. Moreover, PCA analysis of the herds based on the visualization the results of a single eigenvector cannot be used to significantly differentiate herds. Only summed eigenvectors should be used to realize full power of PCA to differentiate small between herds genetic difference. Finally, we presented evidences that the significance of F-st data far exceeds the significance of PCA data when these methods are used to reveal genetic differences between herds. Conclusions LD - based pruning had a small effect on findings of F-st and PCA analyzes. Therefore, for weakly structured populations the LD - based pruning is not effective. In addition, our results show that the significance of genetic differences between herds obtained by F-st analysis exceeds the values of PCA. Proposed, to differentiate herds or low structured populations we recommend primarily using the F-st approach and only then PCA.
Subject: Principal components
Fixation index
Minor allele frequency
Dairy cattle
Genetic diversity
POPULATION-STRUCTURE
WHOLE-GENOME
F-ST
CATTLE
RARE
DIVERSITY
CONSERVATION
ASSOCIATION
1184 Genetics, developmental biology, physiology
414 Agricultural biotechnology
Rights:


Files in this item

Total number of downloads: Loading...

Files Size Format View
document_herds.pdf 838.3Kb PDF View/Open

This item appears in the following Collection(s)

Show full item record