Document Type : Research Articles
Author
Department of Animal Science, Faculty of Agriculture, Arak University, Arak, Iran
Abstract
Introduction: Genetic architecture of sheep reproduction is increasingly gaining scientific interest due to the major impact on sheep production systems. The number of lambs per lambing is one of the most important reproductive traits in sheep. Many studies have reported that genetic mechanisms play an important role in the variation of litter size in sheep. Reproductive traits normally show low heritability and therefore response to conventional selection methods is not satisfactory for these traits. Considering the genetic information of the genetic variants underlying reproduction variability could efficiently increase the selection efficacy. Genome-wide association studies (GWAS) have been used to identify associations between genotypes and phenotypes as well as candidate genes for reproductive economically important traits. Statistical power in GWAS is mostly affected by sample size. The low sample size is hence a main obstacle in GWAS. Combining multiple data sets of different studies for joint (mega) GWAS provides an opportunity to increase the sample size required for GWAS. This study was performed to identify genomic regions affecting litter size in different sheep breeds using the mega-analysis of GWAS.
Materials and Methods: Multi-population joint GWAS was performed using genotypic and phenotypic data of three sheep breeds including native Zandi and two breed retrieved from the database. Quality control was performed using the Plink software. The markers or individuals were removed from the further study based on the following criteria: (1) unknown chromosomal or physical location, call rate <0.90, missing genotype frequency >0.05, minor allele frequency (MAF) < 0.05, and a Pvalue for Hardy–Weinberg equilibrium test less than 10-6. Before analysis, imputation of missing genotypes for combined data set was implemented by LD-kNNi method. Mega-analysis was performed using a mixed linear model in TASSEL software considering kinship and population structure (top five components of principal component analysis (PCA)) as confounding effects. The quantile–quantile (Q–Q) plot was visualized by plotting the distribution of obtained vs. expected log10 (P-value). The association results along the genome and the significant SNPs were visualized in the Manhattan plot. To account for multiple test problem and identify the genome-wide significance level, Bonferroni test was used based on the number of independent SNPs obtained from pairwise linkage disequilibrium analysis. After GWAS analysis, the 500 bp sequence upstream and downstream of the significant SNP was explored to identify the adjacent candidate genes using ARS-UI_Ramb_v2.0 (Genome Data Viewer).
Results and Discussion: In the present study, we implemented a mega GWAS using three different sheep breed data to identify the genetic mechanisms responsible for litter size in sheep. After quality control, 671 animals and 45167 SNP markers were kept for further analysis. The results of the mega-analysis identified nine marker on chromosome on chromosomes 1 (two SNP), 2, 3 (two SNP), 10, 13 (two SNP), and 22. The quantile–quantile plot that features the total distribution of the observed P-values (−log10 P-values) of quality passed SNPs vs. the expected values, showed the effective control for confounding effects. Many of the significant SNPs identified in this study were located in or very adjacent to known genes (DLG1, CLSTN2, INHBE, TCFL5, and RBP4) that have been already reported for their contribution to fertility and pregnancy success. It has been reported that the RBP4 gene is expressed during the period of fast elongation of the pig blastocyst which is a crucial period for the survival of the embryos. Also, it has been suggested that CLSTN2 has the main contribution in uterine and conceptus physiology during the establishment of pregnancy and therefore can be considered as a candidate gene for litter size. INHBE has an essential function during ovulation and pregnancy through extracellular matrix (ECM) components degradation and therefore enabling cell migration and angiogenesis.
Conclusion: Comparison of the results of this study with previous reports showed that the mega-analysis of GWAS, compared to the meta-analysis already reported for GWAS results, had comparable power in identifying genomic regions influencing litter size in sheep but identified fewer genomic regions than individual GWAS for each breed. No previously reported major genes controlling litter size in sheep were identified using our mega GWAS. The results of our research are suggested for further investigations in identifying causal genetic variants or genomic regions underlying the litter size variation in sheep and can be used to understand the genetic mechanism controlling this trait.
Keywords
Main Subjects
Send comment about this article