Identification of all microsatellites in the genome of Iranian Bactrian camels using whole-genome sequencing data

Article type: Research article - Animal and Poultry Genetics and Breeding

Authors

1 Department of Animal Science, Faculty of Agriculture and Natural Resources, University of Mohaghegh Ardabili, Ardabil, Iran

2 PhD in Animal and Poultry Genetics and Breeding, University of Mohaghegh Ardabili, Ardabil, Iran

3 Department of Animal Science, Faculty of Agriculture and Natural Resources, University of Mohaghegh Ardabili, Ardabil, Iran.

Abstract

Bactrian camels are among the animal species most resistant to harsh environmental conditions, and their numbers in Iran have fallen into the endangered range. A better and more precise understanding of this species, particularly at the genomic level, can help conserve it by informing the design of genetic diversity management programs. The aim of this study was to identify the microsatellites in the genome of Iranian Bactrian camels using whole-genome sequencing data. Blood samples were taken from six Bactrian camels from Ardabil province. Whole-genome sequencing was performed on the Illumina platform in paired-end mode with a read length of 100 bp from each end. After quality filtering, the reads were assembled de novo, and the MISA program was used to identify all microsatellites in the studied genomes. The assembled genome sizes of the studied Bactrian camels ranged from 1.90 Gb for sample one to 1.97 Gb for sample three. The N50 of the resulting contigs ranged from 19.1 kb for sample one to 51.7 kb for sample five. The identified microsatellites had motif sizes of one to eight nucleotides. The total number of microsatellites identified in the Iranian Bactrian camel samples ranged from 136028 for sample two to 539555 for sample three. In addition, microsatellites were identified in seven other mammalian species and the results were compared with those of the Iranian Bactrian camel.

Keywords


Article title [English]

Identification of total microsatellites in the genome of Iranian Bactrian camels using whole genome sequencing data

Authors [English]

  • Nahideh Zare 1
  • Nemat Hedayat Evrigh 1
  • Reza Seyedsharifi 1
  • Reza Khakhali 2
  • Azade Boustan 3
1 Department of Animal Science, Faculty of Agriculture and Natural Resources, University of Mohaghegh Ardabili.
2 PhD in Genetics and Animal Breeding, University of Mohaghegh Ardabili.
3 Department of Animal Science, Faculty of Agriculture and Natural Resources, University of Mohaghegh Ardabili.
Abstract [English]

Introduction Bactrian camels are known as one of the species most resistant to harsh environmental conditions. A camel's body temperature may vary from 34 to 41 °C over the course of a day. Camels can survive losing body water equal to more than 25% of total body weight, whereas in non-desert mammals losses greater than 15% are fatal. Because Iran lies in one of the most arid regions of the world and faces a shortage of water resources, the camel, with its special capabilities, can be a valuable source of protein in the country. Genetic diversity is one of the most widely studied topics in domestic animals, and microsatellites are widely used in this field. Microsatellite sequences contain useful information and are used to assess genetic diversity within and between populations, as well as to investigate evolutionary processes between species. The main aim of the present study was to identify all microsatellites in the genome of Iranian Bactrian camels using whole-genome sequencing data and to compare them with those of other mammals.
 Materials and Methods This study was carried out to identify genome-wide microsatellites in six Bactrian camels from Ardabil province. Blood samples were collected from the jugular vein using 4 ml vacutainer tubes and stored at -20 °C until use. Illumina HiSeq 2000 technology (Illumina, USA) was used for whole-genome sequencing of the samples. Sequencing was performed in paired-end mode with 100 bp reads from each end. Quality control of the raw sequence reads was performed using FastQC. The SLIDINGWINDOW (4:20) step of Trimmomatic v0.36 was used to quality-filter the raw reads. After quality filtering, reads shorter than 40 bp were discarded. De novo assembly of the trimmed reads was done using CLC Genomics Workbench 11 (CLC Bio, Aarhus, Denmark). The assembly parameters were: mismatch cost 3, insertion and deletion cost 3, length fraction 0.5, and similarity fraction 0.8. The assembled genomes were searched for microsatellites using MISA, with motif sizes ranging from mono- to octanucleotide. The minimum repeat numbers were set to 12 for mono-, 6 for di-, 5 for tri- and tetra-, 4 for penta- and hexa-, and 3 for hepta- and octanucleotide repeat SSRs. Microsatellites interrupted by up to 100 nucleotides were considered compound microsatellites. In addition, assembled genomes of several other mammals were downloaded and searched for microsatellite loci: Arabian dromedary camel, Bactrian camel, alpaca, horse, cattle, sheep, and human.
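The repeat thresholds and the compound-microsatellite rule described above can be illustrated with a small sketch. This is not MISA itself and will not reproduce its output exactly; it is a simplified regular-expression search that assumes the thresholds stated in the abstract and reads the 100-nucleotide rule as a maximum gap between neighbouring SSRs. The names find_ssrs and group_compound are hypothetical.

```python
import re

# Minimum repeat count per motif length, taken from the abstract:
# 12 (mono), 6 (di), 5 (tri, tetra), 4 (penta, hexa), 3 (hepta, octa).
MIN_REPEATS = {1: 12, 2: 6, 3: 5, 4: 5, 5: 4, 6: 4, 7: 3, 8: 3}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, end, motif, repeat_count) tuples for perfect SSRs.

    Simplified illustration only; coordinates are 0-based, end-exclusive.
    """
    seq = seq.upper()
    hits = []
    for k in sorted(min_repeats):
        n = min_repeats[k]
        # A k-mer followed by at least n - 1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are just a shorter unit repeated (e.g. "AA", "ACAC"),
            # since those loci belong to the shorter motif class.
            if any(motif == motif[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), m.end(), motif, (m.end() - m.start()) // k))
    hits.sort()
    return hits


def group_compound(hits, max_gap=100):
    """Group neighbouring SSRs whose gap is at most max_gap nucleotides.

    Assumption: a group with more than one member is a compound microsatellite.
    """
    groups = []
    for hit in hits:
        if groups and hit[0] - groups[-1][-1][1] <= max_gap:
            groups[-1].append(hit)
        else:
            groups.append([hit])
    return groups


if __name__ == "__main__":
    demo = "GGC" + "A" * 12 + "CGT" + "AC" * 6 + "TTG"
    for group in group_compound(find_ssrs(demo)):
        print("compound" if len(group) > 1 else "simple", group)
```

In this toy sequence the mononucleotide run and the dinucleotide run are separated by three bases, so they are reported as one compound locus; lowering max_gap below that distance reports them separately.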
 Results and Discussion The assembled genome sizes of the Bactrian camels ranged from 1.90 Gb for sample one to 1.97 Gb for sample three. The N50 length of the assembled contigs ranged from 19.1 kb for sample one to 51.5 kb for sample five. Contig N50 is one of the measures of genome assembly quality, and a larger value indicates a more contiguous assembly. The total number of microsatellite loci identified in Iranian Bactrian camels ranged from 136028 for sample two to 539555 for sample three. The genomes of samples one, two, three, four, five and six contained 3.13 Mb, 2.35 Mb, 9.26 Mb, 7.1 Mb, 8.99 Mb and 8.86 Mb of microsatellite sequence, respectively. It should be noted that the differences in microsatellite content among the Iranian Bactrian camel genomes are due to differences in assembly quality. Among the mammals examined in this study, human (25.7 Mb) and horse (7.81 Mb) had the highest and lowest total microsatellite size, respectively. The results revealed that the number of microsatellites decreases as motif length increases, so that mono- and dinucleotide repeats are the most frequent motifs. In all seven species, the ten most abundant motifs accounted for more than 74% of the identified microsatellites. The motif T was the most frequent in samples one and six of the Iranian Bactrian camels, Iranian dromedary camels, Bactrian camel, cattle, sheep, horse and human. In samples two, three, four and five, the non-Iranian dromedary camel and alpaca, the motif A was the most abundant. The findings of this study provide a valuable resource for further studies on camel breeding, especially in Iranian Bactrian camels. The large number of camel SSR markers identified in this study constitutes a valuable resource for investigating genetic diversity and may support the development of breeding programs for Iranian Bactrian camels in the future.
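For readers unfamiliar with the contig N50 statistic mentioned above, the following minimal sketch shows the usual definition: the length of the contig at which the cumulative length of contigs, sorted from longest to shortest, first reaches half of the total assembly length. The function name n50 and the example lengths are illustrative and are not taken from the study.

```python
def n50(contig_lengths):
    """Return the N50 of a collection of contig lengths.

    Contigs are sorted from longest to shortest; N50 is the length of the
    contig at which the running total first reaches half of the assembly size.
    """
    if not contig_lengths:
        raise ValueError("at least one contig length is required")
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length


if __name__ == "__main__":
    # Toy contig lengths, purely illustrative.
    print(n50([2, 3, 4, 5, 6]))
```

Because N50 depends only on how the assembled sequence is distributed across contigs, two assemblies of nearly the same total size (as in the six samples here) can still differ markedly in N50.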

Keywords [English]

  • Bactrian camel
  • Iran
  • Microsatellite
  • Whole genome sequencing