738
Views
0
CrossRef citations to date
0
Altmetric
Rapid Communications

Whole-genome resequencing reveals new mutations in candidate genes for Beichuan-white goat prolificacya

ORCID Icon, , , , , & show all

Abstract

In this study, we evaluated the copy number variation in the genomes of two groups of Beichuan-white goat populations with large differences in litter size by FST method, and identified 1739 genes and 485 missense mutations in the genes subject to positive selection. Through functional enrichment, ITGAV, LRP4, CDH23, TPRN, RYR2 and CELSR1 genes, involved in embryonic morphogenesis, were essential for litter size trait, which received intensive attention. In addition, some mutation sites of these genes have been proposed (ITGAV: c.38C > T; TPRN: c.133A > T, c.1192A > G, c.1250A > C; CELSR1: c.7640T > C), whose allele frequencies were significantly changed in the high fecundity goat group. Besides, we found that new mutations at these sites altered the hydrophilicity and 3D structure of the protein. Candidate genes related to litter size in this study and their missense mutation sites were identified. These candidate genes are helpful to understand the genetic mechanism of fecundity in Beichuan white goat, and have important significance for future goat breeding.

Introduction

Capra hircus is widely used for its meat, milk and hides, and has very high economic value (Citation1). Litter size (the number of kids in one birth) is closely related to the economics of goat breeding, but studies have shown the heritabilities of litter size related traits are low and is strongly influenced by epigenetic factors. Although through long-term natural selection and artificial intervention, some characteristic goats have been bred and the genetic resources of modern domestic goats have been expanded (Citation2). However, it is still the focus of current research to search for new reproductive control candidate genes, carry out polymeric breeding with known reproductive genes to make breakthroughs in improving high fecundity goats, and meet the growing demand for animal products (Citation3,Citation4). Therefore, the ongoing search of related genes that affecting reproductive traits is essential in the field of livestock breeding (Citation5,Citation6).

Genome resequencing facilitates population genetic structure assessment, molecular conservation and improvement of local livestock breeds. It has become a reliable way to mine candidate genes for important traits in livestock. Thus, genome-wide resequencing data is used to identify genomic characteristics of domestication or artificial selection, and marker-assisted selection (MAS) based on major candidate genes to improve yield traits remains an important alternative to traditional breeding techniques. Recently, some progress has been made in the research on reproductive traits in goat and sheep, and high fecundity genes such as CCNB1, GDF9, FecB, FecG and FecL have been identified (Citation7). Guan et al compared Dazu black goats with Nei Menggu cashmere goats and found that genes (PAIP2B, CCDC64, EPB41L5, BIRC6) found in Dazu black goats were correlated with reproductive traits (Citation8). Liu et al analyzed whole-genome selection signals of Duolang sheep, small-tailed Han sheep and Mongolian sheep and found 143 genomic regions were subject to selection, and genes related to reproduction, horn type and ear development were screened out (Citation9). Other studies identified were associated with milk production, fat deposition, meat quality and production by whole-genome resequencing (Citation10). However, current genomic characterization of kidding traits in goats and resequencing gene screening are still insufficient.

Beichuan-white goat, a local goat breed that evolved in the specific ecological environment in China. It is distributed at the border of Tibet Plateau and Chengdu Plain. While adapting to the great temperature difference in different seasons, Beichuan white goat keeps the average kidding percentage of female goat above 200%, which shows its high breeding characteristics undoubtedly. Thus, it is ideal resource to screen the breeding genes. However, few studies on the genomic characteristics and genetic characteristics of Beichuan-white goat and litter size have not yet been mined. In addition, the protection and utilization of germplasm resources of this variety need to be paid more attention. Therefore, to elucidate the genetic basis of reproduction in Beichuan-white goats, this study selected 12 female goats from breeding base of Beichuan white goat producing area, distributed in two tails with an average litter size (in three consecutive birth records) and divided into two groups of high fecundity (n = 6) and low fecundity (n = 6) goats. Then, whole gene resequencing was performed to them. Candidate genes of litter size were screened by genome-wide selection signaling assays and these results were compared to identify genomic regions putatively under selection. This study is to provide a theoretical basis for the breeding of goat kidding traits through the mining of candidate genes or molecular markers for the number of kids in Beichuan-white goat, which is of great significance for the breeding of new varieties of goat with multiple kids.

Materials and methods

Animals and ethics

A total of twelves 2-3-year-old healthy female Beichuan-white goats, average body weight 37 ± 6 kg, were divided into high fecundity (n = 6) and low fecundity (n = 6) and placed in a house. The goat was ear tragged, fed three times per day, and given free access to water and salt.

The ethical approval for this study was obtained from the Ethical Committee of Mianyang Academy of Agricultural Sciences with permission number MAAS-2022-003. All efforts were made to minimize possible discomfort during blood collection.

Goat genomic DNA extraction

Genomic DNA was extracted by standard phenol chloroform method. Briefly, a 2 mL blood sample was used for the next processing step. The erythrocyte lysate was added in equal proportions and mixed for 10 min, centrifuged for 10 min in 4 °C 6000 rpm, and the supernatant discarded. Then, 20ul proteinase K, 10% SDS 200 μL and STE 1 mL was added, well shaken and mixed, and digested in a water bath at 55 °C overnight. Subsequently, an equal volume of Tris saturated phenol was added and centrifuged; DNA extractant, centrifuged; 3 times the volume of anhydrous ethanol, centrifuged; 1 mL of 75% ethanol centrifuged. Finally, 200 μL of TE solution was appended to dissolve the DNA and −20 °C stored. DNA quality and concentration were determined by gel electrophoresis and UV spectrophotometer.

DNA library construction and whole genomic resequencing

For genome resequencing, at least 1 μg of genomic DNA from each sample was used to construct a library with an insert size of 350 ∼ 500 bp. The fragmented DNA was modified, and PCR amplification was performed to construct a sequencing library. The quality-checked libraries were sequenced on Illumina’s HiSEquation 2000 sequencer with an average sequencing depth of 10x to obtain Sequenced Reads.

Genome-wide analysis of genetic variation

Sequencing data filtering and sequence alignment

The raw reads are filtered by removing the reads with adapter, unknown sequences over 10, and low quality (Q ≤ 10). The sequencing quality was evaluated by FastQC software after filtration (Citation11). The main assessment items included the number of reads, the overall quality and length distribution of each base in reads, and the content and preferred types of the A, T, G and C bases in reads. The 150-bp paired-end reads were mapped onto the goat reference genome (ARS1CGCA_001704415.1). Use the bwa index -a bwtsw command to build an index for the reference genome file. The samtools tool was used to convert the sam files into bam files in binary compressed format (Citation12). Then, we used the samtools sort command to sort the compared files according to the reference sequence serial number. Finally, the reads of PCR duplicates in the file were marked by the MarkDuplicates command of the picard tool and created an index using CreateSequenceDictionary.jar.

Identification and annotation of genetic variation sites

Firstly, HaplotypeCaller command of gatk tool was used to extract genetic variation information (Citation13). The SNPs were filtered by a series of thresholds with ‘VariantFiltration’ mode of GATK package-4.0.8.1. The condition on variant filtering is DP (Depth) < 10, QD (Quality score by depth) < 2.0, FS (Phred-scaled p-value using Fisher’s exact test) < 60.0, SQR (StrandOddsRatio) >4.0, MQ (Mapping quality score) <40.0, MQRankSum (Rank sum test for mapping quality of reference and alternative within reads) <−12.5, and ReadPosRankSum (Rank Sum Test for relative positioning of reference and alternative alleles within reads) <−8.0. Simultaneous identification of variant information for all samples used GenotypeGVCFs command (joint calling). Finally, the MergeVcfs command of GATK tool was used to combine the identification results of all chromosomes into a vcf file containing the variation information of all samples.

The vcf file was split into 30 files (1 ∼ 29, MT) by chromosome using the vcftools tool (Citation14). To shorten the annotation time, all contents of the files need to be removed except chromosome number, base position, rs number and allele information. Annotation of genetic variation information was performed using the Ensembl Variant Effect Predictor (VEP) online tool of the Ensembl database (http://asia.ensembl.org/info/docs/tools/vep/index.html).

Genome-wide selective signal detection

FST statistics infer the likelihood of selection by calculating the difference in allele frequency of a marker site between different populations. A larger value of the statistic indicates a greater likelihood of population segregation at the site and a greater polymorphism between populations; conversely, a smaller likelihood of population segregation. In this study, the vcftools tool was used to calculate the amount of FST between single-kid and multi-kid group goat populations, and the analysis strategy of sliding-window method was used to scan the whole genetic range (window-size: 100 kb, window-step: 10 kb). The Z - standardized transformation of FST value is performed by R program. The Z conversion formula of FST is as follows: ZFST=|FSTμFST|σFST

Where, μFST is the population mean FST, σFST is the standard deviation of FST. Using R of qqman package mapped of Manhattan that Z(FST) >2.326 areas were identified to the selected area (p < 0.01) (Citation15).

Gene ontology analysis

The annotation information of each locus in the selection fragment was extracted based on the whole genome annotation file. The corresponding gene symbol of each locus were uploaded to Ensembl Biomart database and the goat gene symbol was converted into the corresponding human gene symbol. Function of enrichment using Metascape online website (http://metascape.org/gp/index.html#/main/step1), and FDR p < 0.05 was used as a criterion to screen for enriched pathways, from which the reproduction-related items and the genes were selected.

Screening for gene mutation sites

Vcftools was used to calculate the FST values of all the loci of the above selected genes. Manhattan was draw by the qqman program in R, and the loci with FST ≥0.27 were identified as the selected site. The non-synonymous mutation sites in the selected region were counted and the allele frequencies in low fecundity and high fecundity were calculated.

Protein 3D structure analysis and physicochemical property prediction

The I-TASSER online program (https://zhanglab.ccmb.med.umich.edu/I-TASSER/) was used to predict the 3D structure of proteins (Citation16). We use Ensembl database to get the sequence and location information of the mutant gene and convert the amino acids in the mutant location. Amino acid sequences of wild-type and mutant genes were then input into the I-TASSER online program to predict protein structure. Then, according to the α helix, β folding, irregular curling, and spatial conformation of the protein structure, we determined whether the protein structure have changed. Protein structures was visualized by PyMOL 2.4. ProtScale online tool (https://web.expasy.org/protscale/) predicted the hydrophilicity of proteins before and after mutation and the GraphPad Prism8.0 was used to draw map of hydrophobicity.

Results

Phenotype and population division

shows the descriptive statistics of litter size of Beichuan-white goat. For all selected goats with litter size records, ranged from 1 to 3, with a mean of 1.83 and a standard deviation of 0.91. As described above, 12 goats were divided into two population pairs according to the difference of phenotypic values. Of the 12 Beichuan-white goats, 6 of them were assigned to the high-fecundity (H) population with an average litter size of 2.67 ± 0.49, and the remaining 6 were assigned to the low-fecundity (L) population with an average litter size of 1.00 ± 0.00. As expected, the phenotypic values of litter size trait between H and L populations are significant differences (P-value <0.01) (). The PCA indicated that the genetic backgrounds of H and L populations were similar and the population division based on the value of litter size did not lead to population stratification ().

Figure 1. Genome-wide distributions of selection signals and reproduction-related candidate genes in two Beichuan-white goat populations. a) Distribution of average litter size of high and low fecundity Beichuan-white goat population. b) The positive end of the Z(FST) distribution was plotted along Beichuan-white goat autosomes (different chromosomes are separated by color). the Z(FST) values was calculated for each sliding 100Kb window with steps of 10Kb across the autosomes, and solid horizontal line indicates the cutoff (Z(FST) >2.326) used for extracting outliers. c) The selected 1739 genes were enriched by the GO pathway using Metascape online database with the top 20 items shown, and q < 0.05 (p correction value) was the standard. d) Distribution of mutation types at mutation sites of the positively selected genes.

Figure 1. Genome-wide distributions of selection signals and reproduction-related candidate genes in two Beichuan-white goat populations. a) Distribution of average litter size of high and low fecundity Beichuan-white goat population. b) The positive end of the Z(FST) distribution was plotted along Beichuan-white goat autosomes (different chromosomes are separated by color). the Z(FST) values was calculated for each sliding 100Kb window with steps of 10Kb across the autosomes, and solid horizontal line indicates the cutoff (Z(FST) >2.326) used for extracting outliers. c) The selected 1739 genes were enriched by the GO pathway using Metascape online database with the top 20 items shown, and q < 0.05 (p correction value) was the standard. d) Distribution of mutation types at mutation sites of the positively selected genes.

Whole genomic selective signal detection and enrichment analysis

To screen genomic regions under positive selection in high fecundity goats, a sliding window method based FST with a window size of 100Kb and a step size of 10Kb was calculated. The Manhattan plot was made through normalized FST results using Z(FST)>2.326 as a threshold to filter selected regions (p < 0.01) (). The results showed that 825 regions were positively selected region and the average length is 178.93Kb.

The positively selected regions screened correspond to 1739 genes, encoding protein (Table S1). Subsequently, these genes were subjected to functional enrichment analysis using Gene Ontology (GO) database. The top 20 of significantly enriched items are shown (). GO pathway analyses identified the positively selected genes mainly enriched in microtubule-based processregulation of ion transmembrane transport, regulation of system process and so on. In addition, genes in the top 5% of selected regions and their GO functional enrichment pathways were displayed and mainly enriched in immunological synapse formation, regulation of vesicle-mediated transport, regulation of cell projection organization, cell-cell recognition and so on (Table S2 and ).

Figure 2. Result of selection signal detection and protein structure prediction on the ITGAV gene of Beichuan-white goats. a) Result of the ITGAV gene selection signal. The red dot represents the missense mutation site. The black solid line represents the selection signal screening threshold (FST =0.27). b) Effect of c.38C > T locus variation on the hydrophilicity of 0–20 amino acid sites in ITGAV. The black and red lines represent wild-type and mutant ITGAV proteins, respectively. c) The effect of c.38C > T locus mutation on the overall and local schematic 3D structure of wild-type and mutant ITGAV protein.

Figure 2. Result of selection signal detection and protein structure prediction on the ITGAV gene of Beichuan-white goats. a) Result of the ITGAV gene selection signal. The red dot represents the missense mutation site. The black solid line represents the selection signal screening threshold (FST =0.27). b) Effect of c.38C > T locus variation on the hydrophilicity of 0–20 amino acid sites in ITGAV. The black and red lines represent wild-type and mutant ITGAV proteins, respectively. c) The effect of c.38C > T locus mutation on the overall and local schematic 3D structure of wild-type and mutant ITGAV protein.

Candidate genes related to litter size and SNP

There are diversified types of variation in positive selection sites, and the variation of these sites, especially non-synonymous mutations, may has a great impact on the 3-dimensional (3D) structure and physicochemical properties of proteins. Therefore, we further identified SNP of the positive selection genes screened in the above. The results showed that 485 missense mutation loci out of 106,750 mutation loci were identified in the chromosomes of these positive selection genes ().

To understand the specific role of these missense mutations, functional enrichment of GO was performed. Importantly, some reproduction-related pathways such as cell morphogenesis involved in differentiation, embryonic morphogenesis, and cell morphogenesis were also significantly enriched (q < 0.05). Further, ITGAV, LRP4, CDH23, TPRN, RYR2, CELSR1 genes related to these pathways were focused and identified as candidate genes affecting reproductive function in goats. In addition, some mutations interestingly appear to affect the auditory cells primarily.

SNPs of genes associated with kidding under positive selection

According to the screened information on the autosomal mutation sites of these six genes, the recorded polymorphic sites were re-emphasized, and several new polymorphic sites were also proposed ().

Table 1. SNP Details of six reproductive related genes in autosomal sequence.

Positive selection site mutation and protein structure prediction of ITGAV gene

One missense mutation of positive selection sites was detected in ITGAV gene, and the mutation localized in the first exon was shown in . The ITGAV protein in the 13th amino acid occurred a mutation in the high-breeding group from Leu to Pro compared with low-breeding group, and the frequency of allele C of c.38C > T site in high fecundity goat population was 83.3%.

To further study the effect of selection signal on gene function, we predicted the influence of c.38C > T mutation of ITGAV on protein tertiary structure and hydrophilicity of amino acid residues. The results showed that the mutation caused a change of space structure from α- Helix to coil in the region of the mutation and their attachments (). Meanwhile, it also led to the decrease of coefficient of hydrophilicity of amino acid residue from 0.111 to −0.489, resulting in the increase of hydrophilicity of nearby ().

Positive selection site mutation and protein structure prediction of TPRN gene

By selective signal detection, three missense variants were found in positive selection sites in TPRN gene, including three unreported identified SNP sites (one located in the second exon and two located in the 4th exon) (). Among the three mutations, allele A of c.133A > T, which corresponds to the 45th amino acid Thr, had a gene frequency of 83.8% in the high fecundity goats. However, allele T of c.133A > T corresponds to the 45th amino acid Ser in low fecundity goats had a gene frequency of 100%. In addition, in goats with high fecundity, the sites that c.1192A > G (p.398P > R) and c.1250A > C (p.417G > Q) were involved in arginine and glutamine, and the frequency of allele A increased to 75% and 41.6%, respectively.

Figure 3. Result of selection signal detection and protein structure prediction on the TPRN gene of Beichuan-white goats. a) Result of the TPRN gene selection signal. The red dot represents the missense mutation site. The black solid line represents the selection signal screening threshold (FST =0.27). the effect of b) c.133A > T and d) c.1192A > G and c.1250A > C locus variation on the hydrophilicity of 40-60 and 380-440 amino acid sites in TPRN. The black and red lines represent wild-type and mutant TPRN proteins, respectively. The effect of c) c.133A > T and e) c.1192A > G and c.1250A > C locus mutation on the overall and local schematic 3D structure of wild-type and mutant TPRN protein.

Figure 3. Result of selection signal detection and protein structure prediction on the TPRN gene of Beichuan-white goats. a) Result of the TPRN gene selection signal. The red dot represents the missense mutation site. The black solid line represents the selection signal screening threshold (FST =0.27). the effect of b) c.133A > T and d) c.1192A > G and c.1250A > C locus variation on the hydrophilicity of 40-60 and 380-440 amino acid sites in TPRN. The black and red lines represent wild-type and mutant TPRN proteins, respectively. The effect of c) c.133A > T and e) c.1192A > G and c.1250A > C locus mutation on the overall and local schematic 3D structure of wild-type and mutant TPRN protein.

The effect of mutations on the hydrophilic coefficient of amino acid site and 3D structure were analyzed. TPRN mutation showed that the variation of c.133A > T site only increased slightly the hydrophobicity of this site because its structure in this position inward spirals (). The variation of c.1192A > G and c.1250A > C sites increased the hydrophilicity of the surrounding region significantly with the change of protein structure ().

Positive selection site mutation and hydrophilic coefficient prediction of CDH23, CELSR1, LRP4, RYR2 genes

More attention has been paid to the newly discovered genetic missense mutations. A mutation that called c.7640T > C has not been proposed in CELSR1 (), which encodes two amino acids, Val and Ala. This mutation was found to cause a difference in allele frequency. As a consequence, the allele T frequency increased to 50% in high fecundity goats. The conversion from alanine to valine resulted in an increase in the hydrophilicity of the region around this site (). Unfortunately, protein structure predictions of more than 1500 amino acids (AA) by the I-TASSER online program are generally considered inaccurate.

Figure 4. Result of selection signal detection and hydrophilicity coefficient on the CELSR1 gene of Beichuan-white goats. a. Result of the CELSR1 gene selection signal. The red dot represents the missense mutation site. The black solid line represents the selection signal screening threshold (FST =0.27). b. Effect of c.7640T > C locus variation on the hydrophilicity of 2540–2560 amino acid sites in CELSR1. The black and red lines represent wild-type and mutant CELSR1 proteins, respectively.

Figure 4. Result of selection signal detection and hydrophilicity coefficient on the CELSR1 gene of Beichuan-white goats. a. Result of the CELSR1 gene selection signal. The red dot represents the missense mutation site. The black solid line represents the selection signal screening threshold (FST =0.27). b. Effect of c.7640T > C locus variation on the hydrophilicity of 2540–2560 amino acid sites in CELSR1. The black and red lines represent wild-type and mutant CELSR1 proteins, respectively.

Missense mutations at positive selection sites (e.g.rs661695950, rs638523001 and rs652767212) have been put forward in some genes (CDH23, LRP4, RYR2) (, Citation4a, and Citation5a). By predicting the change of hydrophilic coefficient, the results showed that among the reported genes, c.5041C > T caused a small increase of hydrophilicity in the region near this position of CDH23 gene (), while c.5120T > C had the opposite effect on LRP4 gene in high fecundity goats (). In addition, c.2844T > G had no effect on the hydrophilicity of RYR2 gene ().

Discussion

Genome-wide selection signal detection strategy

In the process of evolution, organisms are affected by selection, which manifests as changes in the frequency of alleles in individuals and populations, and manifests as changes in genetic diversity in gene structure. Long-term artificial selection has left genetic footprints in the goat genome adapt to different breeding goals (Citation17). Some pathways that occurred in other species have been found to be frequent targets of selection. Therefore, it is theoretically feasible to identify character-specific selection traits by constructing strategies for population pairs with phenotypic differences. The advent of whole-genome sequencing and increasingly complete surveys of genetic variation help to investigate the regulation of genes under positive selection (Citation18,Citation19). The FST (Fixation Index) statistical approach, a selective sweep method, focuses on the amount of differentiation at a specific site of genome to determine whether selection is acting on the genomic region (Citation20,Citation21). Some genes related to some traits like coat color, growth, muscle development of goats have been identified with the boost of selected sweep analysis by whole-genome sequencing (Citation22). Yao et al. conducted a whole-genome analysis on Huai goats and BoHuai goats, and identified a region on chromosome 7 that exhibited strong selection signals. Several genes related to important traits were detected in this region, including those associated with lipid metabolism (LDLR, STAR, ANGPTL8), reproductive ability (STAR), and disease resistance (CD274, DHPS, PDCD1LG2). These findings provide clues for further research on the important traits of Hua goats and BoHuai goats, and contribute to a better understanding of the genetic mechanisms underlying these traits (Citation22). Chong et al. performed selection signal analysis by dividing Hu sheep into high-fertility and low-fertility groups, and identified 101 reproductive-related genes and 22 positively selected mutation sites (Citation23). Tao et al. based on differences in reproductive ability, performed screening on YunShang black goat and discovered promising genes involved in ovarian function (PPP2R5C, CDC25A, ESR1, RPS26, and SERPINBs), seasonal reproduction (DIO3, BTG1, and CRYM), and metabolism (OSBPL8, SLC39A5, and SERPINBs) (Citation24–27). Based on this, we applied FST method by whole-genome sequencing to assess genetic differentiation parameters between two populations of high and low reproductive Beichung-white goats. We found that several genes carrying selective traits were associated with litter size. Currently, the breeding of Beichuan-white goats mainly focuses on phenotypic selection, which involves selecting based on traits such as birth weight and number of offspring. However, this method has a slow breeding progress due to its susceptibility to environmental and nutritional factors, and it has limited genetic improvement potential. In contrast, the whole-genome selection approach used in this experiment provides a comprehensive examination of genetic variations associated with reproductive traits in goats. This approach enables a more detailed understanding of the underlying genetic mechanisms and accelerates the breeding progress of Beichuan-white goats.

Gene mutations could affect transcription, translation and mRNA transport, and missense mutations may alter the protein structure and lose its function, whereas sometimes the protein function is enhanced, thus improving the production traits of the animal (Citation28–31). An et al. investigated polymorphisms of GnRH1 and GDF9 genes in different goat breeds, and a missense mutation was found in the GDF9 gene had a significant effect on goat litter size by association analysis (Citation32). In addition, two missense mutations (c.1457G > A and c.1645G > A) in PRLR gene have also been found to involve in regulating litter size in goats (Citation33). Therefore, in this study, we focused on missense mutations in genes under the positive selection with a view to identifying proteins with potentially altered functions related to reproduction. This emphasis on functional genetic variation enhances our ability to identify specific genetic changes that may enhance reproductive traits in goats. However, while our study identifies candidate genes and missense mutations associated with reproductive traits, further experimental validation is required to confirm their functional impact on fertility. In addition, investigating gene-environment interactions could provide insights into how genetic variants related to reproduction respond to different environmental conditions, such as nutrition, temperature, and stress.

Among the GO-enriched pathways with missense mutations, embryonic morphogenesis was focused on. Changes in embryo morphogenesis are influenced by the embryonic genome, which is reprogrammed during the transformation of oocytes into embryos and before implantation (Citation34). As a result, we identified ITGAV, LRP4, CDH23, TPRN, RYR2 and CELSR1 genes related to this pathway as candidate genes. Some reports from different animals also confirmed that these genes were involved in reproduction related pathways (Citation35,Citation36).

Genetic effect of c.38C > T mutation in ITGAV gene in goats

The human ITGAV protein, a member of the membrane bound receptor family, is involved in many developmental processes, including extracellular matrix mediated cell adhesion and migration, cytoskeletal organization, cell proliferation, survival and differentiation. It immobilizes cells to various substrates and strengthens external signals through membranes (Citation37). During the early stages of pregnancy in livestock, the endometrium undergoes huge morphological and functional changes, and the embryo is regulated by adhesion so that it can adhere to the uterine wall, and because of this implantation, the placenta is formed. This period is accompanied by dynamic changes in gene expression (Citation38,Citation39). ITGAV is associated with cell adhesion and cell differentiation (Citation40). ITGAV-containing heterodimers may be involved in TGFβ activation during embryo implantation through mechanisms of traction and Intracellular forces, and then active TGFβ could bind and activate TGFβRs at both embryo attachment and nonattachment sites (Citation41). One study showed that placental ITGAV expression in late pregnancy was positively correlated with the number of live births in pigs (Citation42). Therefore, in this study, the missense mutation of ITGAV was subject to positive selection, possibly due to the regulatory role played by this gene in reproduction.

Integrins such as ITGAV may exist in a high-affinity state when the fetus is attached to the maternal side. At this stage, biological forces such as tension, compression and shear forces play an important role in stabilizing the high affinity conformation of ITGAV, which are influenced by dynamic changes in maternal posture, movement and/or fetal position (Citation43). ITGAV was found to increase in the luminal epithelium and trophectoderm of interplacentomal uterine and aggregate with several integrin-related proteins during pregnancy in sheep (Citation44). The c.38C > T site mutation caused the change of the 13th amino acid of ITGAV, for proline was mainly found in high fecundity goats, whereas leucine in the low fecundity goats. Propensity of the change in this site in the high fecundity goats led to an increase in hydrophilicity near the amino acid residue (), suggesting that ITGAV protein folding may be altered, thereby regulating its affinity state. Then, based on prediction, mutations in the ITGAV site were found to affect the 3D structure of the protein. The conformational change of ITGAV may affect its binding with proteases and affinity for fibronectin, and thus affecting cell adhesion. At the same time, cell migration and proliferation were affected by cell adhesion, which may affect embryonic stability (Citation45).

Genetic effect of missense mutation in TPRN gene in goats

The locus TPRN encodes a sensory epithelial protein, and mutations of which cause a progressive course of autosomal recessive nonsyndromic hearing loss (Citation46). Wang et al. found that TPRN was involved in breast cancer prognosis, but the molecular mechanism remained to be elucidated (Citation47). To date, few litter size related functions of TPRN have been reported and it has great potential as a biomarker for reproduction.

Our results showed four missense mutation sites for TPRN in high fecundity goats, one of which has been suggested. However, there was no direct evidence that this change was related to goat reproduction. Further research should be carried out to explore the protein conformation by amino acid transformations (). For example, the 45th amino acid of TPRN was converted from threonine to serine by mutation at the c.133A > T site in high fecundity goats, and these two amino acids are often involved in the phosphorylation of proteins. Threonine and serine are usually attached by the phosphoric group and then activated by kinases, thereby regulating protein function and localization (Citation48). In addition, some conserved amino acid residues (phenylalanine, tyrosine, arginine, etc.) on the protein surface have been identified to interact with DNA, suggesting that the c.1192A > G mutation (R/G) of TPRN in high fecundity goats may be involved in influencing intramolecular and intermolecular communication (Citation49,Citation50).

Genetic effect of c.7640T > C mutation in CELSR1 gene in goats

The directional movements of the cilia enable the ovum to be transported steadily to the ampulla along with the fluid in the oviduct (Citation51). They are regulated both within individual cells and between cells, but the mechanisms of regulation differ between animal species and organs (Citation52). In mammalian ependyma, cilia direction is regulated by planar cell polarity (PCP) factors. CELSR1, a PCP factor and associated with sexual development, is asymmetrically located on the boundaries of each cell and is responsible for coordinating cilia orientation between cells (Citation53). In addition, it could coordinate cilia direction of tissue-wide by aligning microtubule networks between cells. The researchers found that cilia orientation was not coordinated in Celsr1−/− mutant oviducts (Citation54). In this study, CELSR1 was identified as subject to positive selection in high fecundity goats, possibly due to its effect on cilia movements. Another reason contributing to the positive selection of CELSR1 is the importance of CELSR1 for the formation of fold pattern in the oviduct epithelium, which work together with cilia to improve transport efficiency. It was reported that the defect of CELSR1 resulted in ectopically-branched folds in mice and even infertility in humans and mice (Citation55,Citation56).

CELSR proteins consist of a large ectodomain (about 2700 amino acids), seven transmembrane segments and a cytoplasmic tail that is poorly conserved among the protein family. The ectodomain contains a G-protein-coupled receptor (GPCR) proteolytic site (GPS) embedded in the GAIN (GPCR Autoproteolysis INducing) domain (Citation57). The GPS motif is an important element in receptor function (Citation51). The crystal structure indicated that the GPS motif needed to be embedded in the GAIN domain to mediate autoproteolysis instead of acting by itself (Citation57). The c.7640T > C mutation of CELSR1 in high fecundity goats is located in GAIN domain, and the increased hydrophilicity near this site may influence proteolysis and thus regulate function.

Reported missense mutations in other genes

Although the missense mutations of CDH23, LRP4 and RYR2 genes identified in this study have been reported, there was no denying that some hints for our research were obtained by some interesting phenomena. For example, alleles of mutant sites in CDH23 and RYR2 genes (c.5041C > T and c.2844T > G) were completely fixed in highly fecundity goats, which infers a different direction of evolution. On the other hand, the amino acid corresponding to these sites of these genes was altered, whereas the hydrophilicity around these sites changed little, suggesting that they may not be critical to protein function.

Conclusion

In this study, whole genome resequencing and selective signatures were conducted to identify candidate genes associated with kidding ability in phenotypically distinct population pairs of Beichuan white goats (high-fecundity and low-fecundity). The results indicate that 1739 positively selected genes were identified using FST analysis. GO pathway analysis revealed that these positively selected genes were primarily enriched in microtubule-based processes, regulation of ion transmembrane transport, regulation of system processes, and other functions. A total of 106,750 point mutations were found in the positively selected genes, including 485 missense mutations. The pathways involving missense mutations were predominantly associated with differentiation, embryonic morphogenesis, and cell morphogenesis. Importantly, the high fertility of Beichuan-white goats may be linked to specific positively selected genes (ITGAV, LRP4, CDH23, TPRN, RYR2, CELSR1). Among these genes, certain mutations (ITGAV: c.38C > T; TPRN: c.133A > T, c.1192A > G, c.1250A > C; CELSR1: c.7640T > C; RYR2: c.2844T > G; CELSR1: c.7640 > C) increased the hydrophilicity of the mutation sites, while LRP4 (C.5120T > C) decreased hydrophilicity. This study deepened our understanding of the genomic characteristics underlying the kidding traits of Beichuan white goats and provided valuable guidance for further marker-assisted selection breeding and genomic selection breeding in goats. However, future studies incorporating genome-wide data from a larger number of individuals will be necessary to comprehend the complex interplay of factors involved in reproductive evolution.

Authors’ contributions

Conceptualization, Aimin Zhou and Yi Ding; Methodology, Aimin Zhou, Long Xiao and Tingjian Li; Investigation, Xiaohui Zhang and Yadong Liu; Supervision, Long Xiao; Writing – Original Draft, Aimin Zhou and Yi Ding; Writing – Review & Editing, all authors.

Compliance with ethical standards

Ethical approval

The main research project researcher of this work was approved for animal ethics by the Ethical Committee of Mianyang Academy of Agricultural Sciences with permission number MAAS-2022-003. All applicable guidelines for the care and use of animals were followed by the authors.

Consent for publication

The manuscript has been read and approved by all named authors.

Supplemental material

Supplemental Material

Download MS Word (17.4 KB)

Supplemental Material

Download MS Word (27.7 KB)

Supplemental Material

Download MS Word (327.1 KB)

Supplemental Material

Download MS Word (205 KB)

Supplemental Material

Download MS Word (400.7 KB)

Supplemental Material

Download MS Word (116 KB)

Supplemental Material

Download MS Word (66.3 KB)

Acknowledgements

We are thankful to Xunping JIANG and all members of the key laboratory of agricultural animal genetics who provided expertise that greatly assisted the research.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

Supported by the earmarked fund for China Agriculture Research System of MOF and MARA (CARS-38), Sichuan Science and Technology Program (2021YFYZ0003 and 2023YFN0034). The authors are thankful to anonymous, for their valuable comments, suggestions and proofreading of the manuscript.

References

  • Silpa MV, Naicy T, Aravindakshan TV, Radhika G, Boswell A, Mini M. Sirtuin3 (SIRT3) gene molecular characterization and SNP detection in prolific and low prolific goat breeds. Theriogenology. 2018;122:47–52.
  • Henkel J, Saif R, Jagannathan V, et al. Selection signatures in goats reveal copy number variants underlying breed-defining coat color phenotypes. PLoS Genet. 2019;15(12):e1008536.
  • Das A, Shaha M, Gupta MD, Dutta A, Miazi OF. Polymorphism of fecundity genes (BMP15 and GDF9) and their association with litter size in Bangladeshi prolific Black Bengal goat. Trop Anim Health Prod. 2021;53(2):230.
  • Gootwine E. Invited review: Opportunities for genetic improvement toward higher prolificacy in sheep. Small Ruminant Res. 2020;186:106090.
  • Abdolahi S, Shokrollahi B, Saadati N, Morammazi S. No polymorphism of melatonin receptor 1A (MTNR1A) gene was found in Markhoz goat. Vet Med Sci. 2019;5(2):157–161.
  • Kang Z, Bai Y, Lan X, Zhao H. Goat AKAP12: Indel mutation detection, association analysis with litter size and alternative splicing variant expression. Front Genet. 2021;12:648256.
  • Wang J-J, Zhang T, Chen Q-M, et al. Genomic signatures of selection associated with litter size trait in Jining Gray Goat. Front Genet. 2020;11:286.
  • Guan D, Luo N, Tan X, et al. Scanning of selection signature provides a glimpse into important economic traits in goats (Capra hircus). Sci Rep. 2016;6(1):36372.
  • Liu Z, Ji Z, Wang G, Chao T, Hou L, Wang J. Genome-wide analysis reveals signatures of selection for important traits in domestic sheep from different ecoregions. BMC Genomics. 2016;17(1):863.
  • Siddiki AZ, Miah G, Islam MS, et al. Goat genomic resources: The search for genes associated with its economic traits. Int J Genomics. 2020;2020:5940205.
  • Babraham Bioinformatics – FastQC. A quality control tool for high throughput sequence data. http://www.bioinformatics.babraham.ac.uk/projects/fastqc/. Accessed June 12, 2018.
  • Li H, Handsaker B, Wysoker A, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25(16):2078–2079.
  • DePristo MA, Banks E, Poplin R, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011;43(5):491–498.
  • Danecek P, Auton A, Abecasis G, et al. The variant call format and VCFtools. Bioinformatics. 2011;27(15):2156–2158.
  • Rubin CJ, Zody MC, Eriksson J, et al. Whole-genome resequencing reveals loci under selection during chicken domestication. Nature. 2010;464(7288):587–591.
  • Kelley LA, Sternberg MJ. Protein structure prediction on the Web: a case study using the Phyre server. Nat Protoc. 2009;4(3):363–371.
  • Islam R, Li Y, Liu X, et al. Genome-wide runs of homozygosity, effective population size, and detection of positive selection signatures in six Chinese goat breeds. Genes (Basel). 2019;10(11):938.
  • Sabeti PC, Schaffner SF, Fry B, et al. Positive natural selection in the human lineage. Science. 2006;312(5780):1614–1620.
  • Buffalo V, Coop G. Estimating the genome-wide contribution of selection to temporal allele frequency change. Proc Natl Acad Sci USA. 2020;117(34):20672–20680.
  • Wang K, Liu X, Qi T, et al. Whole-genome sequencing to identify candidate genes for litter size and to uncover the variant function in goats (Capra hircus). Genomics. 2021;113(1):142–150.
  • Yao Y, Pan Z, Di R Liu Q, et al. Whole genome sequencing reveals the effects of recent artificial selection on litter size of Bamei mutton sheep. Animals (Basel). 2021;11(1):157.
  • Yao Z, Zhang S, Wang X, et al. Genetic diversity and signatures of selection in BoHuai goat revealed by whole-genome sequencing. BMC Genomics. 2023;24(1):116.
  • Chong Y, Jiang X, Liu G. An ancient positively selected BMPRIB missense variant increases litter size of Mongolian sheep populations following latitudinal gradient. Mol Genet Genomics. 2022;297(1):155–167.
  • Tao L, He XY, Jiang YT, et al. Combined approaches to reveal genes associated with litter size in Yunshang black goats. Anim Genet. 2020;51(6):924–934.
  • Guo J, Tao H, Li P, et al. Whole-genome sequencing reveals selection signatures associated with important traits in six goat breeds. Sci Rep. 2018;8(1):10405.
  • Saif R, Henkel J, Mahmood T, Ejaz A, Ahmad F, Zia S. Detection of whole genome selection signatures of Pakistani Teddy goat. Mol Biol Rep. 2021;48(11):7273–7280.
  • Yang BG, Yuan Y, Zhou DK, et al. Genome-wide selection signal analysis of Australian Boer goat reveals artificial selection imprinting on candidate genes related to muscle development. Anim Genet. 2021;52(4):550–555.
  • Goymer P. Synonymous mutations break their silence. Nat Rev Genet. 2007;8(2):92–92.
  • Minde DP, Anvarian Z, Rüdiger SG, Maurice MM. Messing up disorder: how do missense mutations in the tumor suppressor protein APC lead to cancer? Mol Cancer. 2011;10(1):101.
  • Chen Y, Lu H, Zhang N, Zhu Z, Wang S, Li M. PremPS: Predicting the impact of missense mutations on protein stability. PLoS Comput Biol. 2020;16(12):e1008543.
  • Sallam AM. A missense mutation in the coding region of the toll-like receptor 4 gene affects milk traits in Barki sheep. Anim Biosci. 2021;34(4):489–498.
  • An XP, Hou JX, Zhao HB, et al. Polymorphism identification in goat GNRH1 and GDF9 genes and their association analysis with litter size. Anim Genet. 2013;44(2):234–238.
  • Hou JX, An XP, Han P, Peng JY, Cao BY. Two missense mutations in exon 9 of caprine PRLR gene were associated with litter size. Anim Genet. 2015;46(1):87–90.
  • Peaston AE, Evsikov AV, Graber JH, et al. Retrotransposons regulate host genes in mouse oocytes and preimplantation embryos. Dev Cell. 2004;7(4):597–606.
  • Stenhouse C, Hogg CO, Ashworth CJ. Association of foetal size and sex with porcine foeto-maternal interface integrin expression. Reproduction. 2019;157(4):317–328.
  • Claycombe-Larson KJ, Bundy AN, Kuntz T, et al. Effect of a maternal high-fat diet with vegetable substitution on fetal brain transcriptome. J Nutr Biochem. 2022;108:109088.
  • Sims MA, Field SD, Barnes MR, et al. Cloning and characterisation of ITGAV, the genomic sequence for human cell adhesion protein (vitronectin) receptor alpha subunit, CD51. Cytogenet Cell Genet. 2000;89(3-4):268–271.
  • Lin H, Wang H, Wang Y, Liu C, Wang C, Guo J. Transcriptomic Analysis of the Porcine Endometrium during Embryo Implantation. Genes (Basel). 2015;6(4):1330–1346.
  • Pierzchała M, Pierzchała D, Ogłuszka M, et al. Identification of differentially expressed gene transcripts in porcine endometrium during early stages of pregnancy. Life (Basel). 2020;10(5):68.
  • Frank JW, Seo H, Burghardt RC, Bayless KJ, Johnson GA. ITGAV (alpha v integrins) bind SPP1 (osteopontin) to support trophoblast cell adhesion. Reproduction. 2017;153(5):695–706.
  • Massuto DA, Kneese EC, Johnson GA, et al. Transforming growth factor beta (TGFB) signaling is activated during porcine implantation: proposed role for latency-associated peptide interactions with integrins at the conceptus-maternal interface. Reproduction. 2010;139(2):465–478.
  • Astrof NS, Salas A, Shimaoka M, JianFeng Chen J, Springer TM. Importance of force linkage in mechanochemistry of adhesion receptors. Biochemistry. 2006;45(50):15020–15028.
  • Burghardt RC, Burghardt JR, Taylor JD, et al. Enhanced focal adhesion assembly reflects increased mechanosensation and mechanotransduction at maternal-conceptus interface and uterine wall during ovine pregnancy. Reproduction. 2009;137(3):567–582.
  • Beyeler J, Katsaros C, Chiquet M. Impaired contracture of 3D collagen constructs by fibronectin-deficient murine fibroblasts. Front Physiol. 2019;10:166.
  • Li Y, Pohl E, Boulouiz R, et al. Mutations in TPRN cause a progressive form of autosomal-recessive nonsyndromic hearing loss. Am J Hum Genet. 2010;86(3):479–484.
  • Wang CCN, Li CY, Cai J-H, et al. Identification of prognostic candidate genes in breast cancer by integrated bioinformatic analysis. J Clin Med. 2019;8(8):1160.
  • Wang R, Wang G. Protein modification and autophagy activation. Adv Exp Med Biol. 2019;1206:237–259.
  • Cusack MP, Thibert B, Bredesen DE, del Rio G. Efficient identification of critical residues based only on protein structure by network analysis. PLoS One. 2007;2(5):e421.
  • Yan W, Zhou J, Sun M, Chen J, Hu G, Shen B. The construction of an amino acid network for understanding protein structure and function. Amino Acids. 2014;46(6):1419–1439.
  • Hino T, Yanagimachi R. Active peristaltic movements and fluid production of the mouse oviduct: their roles in fluid and sperm transport and fertilization†. Biol Reprod. 2019;101(1):40–49.
  • Boutin C, Labedan P, Dimidschstein J, et al. A dual role for planar cell polarity genes in ciliated cells. Proc Natl Acad Sci USA. 2014;111(30):E3129–3138.
  • Gross N, Taylor T, Crenshaw T, Khatib H. The intergenerational impacts of paternal diet on DNA methylation and offspring phenotypes in sheep. Front Genet. 2020;11:597943.
  • Usami FM, Arata M, Shi D, et al. Intercellular and intracellular cilia orientation is coordinated by CELSR1 and CAMSAP3 in oviduct multi-ciliated cells. J Cell Sci. 2021;134(4): jcs257006.
  • Shi D, Komatsu K, Hirao M, et al. Celsr1 is required for the generation of polarity at multiple levels of the mouse oviduct. Development. 2014;141(23):4558–4568.
  • Goffinet AM, Tissir F. Seven pass cadherins CELSR1-3. Semin Cell Dev Biol. 2017;69:102–110.
  • Prömel S, Langenhan T, Araç D. Matching structure with function: the GAIN domain of adhesion-GPCR and PKD1-like proteins. Trends Pharmacol Sci. 2013;34(8):470–478.
  • Shashidhar S, Lorente G, Nagavarapu U, et al. GPR56 is a GPCR that is overexpressed in gliomas and functions in tumor cell adhesion. Oncogene. 2005;24(10):1673–1682.