- Research
- Open access
- Published:
Validation of loci and genes associated with fertility in Holstein cows using gene-set enrichment analysis-SNP and genotype-by-sequencing
BMC Genomics volume 26, Article number: 174 (2025)
Abstract
Background
The financial strain fertility issues cause the dairy cattle industry is substantial, with over $7 billion in lost revenue accrued annually due to a relatively low cow conception rate (CCR; 30–43%) for US dairy cows. While CCR has been improving through genomic selection, identification of causal mutations would help improve the rate of genetic progress with genomic selection and provide a better understanding of infertility. The objectives of this study were to: (1) identify genes and gene-sets associated with CCR to the first breeding (CCR1) and the number of breedings required to conceive (TBRD) in Holstein cows and (2) identify putative functional variants associated with CCR1 and TBRD through a custom genotype-by-sequencing array. The study consisted of 1,032 cows (494 pregnant to first breeding, 472 pregnant to subsequent [2–20] services, and 66 that never conceived). Cows were artificially inseminated, and pregnancy was determined 35d later by rectal palpation of uterine contents. Gene-set enrichment analyses with SNP data (GSEA-SNP) were conducted for CCR1 and TBRD with a normalized enrichment score (NES) ≥ 3.0 required for significance. Leading edge genes (LEG) and positional candidate genes from this and 26 additional studies were used to validate 100 loci associated (P < 1 × 10− 5) with cow fertility using a custom sequencing genotyping array of putative functional variants (exons, promoters, splice sites, and conserved regions).
Results
GSEA-SNP identified 95 gene-sets (1,473 LEG) enriched for CCR1 and 67 gene sets enriched (1,438 LEG) for TBRD (NES ≥ 3). Thirty-four gene-sets were shared between CCR1 and TBRD along with 788 LEG. The association analysis for TBRD identified three loci: BTA1 at 83 Mb, BTA1 at 145 Mb, and BTA 20 at 46 Mb (P < 1 × 10− 5). The loci associated with TBRD contained candidate genes with functions relating to implantation and uterine receptivity. No loci were associated with CCR1, however a single locus on BTA1 at 146 Mb trended toward significance with an FDR of 0.04.
Conclusions
The validation of three loci associated with CCR and TBRD in Holsteins can be used to improve fertility through genomic selection and provide insight into understanding infertility.
Background
Revenues from dairy cattle products account for roughly 21.3% ($41.8Â billion) of all animal cash receipts within the United States [1]. Worldwide, global revenue attributed to milk is expected to amount to $350Â billion in 2024 and continue to grow in subsequent years [2]. Prior to lactation and the generation of dairy products, cattle must first conceive and maintain the pregnancy through the normal gestation period, making fertility critical to dairy profitability.
The dairy industry has struggled with fertility for decades, with the average cow conception rate (CCR) ranging between 30–43% [3–4]. Previous studies have estimated the average economic cost of pregnancy loss from breeding to day 60 of gestation in dairy cows to be $2,333 [5]. This cost is dependent on several factors including length of pregnancy, increased chance of culling, and the cost of rebreeding. To put this on a national scale, if 35% of the 9.38 million U.S. cows [6] lost a single pregnancy it would result in a cost of greater than $7.6 billion dollars to the dairy industry. This illustrates the importance of selecting cattle that can conceive and maintain pregnancies.
Prior to the early 2000s, genetic selection in the dairy industry primarily focused on milk yield or milk components [7–8]. Selection for female fertility began with the inclusion of daughter pregnancy rate in 2003 [9], and expanded to include overall heifer conception rate and CCR as well as conception rates to first breeding in 2009 [10]. Additional fertility measures such as early first calving (the age at which a heifer calves) were added to selection indexes in 2019 [11].
Since its inclusion in the Council for Dairy Cattle Breeding genomic evaluations, cow conception rate to the first breeding (CCR1) has improved by 7% points reaching 44%, while CCR to any service has improved by 6% points reaching 42% [4]. This improvement in CCR has been achieved through genomic selection using single nucleotide polymorphisms (SNPs) that are in linkage disequilibrium (LD) with the mutation responsible for the phenotype (causal mutation). To date, few casual variants are known or used in genomic selection for female fertility traits as finding the causal mutation can be both time consuming and challenging. The advantages to identifying the causal mutations is that the accuracy of prediction of fertility will be higher because the prediction is not reliant on the lack of changing LD with successive generations of meiotic events or recombination differences between breeds. Identifying the causal mutation also offers insights into the molecular mechanism of fertility [12].
Given the importance of fertility to the dairy industry and the lack of characterization of casual mutations associated with CCR, the goals of the current study were to: (1) identify leading edge genes and gene sets enriched for CCR and the number of times a cow was bred to achieve a pregnancy (TBRD) using gene set enrichment analysis using SNPs (GSEA-SNP) and (2) perform an association analysis of loci validated for an association with fertility (from at least two independent studies) using a custom genotype by sequencing (GBS) panel to identify putative causal variants associated with the CCR and TBRD in Holstein cows.
Materials and methods
Study population
This study was conducted with the approval of Washington State University’s Institutional Animal Care and Use Committee (#4295) and the authors confirm that this study is reported in accordance with ARRIVE guidelines. The Holstein study population consisted of 2,015 Holstein cows sampled from six dairies located in central Washington [13]. All dairies provided written agreement to participate in the study. No animals were euthanized for this study. Whole blood (~ 16 ml) was collected into EDTA tubes from each cow via venipuncture of the tail vein. Blood was collected from cows without being anesthetized, as approved by the Washington State University’s Institutional Animal Care and Use Committee (#4295). Cows (n = 951) were removed from the study if they experienced an inflammatory event or disease that could potentially have impacted pregnancy retention such as: lameness, fever, mastitis, uterine disease, spontaneous abortions, dystocia, foot disease, pink eye, respiratory disease, and metabolic issues. All cows were artificially inseminated (AI) at either observed estrus or using timed AI. Pregnancy status was determined at 35 days post insemination by rectal palpation of uterine contents. Pregnancy status after d35 was not evaluated for the current study, as this study aims to focus on early embryonic loss not fetal losses. Animals remaining in the study (n = 1064) were genotyped using the Illumina BovineHD Beadchip (San Diego, CA) at Neogen Laboratories (Lincoln, NE). Prior to any analysis, samples were quality control filtered for duplicates and individual call rate (< 90%) with 32 cattle removed. This left a total of 1,032 cows for the study: 494 cows that conceived to the first service, 472 cows that were bred 2–20 times before conceiving, and 66 infertile cows that never conceived after a minimum of 6 AI services (range 6–20 AI attempts) [13]. The phenotypes of CCR1 and TBRD were used for the GSEA-SNP and the association analysis. Phenotypes were coded as follows: CCR1 using 1 (successful conception to first service) and 0 (no conception at first service); and TBRD which used a scale from 1 to 20 depending on the number of inseminations a cow received.
For this study, only first lactation (first parity) cow records were used to ensure the entire study population were the same age for comparison. This data was collected retrospectively through farm management records. The average 305-day milk yields did not differ (P = 0.14) across the different phenotypic groups of cattle. The average 305-day milk yield for the entire population was 28,831 kg, additional milk yield information can be found in Supplemental Table 1. Conception rate did not differ between AI technicians (P > 0.05) or between service sires (P = 0.99) [13]. A total of 435 sires (433 Holstein and 2 Angus bulls) were used across all dairies with an average conception rate of 26.8% [13]. Most cows that failed to conceive to one sire were subsequently rebred to a different service sire reducing the risk for a cow to be categorized as infertile in the TBRD phenotype due solely to a service sire effect. Dairy of origin did have an effect on CCR1 and TBRD (P < 1 × 10− 10) and was included as a covariate in all analyses.
Gene-set enrichment analysis-SNP
The genotypes from the Illumina BovineHD Beadchip (San Diego, CA) at Neogen Laboratories (Lincoln, NE), were used to conduct an association analysis to identify the SNPs that would serve as representatives for the genes in the gene sets [13]. A total of 625,093 SNP were analyzed and mapped to 21,039 protein-coding genes within the ARS-UCD 1.2 genome assembly (https://www.animalgenome.org/repository/cattle/UMC_bovine_coordinates/). SNPs with the greatest evidence of an association with CCR1 or TBRD were used as the gene proxies for the GSEA-SNP. One SNP represented each gene within a 17 kb region up- and down-stream of the gene. This 17 kb region was based on the average haplotype block size of a large (n = 4,800) Holstein cattle population [13,14,15] also genotyped with the BovineHD BeadChip calculated using the method described by Gabriel et al., in 2002 [16]. A single SNP could represent more than one gene if it fell within the parameters outlines above. Genes were ranked by their significance (P-value) for their association with CCR1 or TBRD from the genome wide association analysis.
The GSEA-SNP was performed following the methods of Wang et al. (2007) [17] and five gene set databases were used: Biocarta (217 gene sets; http://www.genecarta.com/), Gene Ontology or GO (3,147 gene sets; http://www.geneontology.com), Kyoto Encyclopedia of Genes and Genomes or KEGG (186 gene sets; http://www.genome.jp.kegg), Protein Analysis Through Evolutionary Relationships or PANTHER (165 gene sets; http://www.pantherdb.org), and Reactome (674 gene sets; http://www.reactome.org). An enrichment score for each gene set was computed using the highest value from the running sum statistics, similar to a weighted Kolmogorov-Smirnov-like statistic [18]. Each gene set received a permuted P-value calculated using 10,000 phenotype-based permutations in GenABEL in R [19]. To account for the varying number of genes within each gene set, a normalized enrichment score (NES) was computed, and enriched gene sets were those that had an NES ≥ 3.0. For enriched gene sets, a list of genes that contributed to the peak NES were identified as leading edge genes (LEG).
Genotype by sequencing
Fertility validated loci were identified by comparing results from twenty-six studies [13, 15, 20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43] (Table 1), as well as the GSEA-SNP from the current study, that utilized a range of fertility phenotypes and cattle breeds. A locus was defined as SNP associated with fertility that were in linkage disequilibrium (LD) with each other (D’ > 0.75). This D’ threshold was used as it falls within ranges previously used to define SNPs within a locus [42]. To identify putative causal mutations in the 202 loci validated by two or more fertility studies, a custom genotyping array was designed after whole genome sequencing was conducted on 24 Holsteins at 10X coverage using the Illumina HiSeq X Ten at the University of Missouri sequencing core. Sequences of the 202 loci were mapped to the ARS-UCD 1.2 reference assembly. There were 5,102 variants identified within the 202 loci. Variants were screened to determine which ones were located in regulatory regions (n = 317), exons (n = 633), splice sites (n = 22), and evolutionarily conserved regions (n = 392). After screening, 1,358 variants within 152 loci were sent for probe design at Neogen Laboratories (Lincoln, NE). Of the variants identified, 284 variants (21%) failed the probe design due to the presence of secondary structures, high or low GC content, or repetitive motifs of the targeted sequence. Probes were successfully designed for 713 variants covering both directions and 361 variants where probes were successfully designed for a single direction. This resulted in the creation of a genotyping array with 1,787 probes which covered 100 of the 152 loci identified across studies. Paired-end 150 bp libraries for 1,064 cows were prepared and analyzed at Neogen laboratories (Lincoln, NE) following Tecan’s targeted genotyping V2 Allegro protocols (Männedorf, Switzerland). Sequencing was performed using an Illumina NextSeq2000 system (San Diego, CA).
Raw sequencing data was processed to create variant call format (VCF) files for the association analysis (Fig. 1). Raw sequencing reads were assessed for quality using the FastQC software [44] and adaptors were trimmed off with Trim Galore and Cutadapt [45, 46]. Files were aligned to the ARS-UCD 1.2 reference assembly using bowtie2 [47], and sorted and indexed using SAMtools [48]. Variant calling was successful for 1,222 or the 1,787 probes using Platypus [49]. The Platypus VCF files were then uploaded into SNP and Variation Suite (SVS) version 8.9.1 (Golden Helix, Bozeman, MT) for analysis. Quality control filtering of SNPs was performed on the 1,222 called variants. SNPs were then removed for call rate < 85% (n = 127) or minor allele frequency < 1% (n = 80), leaving 1,015 SNPs spanning 100 loci that were further analyzed.
Association analysis
Three association analyses were conducted for each phenotype (CCR1 and TBRD) using the GBS variants in conjunction with the Illumina BovineHD BeadChip (San Diego, CA) genotypes the cattle were originally genotyped on. Given that the model of inheritance for cow fertility is unknown, three modes of inheritance were analyzed for each phenotype: additive, dominant, and recessive. SNPs were considered significant if their uncorrected P-value met the Wellcome Trust significance threshold of P < 1 × 10− 5 [50]. The association analyses were performed using SVS software and an efficient mixed-model association expedited (EMMAX) model [51]. This model was described in detail previously [13, 15, 51]. Briefly, the model is described by\(\:\:\varvec{y}=\mathbf{X}\varvec{\upbeta\:}+\varvec{Z}u+\text{ϵ}\), y is the n × 1 vector of observed phenotypes, X is an n × f matrix of fixed effects (f), β is an f × 1 vector containing the fixed effect coefficients, Z is an n × t matrix relating the random effects (t) to the phenotype, and \(\:u\) is the random effect of the mixed model. The model assumes residuals to be independent with an identical distribution such that \(\:Var\left(u\right)=\:{{\sigma\:}_{g}}^{2}\varvec{K}\) and \(\:\left(ϵ\right)=\:{{\sigma\:}_{e}}^{2}\varvec{I}\), and\(\:\:Var\left(y\right)=\:{{\sigma\:}_{g}}^{2}\varvec{Z}\varvec{K}{\varvec{Z}}^{\varvec{{\prime\:}}}+\:{{\sigma\:}_{e}}^{2}\varvec{I}\). For this study, K is a matrix of pairwise genomic relationships and Z is the identity matrix, I [52].
Observed heritability was estimated using a genomic best linear unbiased predictor (GBLUP) [53] analysis using AI-REML (average information algorithm) [54, 55]. While computationally intensive, this method to estimate heritability provides a more accurate estimation of heritability when sample size is limited.
Results
Gene-set enrichment analysis-SNP
The GSEA-SNP identified 95 gene sets with NES ≥ 3 (Supp. Table 2) that were enriched for CCR1. There were 1,473 unique LEG across the 95 gene sets, with 2 LEG, scinderin (SCIN) and villin 1 (VIL1), found in 40 of the 95 gene sets. Gene sets with greatest evidence for enrichment with CCR1 (NES > 4.0) are shown in Table 2. For TBRD, 67 gene sets were enriched (NES ≥ 3; Supp. Table 3) and 4 gene sets had an NES ≥ 4 (Table 3). Two gene sets with NES > 4.0 for TBRD (recruitment of mitotic centrosome proteins and complexes; loss of NLP from mitotic centrosomes) also were enriched for CCR1 with NES > 4.0. There were 1,438 unique LEG for TBRD. A single LEG, Rac family small GTPase 1 (RAC1), was shared in 28 gene sets.
Thirty-four gene sets were enriched in both CCR1 and TBRD (Table 4), which included 788 shared LEG. The three most commonly shared LEG were twinfilin actin binding protein 1 (TWF1) identified in 58 gene sets, RAC1 in 59 gene sets and VIL1 in 61 gene sets that contribute to the function of the cytoskeleton.
Many of the shared gene sets can be grouped into categories based on common functions. These groups include gene sets related to cellular component organization and structure (n = 17), mitotic cell cycle/mitosis (n = 6), RNA and mRNA metabolic processes (n = 3), and signaling pathways (n = 3). The remaining five gene sets have functions that do not fall in any of the above-mentioned categories (Table 4).
Association analyses
There were no SNP associated with CCR1 in any model (P < 1 × 10− 5). However, a single locus on BTA1 at 146 Mb in the additive model had a false discovery rate (FDR) of 0.04, though the P-value did not meet the Wellcome Trust significance threshold. The observed heritability for this model was 0.19 ± 0.05.
The TBRD association analysis identified 4 SNP (3 loci), two on BTA1 and one on BTA20 in the additive model (P < 1 × 10− 5; Table 5). The locus on BTA1 at 146 Mb that was associated with TBRD was the same locus in the additive model of CCR1 that trended toward significance (FDR = 0.04). This locus contained a synonymous variant within disco interacting protein 2 homolog A (DIP2A). The other locus on BTA1 was located at 83 Mb. The two SNP within this locus were a missense and a synonymous variant within the first exon of ENSBTAG00000032217. This missense mutation (rs110205198), which results in an amino acid change from cysteine to serine, was further examined to determine if there were any predicted detrimental effects caused by the amino acid substitution. This was done using the sequencing homology-based tool SIFT (sorts intolerant from tolerant) which predicts if substitutions of amino acids could result in a phenotypic effect of the protein through a change in its structure with values ranging from 0 (intolerable change) to 1 (tolerated change) [56]. The predicted damage score for the amino acid substitution caused by this mutation was only 0.9 (with low confidence) indicating the substitution is likely tolerated. However, the low confidence warning indicates that there was not enough diversity between the sequences to accurately predict altered protein function [56]. The locus on BTA20 contained a variant within a CNV (nsv810323). No SNP were associated with TRBD in the dominant or recessive models and observed heritability for TBRD was 0.18 ± 0.05.
The significant GBS SNP were then evaluated for LD with the original SNP that were used to create the custom assay. When compared to the original nine SNP used in to create the custom assay, the two GBS SNP in the locus on BTA1 at 84 Mb were in LD with only two of the original SNP (D’ > 0.75). The other significant GBS SNP on BTA1 at 146 Mb was in LD with one of the two original SNPs (D’ = 0.80), while the GBS SNP on BTA20 at 46 Mb was in high LD with both of the original SNP used to create the custom assay. The high level of LD with the original markers for the loci on BTA1 @146 Mb and BTA20 at 46 Mb suggests that progress can be made using the SNP on the currently available BeadChip. However, it should be noted that the two GBS SNP from the locus on BTA1 @84 Mb are only in LD with two of the nine original SNP indicting that the new GBS markers would be more useful for selection of fertility in cattle.
Loci associated with fertility in this study were compared to loci reported by Kiser et al. (2019a) [13] using only SNPs on the Illumina BovineHD BeadChip to determine if the GBS markers were helpful in further defining the validated CCR1 and TBRD loci. None of the three loci from the current studies were associated with fertility in Kiser et al. 2019a [13], meaning they were associated with cow fertility for the first time within the current analyses and could provide functional targets for further analysis to identify putative causal mutations. Originally, these loci were identified in two or more of the external fertility analyses investigated [15, 20, 30, 42] and associated with heifer fertility instead of cow fertility.
Discussion
Most of the enriched gene sets have functions that fit within the organization of cellular components. These gene sets contain genes that are expressed within macromolecular complexes of the cell [57]. Gene Ontology’s cellular component designation characterizes genes whose molecular function occurs within a cellular compartment of the cell such as the plasma membrane or the cytoskeleton. The three most identified leading edge genes for CCR1 and TBRD (villin 1 (VIL1), RAC1, and TWF1) were present within the gene sets in the organization of cellular components as they control different aspects of the cytoskeleton [58,59,60]. VIL1 is often expressed on cell borders to help maintain their shape and their adhesion with other cells [61]. and its expression is also used as a marker for cells that have undergone the epithelial-to-mesenchymal transition, which is important for normal female reproductive organ function [62]. Similarly, RAC1 is involved with uterine cell remodeling. A deficiency of RAC1 expression in mice during the peri-implantation period impairs the function of the luminal epithelia reducing uterine receptivity [63]. Additional, several cattle studies have identified associations between several p21 activated kinases genes, which are regulated by RAC1, and fertility traits such as conception rate and pregnancy rate in Holsteins [25, 64]. Twinfilin Actin Binding Protein 1 is a highly conserved actin binding protein that regulates the assembly of the cytoskeleton through the binding of actin monomers [65]. While its function in cattle fertility is not characterized, in mice TWF1 was the most common isoform of twinfilin in embryos as well as in non-muscle cell types within adults [65]. In general, the cytoskeleton plays an important role in all cells and is considered to have three overarching functions: (1) organizing the contents within a cell, (2) enabling the cell to move and change shape, and (3) connecting cells to their external environments [66].
Prior to blastocyst implantation, changes to the actin cytoskeleton of uterine epithelial cells can result in a loss of cell polarity [67, 68]. The loss of cell polarity can result in actin cytoskeleton disorganization leading to abnormal endometrial receptivity and implantation failure. In women, this failure influences the plasma membrane transformation period during the mid-secretory phase of the menstrual cycle [69]. In mice, cytoskeleton disorganization negatively impacts embryo development prior to implantation [70].
The second largest functional category of enriched gene sets is related to mitosis and the cell cycle. Mitotic cell division is an essential process for sustaining life. In the uterus, successful regulation of mitosis is important for fertility for the developing uterus, placenta, and embryo. Dysregulation of mitotic cell division leads to defects in uterine receptivity through alterations to stromal proliferation [71, 72] which could play a role in recurrent implantation failure in women [73]. During mitosis, centrosomes play a crucial role in the formation of the spindle assembly [74]. During reproduction, centrosomes are susceptible to rearrangements and changes to their complex structure can result in aneuploidy, mis-segregation of chromosomes, and destabilization of chromosomal integrity [75, 76]. Issues with centrosomes and their components can also result in oocyte abnormalities. Wang et al. (2001) [76] found that a main indicator of oocyte quality and thus chances of successful fertilization was centrosome integrity and its impact on spindle integrity.
Three gene sets (Neurotrophin Signaling Pathway, Notch Signaling Pathway, and Signaling by Neurotrophin Growth Factors [NGF]) in the signaling pathway category are involved in cellular proliferation and differentiation. Previous studies have found that NGF can increase cell proliferation through Notch signaling [77], and that the inhibition of Notch Signaling can result in the downregulation of NGF during acute inflammatory responses [78]. In addition to their nervous system roles, NGF and Notch Signaling function in ovulation [79] and pregnancy establishment [80], respectively. After systemic injection with NGF, luteinizing hormone and progesterone increase to aid in ovulation as well as establish and maintain pregnancy in llamas [79] and dairy cattle [81]. This increased expression of fertility hormones is suggested to increase uterine receptivity and reproductive efficiency. Notch Signaling has a broad role in female fertility from prenatal development, the estrus cycle, implantation and pregnancy [80]. Within the female reproductive tract, Notch Signaling activation is influenced by progesterone levels and has a role in pre-implantation angiogenesis by influencing the vasculature of the endothelial cells [80, 82]. Additionally, inhibition of Notch Signaling within the endometrium has been linked to repeated implantation failure, endometriosis, and polycystic ovary syndrome which cause infertility in women [83].
Gene sets enriched for CCR1 and TBRD that are related to RNA and mRNA (Table 4) could have several different influences on uterine receptivity and pregnancy maintenance. For example, some RNA and mRNAs are known to influence uterine receptivity during implantation through the regulation of Wnt signaling [84]. In humans, a meta-analysis on genes that were differentially expressed during embryo implantation and/or the endometrium identified 39 mRNA genes as uterine receptivity markers for uterine receptivity [85]. These genes have functions related to the immune response, exosomes, and the complement cascade pathway during successful pregnancies [85]. Alternative splicing of RNA and mRNA can also influence uterine receptivity. In mice, alternative splicing of epithelial splicing regulatory protein 1 (Esrp1) is associated with fertility, whereas Esrp1knockout female mice are infertile. Female Esrp1−/− mice present with smaller ovaries and have impaired ovulation [86]. In humans, alternative splicing helps regulate gene expression within the myometrium to maintain quiescence throughout gestation [87]. Proteins involved in alternative splicing regulation like SF2/ASF and hnRNPA1 upregulate the production of certain GTP-binding protein isoforms during pregnancy leading to elevated cAMP levels and myometrial quiescence [88–89]. Alterations to RNA and mRNA processing and splicing have the potential to alter uterine receptivity and ultimately pregnancy success.
The association analyses identified 3 loci associated with CCR (P < 1 × 10− 5) from the 100 loci GBS panel. Two of the associated loci contained genes. Disco Interacting Protein 2 Homolog A (DIP2A) has potential functions in uterine receptivity while the function of ENSBTAG00000032217 is currently uncharacterized. However, ENSBTAG00000032217 is an ortholog to L-lactate dehydrogenase which is an enzyme that catalyzes the conversion of pyruvate to lactate as it converts NAD+ to NADH to produce energy in nearly all living cells. Previous work in mice has shown that DIP2A is expressed in the endometrium [90], and in bovine conceptuses during pregnancy recognition [91]. The DIP2A protein is also a receptor for follistatin which is essential for uterine receptivity [92, 93]. A successful pregnancy hinges on the ability of the embryo to implant into the endometrium. Further functional analyses are needed to determine how and if the variants identified in this study might influence uterine receptivity, implantation, and ultimately conception rate in dairy cows.
The observed heritability estimates from this study were moderate (CCR1 0.19 ± 0.05; TBRD 0.18 ± 0.05). When compared to the original heritability estimates calculated by Kiser et al. (2019a) [13] based on genome wide association analysis using 625,093 SNPs (0.56 ± 0.06 for CCR1 and 0.42 ± 0.07 for TBRD), the estimates from the current study were lower but similar to the heritability reported by others [94,95,96,97]. However, the current study heritability estimates confirm that significant improvement in fertility can be achieved by selecting for fertility traits. As one might expect, when the genetic correlation between CCR1 and TBRD was calculated it results in a genetic correlation of -1. This indicates that as the number of breedings needed to result in pregnancy increases, the conception rate to the first service decreases. Identifying variants and genes that are associated with both traits will help improve overall fertility within the industry.
The gene-sets, leading edge genes from the GSEA-SNP, the positional candidate genes, and the putative functional variants from the association study have potential roles in uterine receptivity and embryo implantation. Of the two loci that harbor positional candidate genes, one missense variant was located within the currently uncharacterized ENSBTAG00000032217, while two synonymous mutations were found within DIP2A and ENSBTAG00000032217. While the one missense variant is not thought to impact the function of the ENSBTAG00000032217 protein, additional investigation is needed to confirm this due to the low confidence warning in the SIFT prediction.
Exploring putative causal mutations by genotyping through sequencing aided in defining the loci associated with fertility in dairy cattle. These results have practical implications in genomic selection as they could be incorporated into current commercial genotyping panels to facilitate genomic predicted transmitting abilities. This improvement in fertility would help alleviate financial losses dairy producers incur from failed pregnancies and increase their efficiency and profitability.
Data availability
Data generated and/or analysed during the current study are available in the CattleQTLdb repository (https://www.animalgenome.org/cgi-bin/QTLdb/BT/pubtails? PUBMED_ID=31718557) and the GBS data is pending review and acceptance by USDA Ag Data Commons (https://figshare.com/s/a748d0d75fbaf4303648).
Abbreviations
- AI:
-
Artificial insemination
- CCR:
-
Cow conception rate
- CCR1:
-
Cow conception rate to first service
- EMMAX:
-
Efficient mixed-model association expedited
- FDR:
-
False discovery rate
- GBLUP:
-
Genomic best linear unbiased predictor
- GBS:
-
Genotype by sequencing
- GSEA-SNP:
-
Gene-set enrichment analysis with SNP data
- LEG:
-
Leading edge gene
- NES:
-
Normalized enrichment score
- NGF:
-
Neurotrophin growth factors
- SNP:
-
Single nucleotide polymorphism
- SVS:
-
SNP variation suite
- TBRD:
-
Number of times bred to conception
- VCF:
-
Variant call format
References
USDA, Economic Research Service. Farm income and wealth statistics. 2023. https://data.ers.usda.gov/reports.aspx?ID=17845
Statista. Milk - Worldwide. Retrieved October 15, 2024, from https://www.statista.com/outlook/cmo/food/dairy-products-eggs/milk/worldwide
Fernandez-Novo A, Fargas O, Loste JM, Sebastian F, Perez-Villalobos N, Pesantez-Pacheco JL, et al. Pregnancy loss (28–110 days of pregnancy) in Holstein cows: a retrospective study. Animals. 2020;10(6):925.
Council on Dairy Cattle Breeding. 2022. CDCB Conception rate in 2022 for all breeds of cows. https://webconnect.uscdcb.com/#/national-performance-metrics. Accessed 13 September 2024.
Lee J, Kim I. Pregnancy loss in dairy cows: the contributing factors, the effects on reproductive performance and the economic impact. J Vet Sci. 2007;8(3):283–8.
Quick Stats. United States Department of Agriculture National Agriculture Statistics Service. https://quickstats.nass.usda.gov/. Accessed 27 July 2023.
Crowe MA. Fertility in dairy cows– the conference in perspective. In Fertility in Dairy Cows bridging the gaps. Eds Royal MD, Friggens NC and Smith RF. UK: British Society of Animal Science / Cambridge University Press. 2007;156–60.
Berry DP, Wall E, Pryce JE. Genetics and genomics of reproductive performance in dairy and beef cattle. Animal. 2014;8(S1):105–21.
VanRaden PM, Sanders AH, Tooker ME, Miller RH, Norman HD. Daughter pregnancy rate evaluation of cow fertility. 2002. https://aipl.arsusda.gov/reference/fertility/DPR_rpt.htm#announce. Accessed 3 July 2023.
Norman HD, VanRaden RM, Fok G, Hutchison J. Conception rate evaluations for males and females. In: Changes to evaluations system. 2010. https://aipl.arsusda.gov/reference/changes/eval1008.html. Accessed 3 July 2023.
Norman HD, Hutchison J. New trait: Early first calving. 2019. https://queries.uscdcb.com/News/CDCB Connection Early First Calving 03_2019.pdf. Accessed 10 July 2023.
Misztal I, Aguilar I, Lourenco D, Ma L, Steibel JP, Toro M. Emerging issues in genomic selection. J Anim Sci. 2021;99(6):skab092.
Kiser JN, Clancey E, Moraes JGN, Dalton J, Burns GW, Spencer TE, Neibergs HL. Identification of loci associated with conception rate in primiparous holstein cows. BMC Genomics. 2019a;20:840.
Neibergs HL, Seabury CM, Wojtowicz AJ, Wang Z, Scraggs E, Kiser JN, et al. Susceptibility loci revealed for bovine respiratory disease complex in pre-weaned holstein calves. BMC Genomics. 2014;15:1164.
Kiser JN, Keuter EM, Seabury CM, Neupane M, Moraes JGN, Dalton J, et al. Validation of 46 loci associated with female fertility traits in cattle. BMC Genomics. 2019b;20:576.
Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, et al. The structure of haplotype blocks in the human genome. Science. 2002;296:2225–9.
Wang K, Li M, Bucan M. Pathway-based approaches for analysis of genomewide association studies. Am J Hum Genet. 2007;81:1278–83. https://doiorg.publicaciones.saludcastillayleon.es/10.1086/522374
Holden M, Deng S, Wojnowski L, Kulle B. GSEA-SNP: applying gene set enrichment analysis to SNP data from genome-wide association studies. Bioinformatics. 2008;24:2784–5. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/bioinformatics/btn516
Aulchenko YS, Ripke S, Isaacs A, van Duijn CM. GenABEL: an R library for genome-wide association analysis. Bioinformatics. 2007;23:1294–6. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/bioinformatics/btm108
Höglund JK, Guldbrandsten B, Su G, Thomsen B, Lund MS. Genome scan detects quantitative trait loci affecting female fertility traits in Danish and Swedish holstein cattle. J Dairy Sci. 2009;92(5):2136–43.
Huang W, Kirkpatrick BW, Rosa GJM, Khatib H. A genome-wide association study using selective DNA pooling identifies candidate markers for fertility in Holstein cattle. Anim Genet. 2010;41(6):570–8.
Pryce JE, Bolormaa S, Chamberlain AJ, Bowman PJ, Savin K, Goddard ME, Hayes BJ. A validated genome-wide association study in 2 dairy cattle breeds for milk production and fertility traits using variable length haplotypes. J Dairy Sci. 2010;93:3331–45.
Sahana G, Guldbrandtsen B, Bendixen C, Lund MS. Genome-wide association mapping for female fertility traits in Danish and Swedish holstein cattle. Anim Genet. 2010;41(6):579–88.
Blaschek M, Kaya A, Zwald N, Memili E, Kirkpatrick BW. A whole-genome association analysis of noncompensatory fertility in Holstein bulls. J Dairy Sci. 2011;94(9):4695–9.
Cole JB, Wiggans GR, Ma L, Sonstegard TS, Lawlor TJ Jr, Crooker BA, Van Tassell CP, Yang J, Wang S, Matukumalli L, Da Y. Genome-wide association analysis of thirty one production, health, and reproduction and body conformation traits in contemporary US Holstein cows. BMC Genomics. 2011;12:408.
Olsen HG, Hayes BJ, Kent MP, Nome T, Svendsen M, Larsgard AG, Lien S. Genome-wide association mapping in Norwegian red cattle identifies quantitative trait loci for fertility and milk production on BTA12. Anim Genet. 2011;42(5):466–74.
Cochran SD, Cole JB, Null DJ, Hansen PJ. Discovery of single nucleotide polymorphisms in candidate genes associated with fertility and production traits in Holstein cattle. BMC Genet. 2013;14:49.
Minozzi G, Nicolazzi EL, Stella A, Biffani S, Negrini R, Lazzari B, Ajmone-Marsan P, Williams JL. Genome wide analysis of fertility and production traits in Italian holstein cattle. PLoS ONE. 2013;8:e80219.
Minten MA, Bilby TR, Bruno RGS, Allen CC, Madsen CA, Wang Z, Sawyer JE, Tibary A, Neibergs HL, Geary TW, Bauersachs S, Spencer TE. Effects of fertility on gene expression and function of the bovine endometrium. PLoS ONE. 2013;8:e69444.
Höglund JK, Sahana G, Guldbrandtsen B, Lund MS. Validation of associations for female fertility traits in Nordic Holstein, Nordic Red, and Jersey dairy cattle. BMC Genet. 2014;15:8.
Akanno EC, Plastow G, Fitzsimmons C, Miller SP, Baron V, Ominski K, et al. Genome-wide association for heifer reproduction and calf performance traits in beef cattle. Genome. 2015;58(12):549–57.
Höglund JK, Buitenhuis B, Guldbrandtsen B, Lund MS, Sahana G. Genome-wide association study for female fertility in Nordic Red cattle. BMC Genet. 2015;16:110.
Iso-Touru T, Sahana G, Guldbrandtsen B, Lund MS, Vikki J. Genome-wide association analysis of milk yield traits in Nordic Red cattle using imputed whole genome sequence variants. BMC Genet. 2016;17:55.
Moore SG, Pryce JE, Hayes BJ, Chamberlain AJ, Kemper KE, Berry DP, McCabe M, Cormican P, Lonergan P, Fair T, Butler ST. Differentially expressed genes in endometrium and corpus luteum of Holstein cows selected for high and low fertility are enriched for sequence variants associated with fertility. Biol Reprod. 2016;94:19.
Nayeri S, Sargolzaei M, Abo-Ismail MK, May N, Miller SP, Schenkel F, Moore SS, Stothard P. Genome-wide association for milk production and female fertility traits in Canadian dairy Holstein cattle. BMC Genet. 2016;17:75.
Ortega MS, Denicol AC, Cole JB, Null DJ, Hansen PJ. Use of single nucleotide polymorphisms in candidate genes associated with daughter pregnancy rate for prediction of genetic merit for reproduction in Holstein cows. Anim Genet. 2016;47(3):288–97.
Liu A, Wang Y, Sahana G, Zhang Q, Liu L, Lund MS, Su G. Genome-wide association studies for female fertility traits in Chinese and nordic holsteins. Sci Rep. 2017;7:8487.
Müller MP, Rothammer S, Seichter D, Russ I, Hinrichs D, Tetens J, Thaller G, Medugorac I. Genome-wide mapping of 10 calving and fertility traits in Holstein dairy cattle with special regard to chromosome 18. J Dairy Sci. 2017;100(3):1987–2006.
Neupane M, Geary TW, Kiser JN, Burns GW, Hansen PJ, Spencer TE, Neibergs HL. Loci and pathways associated with uterine capacity for pregnancy and fertility in beef cattle. PLoS ONE. 2017;12(12):e0188997.
Ortega MS, Denicol AC, Cole JB, Null DJ, Taylor JF, Schnabel RD, Hansen PJ. Association of single nucleotide polymorphisms in candidate genes previously related to genetic variation in fertility with phenotypic measurements of reproductive function in Holstein cows. J Dairy Sci. 2017;100(5):3725–34.
Fonseca PAS, Id-Lahoucine S, Reverter A, Medrano JF, Fortes MS, Casellas J, et al. Combining multi-OMICs information to identify key-regulator genes for pleiotropic effect on fertility and production traits in beef cattle. PLoS ONE. 2018;13:e0205295.
Galliou JM, Kiser JN, Oliver KF, Seabury CM, Moraes JGN, Burns GW, et al. Identification of loci and pathways associated with heifer conception rate in U.S. Holsteins. Genes. 2020;11(7):767.
Oliver KF, Geary TW, Kiser JN, Galliou JM, Van Emon ML, Seabury CM, et al. Loci associated with conception rate in crossbred beef heifers. PLoS ONE. 2020;15(4):e0230422.
Andrews S, FastQC. A quality control tool for high throughput sequence data. 2010. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet Jounral. 2011;17:10–2. https://doiorg.publicaciones.saludcastillayleon.es/10.14806/ej.17.1.200
Babraham Bioinformatics - Trim Galore. https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/. Accessed 3 January 2023. https://doiorg.publicaciones.saludcastillayleon.es/10.5281/zenodo.7598955
Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–9.
Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, et al. Twelve years of SAMtools and BCFtools. Gigascience. 2021;10(2):giab008.
Rimmer A, Phan H, Mathieson I, Iqbal Z, Tqigg SRF, WGS500 Consortium. Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications. Nat Genet. 2014;46(8):912–8.
Wellcome Trust Case Control Consortium. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 2007;447:661–78.
Kang HM, Sul JH, Service SK, Zaitlen NA, Kong SY, Freimer NB, et al. Variance component model to account for sample structure in genome-wide association studies. Nat Genet. 2010;42(4):348–54.
Kang HM, Zaitlen NA, Wade CM, Kirby A, Heckerman D, Daly MJ, Eskin E. Efficient control of population structure in model organism association mapping. Genetics. 2008;178(3):1709–23.
Taylor J. Implementation and accuracy of genomic selection. Aquaculture. 2014;420–421:S8–14.
Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet. 2011;88:76–82.
Lee SH, Yang J, Goddard ME, Visscher PM, Wray NR. Bioinformatics. 2012;28:2540–2.
Ng PC, Henikoff S. SIFT: predicting amino acid changes that affect protein function. Nucleic Acids Res. 2003;31(13):3812–4.
Thomas PD. The Gene Ontology and the meaning of biological function. Methods Mol biol. 2017;1446:15–24.
Bosco EE, Mulloy JC, Zheng Y. Rac1 GTPase: a rac of all trades. Cell Mol Life Sci. 2009;66(3):370–4.
Nakamura E, Satoh T, Iwakawa M, Nakawatari M, Oki A, Matsumoto K, et al. Villin 1, a diagnostic marker for endometrial adenocarcinoma with high grade nuclear atypia. Cancer Biol Ther. 2011;12(3):181–90.
Huo G, Wang Y, Chen J, Song Y, Zhang C, Guo H, et al. A pan-cancer analysis of the oncogenic role of twinfilin actin binding protein in human tumors. Front Oncol. 2021;11:692136.
Khurana S, George SP. Regultion of cell structure and function by actin-binding proteins: Villin’s perspective. FEBS Lett. 2008;582(14):2128–39.
Bilyk O, Coathman M, Jewer M, Postovit L. Epithelial-to-mesenchymal transition in the female reproductive tract: from normal functioning to disease pathology. Front Onco. 2017;7:145.
Tu Z, Wang Q, Cui T, Wang J, Ran H, Bao H, et al. Uterine RAC1 via Pak1-ERM signaling directs normal luminal epithelial integrity conducive to on-time embryo implantation in mice. Cell Death Differ. 2016;23:169–81.
Jiang J, Ma L, Prakapenka D, VanRaden PM, Cole JB, Da Y. A large-scale genome-wide association study in U.S. Holstein cattle. Front Genet. 2019;14:10.
Vartiainen MK, Sarkkinen EM, Matilainen T, Salminen M. Mammals have two twinfilin isoforms whose subcellular localizations and tissue distributions are differentially regulated. J Biol Chem. 2003;278(36):34347–55.
Fletcher DA, Mullins RD. Cell mechanics and the cytoskeleton. Nature. 2010;463(7280):485–92.
Luxford KA, Murphy CR. Cytoskeletal alterations in the microvilli of uterine epithelial cells during early pregnancy. Acta Histochem. 1989;87(2):131–6.
Jalali BM, Likszo P, Andronowska A, Skarzynski DJ. Alterations in the distribution of actin and its binding proteins in the porcine endometrium during early pregnancy: possible role in epithelial remodeling and embryo adhesion. Theriogenology. 2018;116:17–27.
Whitby S, Zhou W, Dimitriadis E. Alterations in epithelial cell polarity during endometrial receptivity a systematic review. Front Endocinol. 2020;11:596324.
Tan K, An L, Wang S, Wang Z, Zhang Z, Miao K, et al. Actin disorganization plays a vital role in impaired embryonic development of in vitro-produced mouse preimplantation embryos. PLoS ONE. 2015;10(6):e0130382.
Tsai C, Chou C, Yang C, Lai Y, Liang C, Chen C, Tsai T. Hurp deficiency in mice leads to email infertility caused by an implantation defect. J Biol Chem. 2008;283(39):26302–6.
Winuthayanon W, Hewitt SC, Korach KS. Uterine epithelial cell estrogen receptor alpha-dependent and -independent genomic profiles that underlie estrogen responses in mice. Biol Repro. 2014;91(5):110.
Zhou X, Cao Y, Zhou M, Han M, Liu M, Hu Y, et al. Decreased CD44v3 expression impairs endometrial stromal cell proliferation and decidualization in women with recurrent implantation failure. Reprod Biol Endorinol. 2022;20:170.
Barra V, Fachinetti D. The dark side of centromeres: types, causes and consequences of structural abnormalities implicating centromeric DNA. Nat Commun. 2018;9:4340.
McKinley KL, Cheeseman IM. The molecular basis for centromere identity and function. Nat Rev Mol Cell Biol. 2016;17(1):16–29.
Wang WH, Meng L, Hackett RJ, Oldenbourg R, Keefe DL. The spindle observation and its relationship with fertilization after intracytoplasmic sperm injection in living human oocytes. Fertil Steril. 2001;75(2):348–53.
Park JC, Chang IB, Ahn JH, Kim JH, Song JH, Moon SM, Park Y. Nerve growth factor stimulates glioblastoma proliferation through Notch1 receptor signaling. J Korean Neurosurg Soc. 2018;61(4):441–9.
Yin J, Hu H, Li X, Xue M, Cheng W, Wang Y, et al. Inhibition of notch signaling pathway attenuates sympathetic hyperinnervation together with the augmentation of M2 macrophages in rats post-myocardial infarction. Am J Physiol Cell Physiol. 2016;310:C41–53.
Ratto MH, Huanca W, Singh J, Adams GP. Comparison of the effect of ovulation-inducing factor (OIF) in the seminal plasma of llamas, alpacas, and bulls. Theriogenology. 2006;66(5):1102–6.
Moldovan GE, Miele L, Fazleabas AT. Notch signaling in reproduction. Trends Endocrinol Metab. 2021;32(12):1044–57.
Gajardo G, Paiva L, Ulloa-Leal C, Valderrama X, López G, Carrasco A, et al. Administration of beta-nerve growth factor during preovulatory stage improves endocrine and luteal function in dairy heifers. Animals. 2023;13(6):1004.
Shawber CJ, Lin L, Gnarra M, Sauer MV, Papaioannou, Kitajewski JK, Douglas NC. Vascular notch proteins and notch signaling in the peri-implantation mouse uterus. Vasc Cell. 2015;7:9.
Amjadi F, Salehi E, Zandieh Z, Rashidi M, Taleahmad S, Javedani masrour M, et al. Comparative evaluation of NOTCH signaling molecules in the endometrium of women with various gynecological diseases during the window of implantation. Iran J Basic Med Sci. 2019;22(4):426–31.
Shekibi M, Heng S, Nie G. MicroRNAs in the regulation of endometrial receptivity for embryo implantation. Int J Mol Sci. 2022;23(11):6210.
Altmäe S, Koel M, Võsa U, Adler P, Suhorutšenko M, Laisk-Podar T, et al. Meta-signature of human endometrial receptivity: a meta-analysis and validation study of transcriptomic biomarkers. Nature. 2017;7:10077.
Yu L, Zhang H, Guan Z, Qin D, Zhou J, Wu X. Alternative splicing in the germline: ESRP1 essential for female fertility. Development. 2021;148(2):e148e0202.
Tyson-Capper AJ. Alternative splicing: an important mechanism for myometrial gene regulation that can be manipulated to target specific genes associated with preterm labour. BMC Pregnancy Childbirth. 2007;7(Suppl 1):S13.
Europe-Finner GN, Phaneuf S, Tolkovsky AM, Watson SP, López Bernal A. Down-regulation of G alpha s in human myometrium in term and preterm labor: a mechanism for parturition. J Clin Enocrinol Metab. 1994;79(6):1835–9.
Pollard AJ, Krainer AR, Robson SC, Europe-Finner GN. Alternative splicing of the adenylyl cyclase stimulatory G-protein G alpha(s) is regulated by SF2/ASF and heterogeneous nuclear ribonucleoprotein A1 (hnRNPA1) and involves the use of an unusual TG 3’-splice site. J Biol Chem. 2002;227(18):15241–51.
Zhang L, Mabwi HA, Palange NJ, Jia R, Ma J, Bah FB, et al. Expression patterns and potential biological roles of Dip2a. PLoS ONE. 2015;10(11):e0143284.
Estepa IM, Tinning H, Vasconcelos EJR, Fernandez-Fuertes B, Sánchez JM, Burns GW, et al. Protein synthesis by day 16 bovine ceptuses during the time of maternal recognition of pregnancy. Int J Mol Sci. 2020;21(8):2870.
Ouchi N, Asaumi Y, Ohashi K, Higuchi A, Sono-Romanelli, Oshima Y, Walsh K. DIP2A functions as a FSTL1 receptor. J Biol Chem. 2010;285(10):7127–34.
Fullerton PT Jr, Monsivais D, Kommagani R, Matzuk MM. Follistatin is critical for mouse uterine receptivity and decidualization. Proc Natl Acad Sci USA. 2017;114(24):E4772–81.
Kuhn MT, Hutchison JL, Wiggans GR. Characterization of Holstein heifer fertility in the United States. J Dairy Sci. 2006;89:4907–20.
Tiezzi F, Maltecca C, Cecchinato A, Penasa M, Bittante G. Genetic parameters for fertility in dairy heifers and cows at different parties and relationships with production traits in first lactation. J Dairy Sci. 2012;95:7355–62.
Bowley FE, Green RE, Amer PR, Meier S. Novel approaches to genetic analysis of fertility traits in New Zealand dairy cattle. J Dairy Sci. 2015;98(3):2005–12.
Seabury CM, Smith JL, Wilson ML, Bhattarai E, Santos JEP, Chebel RC, et al. Genome-wide association and genomic prediction for a reproductive index summarizing fertility outcomes in U.S. Holsteins. G3. 2023;13(9):jkad043.
Acknowledgements
The authors would like to acknowledge the central Washington dairies for their assistance in collecting phenotypes and genotypes, as well as Neogene for their help designing the custom assay.
Funding
This project was supported by Agriculture and Food Research Initiative Competitive Grant no. 2013-68004-20365 from the USDA National Institute of Food and Agriculture. The funding body played no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Author information
Authors and Affiliations
Contributions
HN. TS, and CS were responsible for the conception and design of the study. HN, JK, MN, JGNM, GB and TS were involved in the acquisition of data. JK, AH, and CS were responsible for the analysis of the data. JK and HN were involved in the interpretation of the data. JK and HN were involved in the drafting the manuscript. JK, HN, JD, MN, JGNM, and AH were involved in revision of the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
This study’s procedures were conducted with the approval of Washington State University’s Institutional Animal Care and Use Committee (#4295) and each dairy owner provided written agreement to participate in the study. This study is reported in accordance with ARRIVE guidelines (https://arriveguidelines.org).
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Supplementary Material 1
: Supplemental_Tables.xlsx: Additional file 1 (.xlsx): Table S1. Milk yield data. Table S2. Gene sets associated with cow conception rate to the first breeding in Holstein cows. Table S3. Gene sets associated with number of breedings to conception in Holstein cows.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Kiser, J.N., Seabury, C.M., Neupane, M. et al. Validation of loci and genes associated with fertility in Holstein cows using gene-set enrichment analysis-SNP and genotype-by-sequencing. BMC Genomics 26, 174 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12864-025-11364-9
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12864-025-11364-9