Skip to main content

A multi-tissue and -breed catalogue of chromatin conformations and their implications in gene regulation in pigs

Abstract

Background

Topologically associating domains (TADs) are functional units that organize chromosomes into 3D structures of interacting chromatin, and play a crucial role in regulating gene expression by constraining enhancer-promoter contacts. Evidence suggests that deletion of TAD boundaries can lead to aberrant expression of neighboring genes. In our study, we analyzed high-throughput chromatin conformation capture (Hi-C) datasets from publicly available sources, integrating 71 datasets across five tissues in six pig breeds.

Results

Our comprehensive analysis revealed 65,843 TADs in pigs, and we found that TAD boundaries are enriched for expression Quantitative Trait Loci (eQTL), splicing Quantitative Trait Loci (sQTL), Loss-of-Function variants (LoFs), and other regulatory variants. Genes within conserved TADs are associated with fundamental biological functions, while those in dynamic TADs may have tissue-specific roles. Specifically, we observed differential expression of the NCOA2 gene within dynamic TADs. This gene is highly expressed in adipose tissue, where it plays a crucial role in regulating lipid metabolism and maintaining energy homeostasis. Additionally, differential expression of the BMPER gene within dynamic TADs is associated with its role in modulating the activities of bone morphogenetic proteins (BMPs)—critical growth factors involved in bone and cartilage development.

Conclusion

Our investigations have shed light on the pivotal roles of TADs in governing gene expression and even influencing traits. Our study has unveiled a holistic interplay between chromatin interactions and gene regulation across various tissues and pig breeds. Furthermore, we anticipate that incorporating markers, such as structural variants (SVs), and phenotypes will enhance our understanding of their intricate interactions.

Peer Review reports

Background

In eukaryotic cell nuclei, genomic DNA is organized into three-dimensional (3D) structures at various scales. In 2009, the development of high-throughput chromatin conformation capture (Hi-C) technology enabled the genome-wide identification of interacting fragments by combining second-generation sequencing with chromosome conformation capture (3 C) and molecular labeling techniques [1]. Hi-C has provided insights into chromatin interactions, revealing spatial hierarchical structures such as chromosome territories, compartments, and topologically associated domains (TADs) [2], as well as long-range interactions that play crucial roles in transcriptional regulation [3]. TADs are discrete units of folded chromatin ranging in size from 200 kb to 1 Mb and serve as independent regulatory domains characterized by extensive self-interactions [4]. Studies in model organisms such as Drosophila have demonstrated a strong correlation between TADs and functional epigenetic domains defined by chromatin marks [4]. The structural features and mechanisms underlying TAD formation and the regulation of local gene expression are being elucidated [2, 4,5,6,7,8,9]. For example, CCCTC-binding factors (CTCFs) and cohesion proteins, which are abundant at TAD boundaries, have been identified as essential for TAD localization and structural stability [10]. TAD boundaries are more evolutionarily conserved than the rest of the TADs and are enriched with CTCF and housekeeping genes, imposing genetic constraints on TADs [11, 12]. Moreover, TAD boundaries exhibit enrichment in various epigenetic makers such as histone modifications, DNA methylation sites, transcription start sites (TSSs), and transfer RNA (tRNA), which are closely associated with epigenetic regulation of transcriptional activity [8]. Disrupting the boundaries thus disrupts the interactions between protein-coding genes and their enhancers, leading to a decrease in gene expression levels [13]. These studies have also highlighted the crucial roles of TAD boundaries in gene expression regulation of human and animals.

The domestic pig (Sus scrofa) serves as a significant source of meat worldwide and is widely used as an animal model for human diseases and xeno-transplantation [14,15,16,17]. Previous projects like ENCODE, Roadmap Epigenomics, and Functional Annotation of Animal Genomes (FAANG) have identified cis-regulatory elements involved in gene expression regulation in human, cattle and pig [18,19,20,21,22,23]. Studies have revealed that changes in 3D chromatin structure during growth and development, as well as long-range interactions within specific breeds or tissues, play regulatory roles in gene expression [24]. In a comparison of Bama pigs and wild boar, chromatin architecture contributes to determining the regulatory mechanism of phenotypic differences between Bama pigs and wild boar [25]. Furthermore, the chromatin architecture had contributed to the analyses of pig phenotypes [9, 26]. Research provides valuable comparative epigenetic data, such as Hi-C, relevant to using pigs as models in human biomedical research [27]. However, a comprehensive characterization of chromatin architecture associated with genes regulation and even traits in pigs is lacking.

Therefore, we compiled a comprehensive dataset comprising 71 publicly available Hi-C data from 5 tissues in 6 pig breeds (Table 1). Leveraging this extensive dataset, we constructed a mini-atlas consisting of 65,843 TADs to provide a detailed map of TADs in pigs. Our study encompasses multiple objectives. Firstly, we examine the interplay between TADs and functional regulatory elements derived from the Pig Genotype-Tissue-Expression (PigGTEx) and FAANG, including expression quantitative trait loci (eQTL), splicing quantitative trait loci (sQTL), loss-of-function variants (LoFs) datasets and other regulatory elements. Secondly, we conduct inter-tissue and inter-species comparisons of TADs across different tissues and breeds. Furthermore, in order to deepen our understanding of the regulatory impact of TADs, we identify candidate genes that are likely influenced by changes of TADs and are associated with tissue- or breed-specific functions. These findings contribute to unraveling the complex regulatory landscape of the pig genome and offer potential avenues for future research in pig genetics.

Table 1 The numbers of breeds and tissues for 71 public Hi-C samples

Methods

Data collection

The high-throughput sequencing datasets contained 5 tissues and 6 breeds and were all collected from public datasets (Table S1). The public datasets were retrieved from the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA), and the corresponding SRA accession numbers can be found in Table S1. The genome annotation data, eQTL, sQTL, CTCF signals, top 1% predicted effect score of 13 tissues, and gene expression matrices were obtained from PigGTEx and FAANG [9, 26]. The chromatin states of the pig genome can be accessed through the UCSC Genome Browser (http://genome.ucsc.edu/s/zhypan/susScr11_15_state_14_tissues_new) [9]. Additionally, the high-throughput sequencing data generated for four tissues in this study are available in the Gene Expression Omnibus (GEO) under the accession number GSE158430.

Hi-C data processing and TAD calling

The Hi-C reads were processed using the Juicer software (version 1.5) [28]. Briefly, the high-quality Hi-C reads were mapped to the pig reference genome (susScrofa11.1, access date:2021-08-28) using BWA (v0.7.15) [29, 30] with default parameters. Unaligned read pairs and PCR duplicates were filtered out using Juicer, and alignments with low quality (MAPQ < 30) were also discarded.

Subsequently, intra-chromosomal contact matrices at 50 kb resolution were generated independently for each sample using valid read pairs. These matrices were then quantile normalized using the Knight-Ruiz algorithm [31]. Additionally, inter-chromosomal contact matrices at 1 Mb resolution were generated using the KR algorithm and normalized using log-counts per million (CPM), which represents the average abundance across all libraries.

Criteria for filtering samples

To ensure the quality and consistency of the Hi-C datasets used in this study, we implemented several filtering steps. Firstly, we excluded Hi-C samples with sequencing depths below the average sequencing depth of 48.3\(\:\times\:\) as higher sequencing depth is known to yield more reliable results for high-resolution Hi-C maps and 3D chromatin structure prediction. The interquartile range (IQR) method was employed to define the upper and lower limits of outliers. Datasets with sequencing depths outside the range of 30 (lower limit) and 208.8 (upper limit) were excluded, based on the criteria of data points above the third quartile (Q3) + 1.5 × IQR or below Q1 − 1.5 × IQR (Fig. S2).

To maintain consistency in comparisons, we removed any unique Hi-C datasets with read lengths different from the majority of the datasets (read length of 150), specifically those with a read length of 100. Trim Galore (v0.6.7) [32] was used to trim sequences with a Phred quality score below 20, setting the threshold at 20. Sequences shorter than 15 nucleotides were discarded, and the first 3 nucleotides at the 5’ end of Read 2 were trimmed (--q 20 --paired --max_n 15 --clip_R2 3).

Furthermore, we calculated the percentage of Hi-C contact reads out of all reads. After applying the aforementioned filtering steps, the remaining Hi-C datasets exhibited a balanced number and coverage of TADs.

Principal component analysis (PCA) and correlation analysis

After a unified analysis pipeline and rigorous filtering of Hi-C samples, we performed Principal Component Analysis with the PCA module of FAN-C (0.9.28) [33], and then assessed the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient of HiCReppy (v 0.1.0) [34].

Down-sampling analysis

To ensure equal representation and comparability within each comparison group, down-sampling was performed on each sample. Initially, All Valid Reads of each sample were using for down-sampling, and then we utilized HiC-Pro (v2.11.4) [35] to convert the matrix files back to the Juicer format for subsequent analysis. Down-sampling was conducted on the Hi-C interaction matrix following the method described by Carty [36].

In summary, the counts of each element in the matrix, representing pairs of genomic loci, were converted into a list of paired-end reads, where the size of the list matched the counts. Through a random subsampling procedure without replacement, reads were selected from this list and reassigned to create a new downsized dataset. This down-sampling process ensured that each sample within the comparison group had an equal number of valid read pairs, facilitating fair and unbiased comparisons.

Functional enrichment analysis of genes

To conduct gene function enrichment analysis, we utilized the biomaRt (v2.52.0) [37] R package. This package facilitated the conversion of swine gene Ensemble IDs to swine Entrez IDs. The obtained Entrez IDs were then inputted into clusterProfiler (v4.2.2) [38], enabling us to retrieve GO [39] and KEGG [40] enrichment annotations.

In order to gain deeper insights into the functional roles of genes within the conserved TADs, we conducted GO and KEGG enrichment analyses. These analyses aimed to identify significant enriched GO terms and KEGG pathways, respectively, providing valuable information about the functional characteristics of genes within the identified TADs.

Identification of conserve and differential TADs

Our study focused on six comparative groups: muscle tissue and adipose tissue, adipose tissue and ear tissue, liver tissue and adipose tissue, Large White pig and Rongchang pig, muscle tissue and ear tissue, and Bama Xiang pig and Large White pig (Table 2). Differential interactions regions for each group were detected using CHESS (v 0.3.7) [41]. Differential TADs were defined that TADs overlapped with the regions of differential interactions.

Table 2 The details of six comparison groups

When analyzing differential TADs, we took into account that the TAD regions identified by the Arrowhead algorithm [42] contained overlapping regions between TADs. To accurately calculate the frequency distribution of TADs throughout the genome, we assigned a frequency of 2 to the overlapping regions, while assigning a frequency of 1 to the non-overlapping regions. This approach ensured an accurate representation of TAD occurrences in the genome. TADs with a frequency of 24 (equal to the number of samples) were defined as conserved TADs, while all other TADs within each comparison group, excluding the differential TADs, were categorized as other TADs. We then calculated the proportions of both the differential TADs and conserved TADs between the groups.

Enrichment analysis of tads and functional annotation data

To evaluate the enrichment fold and odds ratios of functional regulatory elements within TADs and their flanking regions, we conducted a rigorous analysis. Initially, we identified the number of variants located within TAD regions across different pig tissues and breeds using bedtools (v2.30.0) [43]. Subsequently, we employed Fisher’s exact test to evaluate the enrichment probability of various functional elements, including eQTL, sQTL, 15 chromatin states, top 1% regulatory variants predicted by a deep learning method in 13 tissues, and LoFs within TAD regions. Here, we will provide a detailed description of the enrichment analysis for eQTLs as an example.

To calculate the enrichment, we implemented the following procedure:

We divided each TAD into 60 equally sized bins. Additionally, we extended each TAD upstream and downstream by the length of the TAD itself to define the TAD flanking region. We further divided the TAD flanking region into 120 equally sized bins. Consequently, both the TAD and its flanking region were uniformly segmented into a total of 180 bins.

Next, we computed the enrichment as the ratio of the number of eVariants (the genes and genes with alternative splicing events had at least one significant variant detected in e/sQTL study) within a specific bin to the number of SNPs within that bin. This value was divided by the ratio of the number of all eVariants in the entire genome to the number of all SNPs in the genome. The calculation was performed as follows: (number of eVariants in bin / number of SNPs in bin) / (number of all eVariants in the genome / number of all SNPs in the genome).

We employed a significance threshold of FDR < 0.01 to identify statistically significant enrichment between eQTL and TADs. This allowed us to determine the presence of meaningful relationships at a high confidence level.

By applying these robust methods, we were able to investigate the enrichment between eQTL and TADs at a fine-grained level within the genome. Similar procedures were followed for assessing the enrichment of other functional regulatory elements. This comprehensive analysis enabled us to uncover the intricate relationships and potential regulatory mechanisms between TADs and these elements.

Identification of differentially expressed mRNAs

In this study, RNA-seq gene expression matrices were obtained from the PigGTEx [26]. To identify differentially expressed genes (DEGs) in each comparison group, we utilized the EdgeR package (v3.36.0) [44], which is a widely used statistical method based on a negative binomial distribution model. For each comparison group, we applied filtering criteria set as FDR ≤ 0.01 and|log2FC| ≥ 1 to identify differential expression genes, where FC represents the fold change.

Results

A mini-atlas tads map of 24 Hi-C samples

We gathered a dataset comprising 71 Hi-C samples (Table 1), which yielded approximately 40 billion Hi-C sequence reads (Table S1). After alignment, we filtered out unmapped reads, self-loops, and non-valid data connections, resulting in an average mapping rate of 98.13% across all samples (see Methods, Fig. S1 and Table S2).

To mitigate potential biases arising from variations in sequencing depth across different projects, we conducted thorough data screening and filtering processes. Out of the initial 71 datasets, we retained 24 datasets for further analysis (see Methods, Table S3). All samples included in the study met the criterion of having effective interacting reads comprising more than 50% of the total reads [35]. The sequencing depths of the samples ranged from 48.3 to 204.01 (Table S3, Fig. 1a and Table 1), showing a strong correlation (P < 0.05, R2 = 0.80) between sequencing depth and Hi-C contact count (Fig. 1a; Table 3). For each sample, we assessed various parameters including the number of sequenced reads (90 M to 2000 M), contact counts (41 M to 1183 M), trimmed reads (0.1 M to 45 M), and TAD coverage (65–78%) (Fig. 1b-c, Table S2-S3). We conducted PCA on 24 datasets and demonstrated the effectiveness of our correction method in mitigating the influence of batch effects (Fig S3). By utilizing contact maps with a resolution of 50 kb, we observed that TADs spanned approximately 65–78% of the whole genome. The number of TADs detected ranged from 878 to 1077, with a median size of 1.7 Mb (Fig. 1c; Table 3). This comprehensive analysis provides insights into the coverage and characteristics of TADs within the genome.

Table 3 The mean values of Hi-C and TAD data grouped by breed and tissue
Fig. 1
figure 1

A mini-atlas TADs map of 24 Hi-C samples. (a) Linear regression analysis showing the relationship between sequence depth and Hi-C contact counts for five tissues and nine breeds of pigs. Gray points represent samples that were filtered out, while red points represent samples that were kept. (b) Counts of sequence reads, trimmed reads, and Hi-C contact reads. (c) Distribution of TAD coverage. The left y-axis represents the sample count, the right y-axis represents the scaled density (scaled to a maximum of 1), and the dotted line indicates the median of TAD coverage. (d) Enrichment of CTCFs (CCCTC-binding factor) at the TAD boundary in muscle (left panel), liver (middle panel), and adipose (right panel). (e-g) The number (e) and average length (f) of TADs and number of Hi-C contacts (g) for Hi-C datasets 687 grouped by tissue and species

Given the well-established importance of CTCFs and cohesions in TAD formation and chromatin loops [45,46,47], we investigated the CTCF signals in muscle tissue obtained from FANNG [9]. Specifically, we examined the enrichment of CTCF signals within the identified TADs in adipose, liver, and muscle tissues separately. Our analysis revealed significant enrichment of CTCF peaks at the boundaries of TADs in each tissue (Fig. 1d). This consistent CTCF enrichment across all three tissues highlights the high conservation of TADs. Furthermore, we assessed the consistency of TAD numbers, average TAD length, and the number of Hi-C contacts across different tissues and breeds. Strikingly, all three factors exhibited similar values, indicating a comparable TAD landscape and unbiased basis for TAD comparisons among various tissues and breeds (Fig. 1e-g). These findings provide reassurance regarding the reliability of our TAD analyses and enable robust comparisons of TADs across diverse biological contexts.

Impact of tads on regulation of transcriptional activity

To comprehensively investigate the impact of TADs on regulation of transcriptional activity, we examined the enrichment of functional regulatory elements from the pig-GTEx project within TADs. These elements included 15 chromatin states, eQTL, sQTL, LoFs, and top 1% predicted SNPs effect of 13 tissues using deep learning by ATAC-seq data (Fig. 2a).

Fig. 2
figure 2

Impact of TADs on regulation of transcriptional activity. (a) An overview of functional regulatory elements in Topologically Associated Domains (TADs) and their flanking regions. (b) Within TADs, TAD flanking regions, and the combined regions, enrichment analysis of Loss-of-Function variants (LoFs) (c), chromatin states (d), and eQTL and sQTL (e), the top 1% of predict effect score based on deep learning and ATAC-seq of 13 tissues. The distribution of each type of data in the combined TAD and flanking regions is shown in the right panel, with red, yellow, and green lines representing liver, adipose, and muscle tissues, respectively. Black lines indicate the three types of tissues

Initially, we analyzed a LoFs dataset consisting of 27,148 mutations, including splice acceptor, splice donor, start lost, stop gain, and stop loss. We assessed the enrichment of LoFs within the TAD body and flanking regions. Interestingly, we found that LoFs were more frequently observed at the TAD boundaries, while the TAD body showed lower enrichment (Fig. 2b). Furthermore, we examined SNPs with the highest predicted effect scores (top 1%) of 13 tissues using deep learning with ATAC-seq data and observed similar distribution and enrichment patterns to LoFs (Fig. 2e), suggesting that TAD boundaries have a greater impact on the gene regulations.

We further examined the enrichment and frequency of different chromatin states within TADs and their flanking (2× TAD length) regions. 15 chromatin states examined in this study were defined by the integration of five epigenetic marks across 14 different tissues (Table 4). These states primarily include promoters (TssA, TssAHet, and TssBiv, covering 1.16% of the entire genome), TSS-proximal transcription regions (TxFlnk, TxFlnkWk, and TxFlnkHet, covering 0.92% of the genome), enhancers (EnhA, EnhAMe, EnhAWk, EnhAHet, and EnhPois, covering 6.5% of the genome), repressed regions (Repr and ReprWk, covering 13.25% of the genome), and quiescent regions (Qui, covering 73.39% of the genome) (Table 4). Enrichment of enhancers, promoter and TSS-proximal regions in TADs boundary (upstream and downstream regions of 50 kb for each TAD) are significantly (P = 0.0172, 0.00012, 0.0005) higher than within TAD body, whereas enrichment fold of repressed regions was low (P = 0.0006) in TADs body and boundary (Fig. 2c). However, quiescent regions exhibit notably higher enrichment but not significantly (P = 0.55) within TAD body regions compared to TAD boundary (Fig. 2c). This observation may be attributed to the promoting role of CTCF in TAD formation, leading to a reduced proportion of quiescent regions near TAD boundaries. Furthermore, we assessed the enrichment of eQTL and sQTL of 34 tissues within TADs, which are known to influence gene expression and splicing. Interestingly, we observed the highest frequency and of eQTL and sQTL at the TAD boundary, the enrichment fold of eQTL and sQTL in TAD boundary were significantly (P < 0.05) higher than in TAD body (Fig. 2d). These findings highlight the critical role of TAD boundaries in the pig genome, as indicated by the enrichment of functional regulatory elements.

Table 4 Definitions and abbreviations of 15 chromatin States

Overall, our results demonstrate the importance of TAD boundaries in shaping the chromatin landscape and regulating gene expression and splicing in the pig genome.

Identification of conserved and differential tads across breeds and tissues

To gain a comprehensive understanding of TAD conservation and variability in the pig genomes, we divided 24 diverse samples into six comparison groups based on tissue and breed. The details of each group were shown in Table 2 and the correlations of 24 sample (R2 ~ 0.48–0.99) were show in Fig S4. Using graphical representations of the chromosomes, we visually depicted the differential TADs for each group and conserved TAD regions of all groups, providing an overview of our analysis results (Fig. 3a). Our findings revealed that the majority of TADs were conserved (Fig. 3b, Table S4). Surprisingly, we observed greater differences of Adipose and Liver tissue between Bamaxiang pigs compared to the other groups (Fig. 3b, Table S5), contrary to our initial expectations. We attributed this variation to differences in sampling and sequencing depth among the collected samples, which may lead to bias. To address the issue of varying sequencing depths, we performed down-sampling to ensure comparability among the samples and reanalyzed the differential and conserved TADs. As depicted in Fig. 3c, the differences observed between breeds were found to be smaller compared to the differences observed between tissues.

Fig. 3
figure 3

Identification of conserved and differential TADs across breeds and tissues. (a) Chromosome plot of differential and conserved TADs across species and tissues. (b) The proportion of conserved TADs and differential TADs in each comparison group. (c) Number of genes in differential TADs and conserved TADs for each comparison group. (d-e) Significantly enriched KEGG pathways and GO terms (P < 0.05) of genes in conservation TADs

Furthermore, we investigated the genes functions within the TADs identified as differential and conserved. We quantified the number of genes within the differential TADs for each comparison group and within the conserved TADs present across all groups (Fig. 3c, Table S6). Then we performed GO (Ashburner et al., 2000) and KEGG (Kanehisa and Goto, 2000) enrichment analyses on the 5,916 genes within the conserved TADs (Fig. 3c). These genes exhibited enrichment in pathways crucial for maintaining fundamental organismal functions (Fig. 3d-e, Table S7-S8). However, no significant enrichment in GO terms and KEGG pathways was found for the genes within the differential TADs. This analysis confirmed the reliability of the TAD structures and provided evidence supporting the essential role of conserved TADs in maintaining genome stability and organismal viability.

Identification of differentially expressed genes in differential tads

To gain further insights into the regulatory mechanisms underlying gene expression in pigs resulting from alterations in 3D chromatin structure across different breeds and tissues, we focused on identifying differentially expressed genes within TADs that exhibited tissue-specific differences [48]. In the differential TADs of Bamaxiang between liver and adipose tissue, we discovered a total of 10 differentially expressed genes (FDR < 0.01 and|log2FC| ≥ 1) (Fig. 4a, Table S9), including nuclear receptor coactivator 2 (NCOA2), a member of the p160 co-activator family. Our analysis also revealed that NCOA2 was located within a differentially TAD in adipose and muscle tissues of LargeWhite pigs, and RNA-seq data analysis confirmed its specific expression in adipose tissue (Fig. 4b).

Fig. 4
figure 4

Identification of differentially expressed genes in differential TADs. (a) Genes identified as differentially expressed in the liver and adipose tissue comparison group. (b) The plot regional chromosome position. The differential TAD of Bamaxiang between adipose and liver tissues where the gene NOCA2 is located (second-third panel), and chromatin states (fourth panel) and RNA-seq (last panel) in this region. the dashed lines indicate the differential positions and the RNA-seq signals in the two tissues. The bars represent the chromatin states; red represents promoters, green represents TSS-proximal transcribed regions, yellow represents enhancers, blue represents ATAC islands, and grey represents repressed regions. (c) Genes identified as differentially expressed between muscle and adipose tissue. (d) The plot regional chromosome position. The differential TAD of LargeWhite between adipose and muscle tissues where the gene BMPER is located (second-third panel), and chromatin states (fourth panel) and RNA-seq (last panel) in this region. The dashed lines indicate the differential positions and the RNA-seq signals in the two tissues. The bars represent the chromatin states; red represents promoters, green represents TSS-proximal transcribed regions, yellow represents enhancers, blue represent ATAC islands, and grey represents repressed regions. The small red rectangles mean a 50 kb resolution of TADs

Furthermore, we identified differentially expressed genes within the differentially TADs between muscle and adipose tissues in LargeWhite pigs, and total 10 differential genes ranked by log2FC were presented in Fig. 4c (Table S10). We then focused on the differential TAD where BMPER is located (Fig. 4d) and observed a more compact folding of the genome at the TAD boundary where BMPER resides, along with tissue-specific TSS that led to upregulated gene expression. By examining TADs across tissues, we identified differentially expressed genes within these tissue-specific differential TADs, providing further support for the pivotal role of such TADs in regulating tissue-specific gene expression and related functions.

Discussion

Unlike many studies focusing on a single breed or tissue, our analysis encompasses multiple pig breeds and tissues, providing a comprehensive overview of TAD specificity across different biological contexts. We not only analyze TAD structures but also integrate chromatin interaction data with gene expression profiles, offering insights into how TADs influence gene regulation across different conditions. We employ state-of-the-art computational techniques, to predict functional elements within TADs and the SNP roles in functional elements, which has not been extensively covered in previous studies. The comprehensive analysis presented in this study delves deep into the characterization of TADs in the pig genome, shedding light on their crucial roles in regulating gene expression across different tissues and breeds. The findings herein provide valuable insights into the structural and functional aspects of TADs in the context of the pig genome, offering a foundation for future studies on the genetic regulation of complex traits in pigs.

Our investigation has highlighted the preservation of TADs through the heightened presence of CTCF and the differential analysis of TADs. Kentepozidou et al. (2020) identified evolving clusters of CTCF binding sites as a characteristic of TAD boundary architecture [49]. CTCF binding sites exhibit enrichment at TAD boundaries, and this conservation across diverse cell types suggests a pivotal role for CTCF in both the establishment and maintenance of these boundaries [50]. The asymmetry of CTCF binding sites is noteworthy, where the convergent orientation of a CTCF site pair contributes to the formation of chromatin loops in vivo [50]. CTCF, highly concentrated at TAD boundaries, forms physical loops with intervening DNA, establishing an insulated environment crucial for the proper expression of lineage-specifying genes [51]. Additionally, CTCF binding actively contributes to the configuration of a higher-order genome structure by delineating the boundaries of extensive TADs [49]. Disruptions in TAD boundaries have been observed to significantly impact gene expression, underscoring the necessity of exploring genome-level alterations in TADs and TAD boundaries to comprehend the intricate interplay between 3D genome structure and phenotype [13]. Notably, our findings indicate an enrichment of functional DNA variants at TAD boundaries, including eVariants in e/sQTLs, LoFs, and the top 1% high-impact SNPs. We posit that this enrichment may be attributed to the ability of functional DNA variants to modulate chromatin, such as enhancer and promoter regions, as well as repressed segments, all of which exhibit heightened presence at TAD boundaries. The enrichment of the top 1% high-impact SNPs, predicted to induce substantial changes in open chromatin states, provides support for our hypothesis. Nanni et al. (2020) similarly noted the enrichment of epigenetic marks associated with gene expression at TAD boundaries [50]. Additionally, Lazar et al. (2018) found significant enrichment of genetic and epigenetic signatures at TAD boundaries, including higher CpG density compared to the rest of the genome, increased presence of CTCF binding, H3K4me3, and the existence of SINE elements [52]. These collective insights highlight the fundamental importance of TAD boundaries in orchestrating gene expression dynamics within the genome.

The identification of conserved and differential TADs across breeds and tissues provides a unique perspective on genome organization in pigs. The majority of TADs were found to be conserved, with variations primarily attributed to tissue differences rather than breed. This result emphasizes the need to account for tissue-specific effects in future studies. The functional enrichment analysis of genes within conserved TADs highlights their role in fundamental organismal functions, consistent with previous studies [48], reinforcing the significance of TAD conservation in maintaining genome stability and viability. Finally, the identification of differentially expressed genes within tissue or breed-specific differential TADs provides a link between 3D chromatin structure and gene expression regulation. Among the notable findings, we identified NCOA2, a member of the steroid receptor coactivator family, as a differential gene residing within the TAD unique to Bamaxiang adipose when compared Bamaxiang liver. Previous studies have shown that NCOA2 is highly expressed in adipose tissue and contains tissue-specific CpG islands with chromatin-accessible enhancers [53, 54]. Functionally, as a coactivator of PPARγ, NCOA2 plays a pivotal role in regulating lipid metabolism and energy homeostasis [55]. Furthermore, it emerges as a key player in modulating intramuscular fatty acid composition in pigs [56, 57]. In a different context, we observed BMPER as a differential gene situated within TAD unique to muscle when compared to adipose in Large White pigs. BMPER, or BMP endothelial cell precursor-derived regulator, is known for its role in dampening the activities of bone morphogenetic proteins (BMPs), growth factors pivotal in the development of bone and cartilage [58]. Intriguingly, variants of this gene in cattle have been associated with larger body sizes and extended rump lengths [59]. Moreover, BMPER has been demonstrated to enhance intramuscular fat content in pigs [60], with quantitative analysis unveiling a positive correlation between BMPER expression and intramuscular fat levels [61]. BMPER also serves as a marker for adipose progenitors and adipocytes, exerting a positive influence on adipogenesis [62]. These findings collectively illuminate the intricate interplay of genes within TADs across various tissues, offering valuable insights into their roles in shaping physiological traits of agricultural significance. The discovery of tissue-specific differential genes underscores the importance of TADs in shaping tissue-specific gene expression patterns.

However, we acknowledge some limitations in our study. Firstly, the exploration of the 3D genome in pigs is still in its early stages, and the availability of tissues and breeds for analysis is limited, let alone single-cell 3D genomic data. Secondly, the lack of phenotype data (such as GWAS data) hinders our ability to directly investigate the impact of TAD alterations on phenotypic traits. Lastly, the absence of genomic structural variation data prevents us from employing deep learning methods to explore genomic variations underlying TAD changes. These limitations should be considered when interpreting our results and highlight areas for future research and data integration to further elucidate the functional implications of TAD dynamics in pigs.

In conclusion, this study provides a comprehensive exploration of TADs in the pig genome, offering insights into their conservation, impact on gene regulation, and relevance to tissue-specific differences. The findings not only advance our understanding of genome organization in pigs but also lay the foundation for future investigations into the genetic basis of complex traits and diseases in pigs.

Conclusions

In summary, our comprehensive analysis of TADs in the pig genome has provided valuable insights into their role in shaping the chromatin landscape and regulating gene expression and splicing. We found that TAD boundaries are enriched with functional regulatory elements, such as CTCF signals, LoFs, top-effect SNPs, chromatin states, eQTL, and sQTL, underscoring their critical importance in gene regulation. Additionally, our study revealed both conserved and differential TADs across various pig breeds and tissues, with a focus on tissue-specific gene expression changes within these TADs. These findings highlight the dynamic nature of TADs in pigs and their significance in governing the transcriptional activity and functional diversity of the genome. Overall, our research contributes to a deeper understanding of 3D chromatin organization and its impact on gene regulation in the pig genome, with implications for both basic biology and livestock breeding.

Data availability

The dataset collected in our study was sourced from a public dataset and is presented in Table S1. We did not generate any new data. The source code of our papers is available at https://github.com/yinhongwei4079/Hi-C-analysis-codes.

References

  1. Lieberman-Aiden E, van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, Amit I, Lajoie BR, Sabo PJ, Dorschner MO, et al. Comprehensive mapping of Long-Range interactions reveals folding principles of the human genome. Science. 2009;326(5950):289–93.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. Nora EP, Lajoie BR, Schulz EG, Giorgetti L, Okamoto I, Servant N, Piolot T, van Berkum NL, Meisig J, Sedat J, et al. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature. 2012;485(7398):381–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Bolzer A, Kreth G, Solovei I, Koehler D, Saracoglu K, Fauth C, Müller S, Eils R, Cremer C, Speicher MR, et al. Three-dimensional maps of all chromosomes in human male fibroblast nuclei and prometaphase rosettes. PLoS Biol. 2005;3(5):e157.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Ulianov SV, Khrameeva EE, Gavrilov AA, Flyamer IM, Kos P, Mikhaleva EA, Penin AA, Logacheva MD, Imakaev MV, Chertovich A, et al. Active chromatin and transcription play a key role in chromosome partitioning into topologically associating domains. Genome Res. 2016;26(1):70–84.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Hou C, Li L, Qin ZS, Corces VG. Gene density, transcription and insulators contribute to the partition of the drosophila genome into physical domains. Mol Cell. 2012;48(3):471–84.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Sexton T, Yaffe E, Kenigsberg E, Bantignies F, Leblanc B, Hoichman M, Parrinello H, Tanay A, Cavalli G. Three-dimensional folding and functional organization principles of the drosophila genome. Cell. 2012;148(3):458–72.

    Article  CAS  PubMed  Google Scholar 

  7. Lupiáñez DG, Kraft K, Heinrich V, Krawitz P, Brancati F, Klopocki E, Horn D, Kayserili H, Opitz JM, Laxova R, et al. Disruptions of topological chromatin domains cause pathogenic rewiring of Gene-Enhancer interactions. Cell. 2015;161(5):1012–25.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Rowley MJ, Nichols MH, Lyu X, Ando-Kuri M, Rivera ISM, Hermetz K, Wang P, Ruan Y, Corces VG. Evolutionarily conserved principles predict 3D chromatin organization. Mol Cell. 2017;67(5):837–e852837.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Pan Z, Yao Y, Yin H, Cai Z, Wang Y, Bai L, Kern C, Halstead M, Chanthavixay G, Trakooljul N, et al. Pig genome functional annotation enhances the biological interpretation of complex traits and human disease. Nat Commun. 2021;12(1):5848.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Zuin J, Dixon JR, van der Reijden MIJA, Ye Z, Kolovos P, Brouwer RWW, van de Corput MPC, van de Werken HJG, Knoch TA, van IJcken WFJ, et al. Cohesin and CTCF differentially affect chromatin architecture and gene expression in human cells. Proc Natl Acad Sci. 2014;111(3):996–1001.

    Article  CAS  PubMed  Google Scholar 

  11. Spitz F, Furlong EEM. Transcription factors: from enhancer binding to developmental control. Nat Rev Genet. 2012;13(9):613–26.

    Article  CAS  PubMed  Google Scholar 

  12. McArthur E, Capra JA. Topologically associating domain boundaries that are stable across diverse cell types are evolutionarily constrained and enriched for heritability. Am J Hum Genet. 2021;108(2):269–83.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Akdemir KC, Le VT, Chandran S, Li Y, Verhaak RG, Beroukhim R, Campbell PJ, Chin L, Dixon JR, Futreal PA, et al. Disruption of chromatin folding domains by somatic genomic rearrangements in human cancer. Nat Genet. 2020;52(3):294–305.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Corominas J, Ramayo-Caldas Y, Puig-Oliveras A, Estellé J, Castelló A, Alves E, Pena RN, Ballester M, Folch JM. Analysis of Porcine adipose tissue transcriptome reveals differences in de Novo fatty acid synthesis in pigs with divergent muscle fatty acid composition. BMC Genomics. 2013;14:843.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Fang X, Mu Y, Huang Z, Li Y, Han L, Zhang Y, Feng Y, Chen Y, Jiang X, Zhao W, et al. The sequence and analysis of a Chinese pig genome. GigaScience. 2012;1(1):2047–217. X-2041-2016.

    Article  Google Scholar 

  16. Hryhorowicz M, Lipiński D, Hryhorowicz S, Nowak-Terpiłowska A, Ryczek N, Zeyland J. Application of genetically engineered pigs in biomedical research. Genes. 2020;11(6):670.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Martin V, Knecht C, Duerlinger S, Richter B, Ladinig A. A pig model to assess skin lesions after apomorphine application. Biomedicines. 2023;11(5):1244.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Andersson L, Archibald AL, Bottema CD, Brauning R, Burgess SC, Burt DW, Casas E, Cheng HH, Clarke L, Couldrey C, et al. Coordinated international action to accelerate genome-to-phenome with FAANG, the functional annotation of animal genomes project. Genome Biol. 2015;16(1):57.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, Heravi-Moussavi A, Kheradpour P, Zhang Z, Wang J, Ziller MJ, et al. Integrative analysis of 111 reference human epigenomes. Nature. 2015;518(7539):317–30.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Burns EN, Bordbari MH, Mienaltowski MJ, Affolter VK, Barro MV, Gianino F, Gianino G, Giulotto E, Kalbfleisch TS, Katzman SA, et al. Generation of an equine biobank to be used for functional annotation of animal genomes project. Anim Genet. 2018;49(6):564–70.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Fang L, Liu S, Liu M, Kang X, Lin S, Li B, Connor EE, Baldwin RL, Tenesa A, Ma L, et al. Functional annotation of the cattle genome through systematic discovery and characterization of chromatin States and butyrate-induced variations. BMC Biol. 2019;17(1):68.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Kingsley NB, Kern C, Creppe C, Hales EN, Zhou H, Kalbfleisch TS, MacLeod JN, Petersen JL, Finno CJ, Bellone RR. Functionally annotating regulatory elements in the equine genome using histone mark ChIP-Seq. Genes. 2019;11(1):3.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Halstead MM, Kern C, Saelao P, Wang Y, Chanthavixay G, Medrano JF, Van Eenennaam AL, Korf I, Tuggle CK, Ernst CW, et al. A comparative analysis of chromatin accessibility in cattle, pig, and mouse tissues. BMC Genomics. 2020;21(1):698.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Chepelev I, Wei G, Wangsa D, Tang Q, Zhao K. Characterization of genome-wide enhancer-promoter interactions reveals co-expression of interacting genes and modes of higher order chromatin organization. Cell Res. 2012;22(3):490–503.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Zhang J, Liu P, He M, Wang Y, Kui H, Jin L, Li D, Li M. Reorganization of 3D genome architecture across wild Boar and Bama pig adipose tissues. J Anim Sci Biotechnol. 2022;13(1):32.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Teng J, Gao Y, Yin H, Bai Z, Liu S, Zeng H, Bai L, Cai Z, Zhao B, Li X et al. A compendium of genetic regulatory effects across pig tissues. Nat Genet 2024.

  27. Zhao Y, Hou Y, Xu Y, Luan Y, Zhou H, Qi X, Hu M, Wang D, Wang Z, Fu Y, et al. A compendium and comparative epigenomics analysis of cis-regulatory elements in the pig genome. Nat Commun. 2021;12(1):2217.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. Durand NC, Robinson JT, Shamim MS, Machol I, Mesirov JP, Lander ES, Aiden EL. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst. 2016;3(1):99–101.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Peters D, Qiu K, Liang P. Faster short DNA sequence alignment with parallel BWA. AIP Conf Proc. 2011;1368(1):131–4.

    Article  CAS  Google Scholar 

  30. Warr A, Affara N, Aken B, Beiki H, Bickhart DM, Billis K, Chow W, Eory L, Finlayson HA, Flicek P et al. An improved pig reference genome sequence to enable pig genetics and genomics research. Gigascience 2020, 9(6).

  31. Nagano T, Lubling Y, Várnai C, Dudley C, Leung W, Baran Y, Mendelson Cohen N, Wingett S, Fraser P, Tanay A. Cell-cycle dynamics of chromosomal organization at single-cell resolution. Nature. 2017;547(7661):61–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Krueger F, James F, Ewels P, Afyounian E, Weinstein M, Schuster-Boeckler B, Hulselmans G. sclamons: FelixKrueger/TrimGalore: v0.6.10 - add default decompression path. In.: Zenodo; 2023.

  33. Kruse K, Hug CB, Vaquerizas JM. FAN-C: a feature-rich framework for the analysis and visualisation of chromosome conformation capture data. Genome Biol. 2020;21(1):303.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Yang T, Zhang F, Yardımcı GG, Song F, Hardison RC, Noble WS, Yue F, Li Q. HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient. Genome Res. 2017;27(11):1939–49.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Servant N, Varoquaux N, Lajoie BR, Viara E, Chen C-J, Vert J-P, Heard E, Dekker J, Barillot E. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 2015;16(1):259.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Carty M, Zamparo L, Sahin M, González A, Pelossof R, Elemento O, Leslie CS. An integrated model for detecting significant chromatin interactions from high-resolution Hi-C data. Nat Commun. 2017;8(1):15454.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Durinck S, Moreau Y, Kasprzyk A, Davis S, De Moor B, Brazma A, Huber W. BioMart and bioconductor: a powerful link between biological databases and microarray data analysis. Bioinformatics. 2005;21(16):3439–40.

    Article  CAS  PubMed  Google Scholar 

  38. Yu G, Wang L-G, Han Y, He Q-Y. ClusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16(5):284–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. The gene ontology consortium. Nat Genet. 2000;25(1):25–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Kanehisa M, Goto S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28(1):27–30.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Galan S, Machnik N, Kruse K, Díaz N, Marti-Renom MA, Vaquerizas JM. CHESS enables quantitative comparison of chromatin contact data and automatic feature extraction. Nat Genet. 2020;52(11):1247–55.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Dixon JR, Gorkin DU, Ren B. Chromatin domains: the unit of chromosome organization. Mol Cell. 2016;62(5):668–80.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26(6):841–2.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Robinson MD, McCarthy DJ, Smyth GK. EdgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40.

    Article  CAS  PubMed  Google Scholar 

  45. Li J, Xiang Y, Zhang L, Qi X, Zheng Z, Zhou P, Tang Z, Jin Y, Zhao Q, Fu Y, et al. Enhancer-promoter interaction maps provide insights into skeletal muscle-related traits in pig genome. BMC Biol. 2022;20(1):136.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Nakai K, Vandenbon A. Chap. 2 - Higher-order chromatin structure and gene regulation. In: Epigenetics in Organ Specific Disorders. Edited by Boosani CS, Goswami R: Academic Press; 2023;34:11–32.

  47. Porter RS, Iwase S. Modulation of chromatin architecture influences the neuronal nucleus through activity-regulated gene expression. Biochem Soc Trans. 2023;51(2):703–13.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Zhao R, Talenti A, Fang L, Liu S, Liu G, Chue Hong NP, Tenesa A, Hassan M, Prendergast JGD. The conservation of human functional variants and their effects across livestock species. Commun Biology. 2022;5(1):1–13.

    Article  CAS  Google Scholar 

  49. Kentepozidou E, Aitken SJ, Feig C, Stefflova K, Ibarra-Soria X, Odom DT, Roller M, Flicek P. Clustered CTCF binding is an evolutionary mechanism to maintain topologically associating domains. Genome Biol. 2020;21(1):5.

    Article  PubMed  PubMed Central  Google Scholar 

  50. Nanni L, Ceri S, Logie C. Spatial patterns of CTCF sites define the anatomy of tads and their boundaries. Genome Biol. 2020;21(1):197.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  51. Islam Z, Saravanan B, Walavalkar K, Thakur J, Farooq U, Singh AK, Sabarinathan R, Pandit A, Henikoff S, Notani D. Active enhancers strengthen insulation by RNA-mediated CTCF binding at TAD boundaries. In.: bioRxiv; 2021.

  52. Lazar NH, Nevonen KA, O’Connell B, McCann C, O’Neill RJ, Green RE, Meyer TJ, Okhovat M, Carbone L. Epigenetic maintenance of topological domains in the highly rearranged gibbon genome. Genome Res. 2018;28(7):983–97.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  53. Camargo GMFd, Costa RB, Albuquerque LGd, Regitano LCA, Baldi F, Tonhati H. Polymorphisms in TOX and NCOA2 genes and their associations with reproductive traits in cattle. Reprod Fertility Dev. 2015;27(3):523–8.

    Article  Google Scholar 

  54. WenXing S, ShuHua G, HaiYin H, WeiWei C, ShiGang Y, Jie C. Influence of Silencing NCOA2 gene by SiRNA on differentiation of Porcine intramuscular preadipocytes. J Nanjing Agricultural Univ. 2016;39(4):619–23.

    Google Scholar 

  55. Yamamuro T, Nakamura S, Yanagawa K, Tokumura A, Kawabata T, Fukuhara A, Teranishi H, Hamasaki M, Shimomura I, Yoshimori T. Loss of RUBCN/rubicon in adipocytes mediates the upregulation of autophagy to promote the fasting response. Autophagy. 2022;18(11):2686–96.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  56. Ramayo-Caldas Y, Ballester M, Fortes MR, Esteve-Codina A, Castelló A, Noguera JL, Fernández AI, Pérez-Enciso M, Reverter A, Folch JM. From SNP co-association to RNA co-expression: novel insights into gene networks for intramuscular fatty acid composition in Porcine. BMC Genomics. 2014;15:232.

    Article  PubMed  PubMed Central  Google Scholar 

  57. Valdés-Hernández J, Ramayo-Caldas Y, Passols M, Sebastià C, Criado-Mesas L, Crespo-Piazuelo D, Esteve-Codina A, Castelló A, Sánchez A, Folch JM. Global analysis of the association between pig muscle fatty acid composition and gene expression using RNA-Seq. Sci Rep. 2023;13(1):535.

    Article  PubMed  PubMed Central  Google Scholar 

  58. Duan X, An B, Du L, Chang T, Liang M, Yang B-G, Xu L, Zhang L, Li J, E G, et al. Genome-Wide association analysis of growth curve parameters in Chinese simmental beef cattle. Animals. 2021;11(1):192.

    Article  PubMed  PubMed Central  Google Scholar 

  59. Zhao C, Gui L, Li Y, Plath M, Zan L. Associations between allelic polymorphism of the BMP binding endothelial regulator and phenotypic variation of cattle. Mol Cell Probes. 2015;29(6):358–64.

    Article  CAS  PubMed  Google Scholar 

  60. Banigan EJ, Tang W, van den Berg AA, Stocsits RR, Wutz G, Brandão HB, Busslinger GA, Peters J-M, Mirny LA. Transcription shapes 3D chromatin organization by interacting with loop extrusion. Proc Natl Acad Sci USA. 2023;120(11):e2210480120.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  61. Liu Z, Sun W, Zhao Y, Xu C, Fu Y, Li Y, Chen J. The effect of variants in the promoter of BMPER on the intramuscular fat deposition in longissimus dorsi muscle of pigs. Gene. 2014;542(2):168–72.

    Article  CAS  PubMed  Google Scholar 

  62. Garritson JD, Zhang J, Achenbach A, Ferhat M, Eich E, Stubben CJ, Martinez PL, Ibele AR, Hilgendorf KI, Boudina S. BMPER is a marker of adipose progenitors and adipocytes and a positive modulator of adipogenesis. Commun Biology. 2023;6(1):638.

    Article  CAS  Google Scholar 

Download references

Acknowledgements

We are grateful to all members of the Key Laboratory of Livestock and Poultry Multi-omics of MARA, PigGTEx and FAANG for data and advice of data statistics.

Funding

This work was funded by the National Key Research and Developmental Program of China (2021YFF1000600), the National Natural Science Foundation of China (31972539, 31501933), the Science and Technology Program of Shenzhen (CJGJZD20210408092402006), the National Key R&D Program of China (2021YFD301201).

Author information

Authors and Affiliations

Authors

Contributions

The study was conceived and directed by FLZ and BLJ. ZQY wrote the manuscript. YHW and ZQY analyzed and interpreted the data. YL and YWY had provided the advices for analysis. OM, MG, YGQ, YHW, FLZ and BLJ had revised the paper. All the authors read and approved the final manuscript. All authors consent for publication.

Corresponding authors

Correspondence to Lingzhao Fang or Lijing Bai.

Ethics declarations

Ethics approval and consent to participate

No applicable.

Consent for publication

No applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yin, H., Zhao, Q., Yang, L. et al. A multi-tissue and -breed catalogue of chromatin conformations and their implications in gene regulation in pigs. BMC Genomics 26, 484 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12864-025-11490-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12864-025-11490-4

Keywords