- Research
- Open access
- Published:
Genome-wide analysis and functional validation of the cotton FAH gene family for salt stress
BMC Genomics volume 26, Article number: 271 (2025)
Abstract
Background
Fatty acid hydroxylases (FAHs) are a family of enzymes that includes fatty acid hydroxylases, carotenoid hydroxylases, and sterol desaturases. Fatty acids are highly important for plants. They are the main source of energy storage and the main component of the cell membrane. Saturated fatty acids can be divided into two categories: saturated fatty acids and unsaturated fatty acids. FAHs play a pivotal role in enhancing plant salt tolerance by modulating fatty acid metabolic pathways, thereby improving cell membrane stability and antioxidant capacity.
Results
In this study, we identified a total of 129 FAH gene family members in four cotton species, namely, Gossypium hirsutum, Gossypium darwinii, Gossypium arboreum, and Gossypium raimondii. The FAH genes were divided into five subgroups via evolutionary analysis. FAH genes located in the same subgroup presented similar gene structures and a consistent distribution of conserved motifs through the analysis of evolutionary trees, gene structures, and conserved motifs. Chromosomal localization analysis of the FAH gene family revealed that it has undergone chromosomal segment duplication events. Analysis of cis-acting elements suggested that the FAH gene may be involved in regulating biotic and abiotic stresses, plant growth and development, signaling pathways, and other physiological processes. The RT‒qPCR results revealed significant differences in the expression levels of FAH gene family members under salt stress conditions compared with those in the control group. Additionally, we successfully silenced Gohir.A03G045300 through VIGS experiments, and the results indicated that the silenced plants were more sensitive to salt stress than the control plants were. This suggests that Gohir.A03G045300 may be involved in the response of cotton to salt stress.
Conclusions
A total of 129 FAH genes were identified in four Gossypium species through bioinformatics analysis. Gene silencing of FAH members in G. hirsutum revealed that the FAH gene family plays a crucial role in the response of cotton to salt stress.
Background
Fatty acid hydroxylases (FAHs) are a family of enzymes that includes fatty acid hydroxylases, carotenoid hydroxylases, and sterol desaturases. Plant carotenoids are lipid-soluble pigments commonly found in the membranes of chloroplasts and chromoplasts, and they are also essential substances for plant survival. It protects chlorophyll from photooxidative damage caused by strong light and serves as an essential structural component of the photosynthetic antenna and reaction center complexes. Additionally, some carotenoids are key components of certain pigment‒protein complexes [1], and they play crucial roles in both photosynthesis and photoprotection [2]. β-Carotene hydroxylase is involved in the synthesis of lutein through the hydroxylation of β-carotene, making it a key enzyme in the carotenoid biosynthesis pathway. Lutein and zeaxanthin play important roles in scavenging free radicals [3, 4]. The gene sequence of β-carotene hydroxylase is relatively conserved and has been identified in Arabidopsis thaliana [5], Capsicum annuum, and sweet potato [6].
Studies have shown that the β-carotene hydroxylase gene plays an important role in the response to abiotic stress. In rice, the β-carotene hydroxylase gene DSM2 enhances resistance to drought and oxidative stress by increasing the biosynthesis of lutein and abscisic acid [7]. In addition, the FAH family also includes C-5 sterol desaturase and C-4 sterol methyl oxidase [8, 9], which play important roles in cholesterol biosynthesis and plant cuticular wax biosynthesis [10]. The catalytic center of these two enzymes contains a conserved domain sequence of HXHH (X is any amino acid), which can coordinate two iron ions and integrate into the endoplasmic reticulum to regulate the transmembrane transport of substances [11]. Phytosterols are important components of cell membranes and lipid rafts and play important roles in various physiological and biochemical processes and stress resistance during plant development [12] in response to various biotic and abiotic stresses. After salt stress, A. thaliana enhances its salt tolerance by reducing the contents of brassicasterol and stigmasterol, increasing the ratio of sitosterol to stigmasterol, and increasing the permeability of the cell membrane [13]. Moreover, the total sterol content in the roots of soybean decreased by 50%, and the saturated fatty acid content increased significantly after salt stress [14].
In addition, phytosterols are involved in the synthesis of brassinolides (BRs) as signaling molecules, regulating the expression of phytosterol synthase genes and thereby regulating plant tolerance to stress [15]. It can also serve as a component of the cell membrane or a key element of lipid rafts, participating in the regulation of plant responses to stress. When plants are subjected to salt stress, their membrane structure and function first change. Phytosterols exist primarily in the form of free sterols within the plant, where they interact with phospholipids and membrane proteins in the cell membrane under the influence of free sterols. This interaction enhances the adaptability of the plant membrane system to environmental stress. Fatty acids are highly important for plants. They are the main source of energy storage and the main component of the cell membrane. Saturated fatty acids can be divided into two categories: saturated fatty acids and unsaturated fatty acids. Unsaturated fatty acids can be further divided into monounsaturated fatty acids with only one double bond and polyunsaturated fatty acids containing two or more. At present, an increasing number of studies have shown that fatty acids and their derivatives participate in various signal transduction pathways by forming double bonds at specific positions to produce unsaturated fatty acids and then participate in the regulation of biotic and abiotic stresses in plants [16]. Many related functional genes have been identified and verified. Wei et al. reported that the PtFAD3/7/8 gene, which is induced by abiotic stress, plays an important role in regulating the transformation of linoleic acid and linolenic acid and is crucial for resisting osmotic stress [17]. Shi et al. isolated rice fatty acid desaturase gene 2 (OsFAD2) from rice, and the overexpression of OsFAD2 in the rice vegetative stage revealed that the cold tolerance of transgenic plants significantly increased and that the grain yield greatly improved [18]. In this study, candidate genes of the FAH family in four different cotton species were identified, a phylogenetic tree of FAH genes was constructed according to their genome files and protein sequences, and the evolutionary selection relationships of FAH gene members were analyzed. In addition, the physical and chemical properties, subcellular localization prediction, gene structures, conserved motifs, collinearity and cis-acting element distributions of the cotton FAH family members were analyzed. Moreover, the expression of FAH members in upland cotton after salt stress was further detected. The function of cotton FAH family members was further verified via VIGS experiments. This study helps elucidate the evolution and function of the FAH superfamily in cotton and provides useful information for further exploration of the molecular mechanism of how the FAH superfamily is involved in fatty acid synthesis, sterol biosynthesis and the carotenoid pathway.
Results
Identification of FAH gene family members in cotton
A total of 129 FAH gene family members were identified from four cotton species, including 43 members in Gossypium hirsutum, 42 members in Gossypium darwinii, and 22 members in Gossypium raimondii and Gossypium arboreum. In addition, the physicochemical properties, including the starting position of the genes on the chromosome, gene length, molecular weight, and isoelectric point of the proteins encoded by the 129 genes identified, were analyzed (Supplementary Table S1). We found that the number of amino acids encoded by FAH family members ranged from 147 to 857, and the average number of amino acids was 410. The molecular weights of the FAH family proteins ranged from 17410.1 Da to 97252.8 Da, with an average molecular weight of 7432 Da. The theoretical isoelectric point of these proteins ranged from 6.96 to 9.66, and the average isoelectric point was 8.36, which was weakly alkaline. To understand the location of this family in cells, we predicted the subcellular localization of these 129 FAH members (Supplementary Fig. S1). The results revealed that all FAH family members were highly expressed on the plasma membrane of plant cells, indicating that the FAH family may play an important role in the plasma membrane.
Phylogenetic and evolutionary selection analysis of the cotton FAH gene family
To study the evolutionary relationships among the members of the cotton FAH family, a phylogenetic tree was constructed using the FAH protein sequences of the four Gossypium species mentioned above (Supplementary Fig. S2). According to the study of genetic distance, 129 members of the FAH family were divided into five subgroups. The largest subgroup was Group 1, with a total of 49 members. In this branch, G. hirsutum, G. darwinii, G. arboreum and G. raimondii had 16, 15, 9 and 9 members, respectively. The smallest subgroup was Group 2, with only 6 members. The other three subgroups had 36, 13 and 26 FAH family members, respectively. Members of the same branch tend to have more similar evolutionary relationships and similar functions. According to the phylogenetic tree, most of the homologous genes between allotetraploid cotton and diploid cotton clustered together in the same group, indicating that the cotton FAH gene family is expanding.
To further study the selection events of the FAH family in the evolutionary process and determine whether there is selective pressure on the protein-coding genes of the family, we used KaKs_Calculator2.0 software to calculate the Ka (nonsynonymous substitution) and Ks (synonymous substitution) values of FAH homologous gene pairs in cotton and the Ka/Ks ratio (Supplementary Table S2). The results revealed that in the development and evolution of all FAH gene pairs, the incidence of synonymous base substitutions was much greater than that of nonsynonymous base substitutions, and the Ka/Ks ratio was far less than 1, so it was not affected by natural selection. The results revealed that these genes experienced purifying selection during evolution.
Analysis of the gene structure and conserved motifs of the FAH gene family
To better understand the evolutionary relationships among different members of the FAH gene family, we constructed a phylogenetic tree with an intron‒exon structure and conserved motifs for the four cottons on the basis of the FAH protein sequence. The FAH gene region features a rich distribution of introns, with each FAH gene containing at least one intron. Compared with the other four subgroups, the FAH gene in subgroup 1 contained more introns, with a maximum of 11 introns. The diversity of the FAH gene structure indicates that different selection events may have occurred during the process of gene evolution. Notably, in the four cotton species, the genes closely related and closely related in the phylogenetic tree tended to have more similar exon and intron arrangements, indicating that the exon‒intron structure is highly correlated with the phylogenetic relationship between FAH genes.
Conserved motifs are often related to the function of proteins. To reveal the characteristic motifs of the FAH gene family, MEME software was used to identify the conserved motifs in the FAH protein. A total of 10 conserved motifs, named motifs 1 to 10, were identified in the FAH gene family (Supplementary Fig. S3). The motif sequences are presented in Supplementary (Figs. S4-S7).Through the analysis of conserved motifs, we found that different motifs were missing in all 129 FAH gene family members. However, the FAH genes in G. hirsutum contained motif 3 and motif 5, the FAH genes in G. darwinii contained motif 4 and motif 6, the FAH genes in G. arboreum contained motif 7 and motif 3, and the FAH genes in G. raimondii contained motif 5 and motif 4. The FAH gene was highly conserved in five different subgroups of the family, and the number and arrangement of motifs in the group were similar.
Chromosome localization and collinearity analysis of the cotton FAH gene family
The position distribution map of the FAH gene family on the chromosomes of the four species of cotton was drawn via TBtools software (Fig. 1). The results revealed that 21 of the 43 FAH gene family members in G. hirsutum were distributed in the At subgenome and were distributed in A03, A05–A09, A12 and A13; 22 genes were distributed in the Dt subgenome and were distributed in D02–D04, D06–D09, D12 and D13. Most of these genes are distributed at both ends of the chromosome. In G. darwinii, which is also tetraploid, we also found that the FAH gene has a similar distribution on the chromosomes. Among the 42 FAH gene members, 20 were distributed on chromosomes A03, A05-A09, A12 and A13 of the At subgenome, and 22 were distributed on chromosomes D02-D04, D06-D09, D12 and D13 of the Dt subgenome. Notably, four genes on chromosome A03 of G. hirsutum and G. darwinii, two allotetraploid cottons, presented homologous genes on chromosome D02. This phenomenon may be due to the large fragment replication of chromosomes in upland cotton during evolution. The above results suggest that the At and Dt genomes of G. hirsutum and G. darwinii may have originated from the same two ancestors and that the FAH gene family is relatively conserved in evolution, but some gene deletions still occur. In the At genome-donor diploid cotton G. arboreum, 22 FAH genes were distributed on chromosomes A01, A03-A04, A06-A09, A12 and A13. In Gossypium raimondii, which is a diploid donor of the Dt genome, 22 FAH members are distributed on chromosomes D01, D03-D08, D12 and D13.
Collinearity analysis can explain the homology between genes well, and collinear homologous sequences may have similar functions. Therefore, MCScanX and Circos software were used to analyze and map the collinearity of the FAH gene families of the four different cotton varieties. In the G. arboreum genome, the FAH gene family has collinear genes located between chromosomes A08, A09 and A12 and between chromosomes A04 and A06 (Fig. 2C). The collinearity of the FAH gene in Gossypium raimondii occurred mainly among the D01, D05, D08, D09, D10 and D12 chromosomes (Fig. 2D). In the tetraploid cotton species Gossypium hirsutum and Gossypium darwinii, most of the collinear relationships between genes occur on the same chromosome or between homologous chromosomes (Fig. 2A-B). Moreover, we also found that the FAH gene has a collinear relationship with the A03 and D02 chromosomes of the two tetraploid cotton species, which is similar to the previous distribution of FAH homologous genes on chromosomes.
Collinearity analysis of cotton FAH genes in Gossypium hirsutum (A), Gossypium darwinii (B), Gossypium arboreum (C), and Gossypium raimondii (D). Note: The gray lines represent the collinearity relationships of all genes within the genomes of Gossypium hirsutum (A), Gossypium darwinii (B), Gossypium arboreum (C), and Gossypium raimondii (D). The red lines represent the collinearity relationships of the FAH gene family across the genomes
Analysis of cis-acting elements of the cotton FAH gene family
To understand the potential function of the FAH gene family, this study analyzed the 1500 bp promoter sequence upstream of the FAH gene to detect cis-acting elements (Fig. 3). In the FAH gene family, various cis-acting elements, including the basic TATA box and CAAT box, are involved in the response to both biotic stress and abiotic stress. Additionally, elements involved in abiotic stress and signal transduction, such as MYB, ABRE, CAT-box, and HD-zip1, have been identified. The binding sites of ABREs and MYBs play a role in the abscisic acid response, whereas the CAT box is a cis-regulatory element associated with meristem expression. HD-zip is involved in the regulation of adverse stress in plants. These results clearly indicate that FAH genes may be involved in the regulation of biotic or abiotic stresses, plant growth and development, signal transduction pathways, and other physiological processes.
RT‒qPCR analysis of the cotton FAH gene family
Through the published transcriptome data of upland cotton (Supplementary Table S3), the expression of 43 FAH genes at different time points after salt stress was analyzed (Fig. 4). The results revealed that the FAH gene of upland cotton was strongly affected after salt treatment. According to the transcriptome data, eight FAH genes were randomly selected for RT‒qPCR verification (Fig. 5). We found that the expression levels of the eight selected GhFAH genes significantly changed after salt stress. Among them, Gohir.A03G100200, Gohir.A05G333900, Gohir.A12G221400, Gohir.D07G113700 and Gohir.D02G125200 were significantly upregulated under salt stress. Gohir.A03G045300, Gohir.A12G165300, and Gohir.D13G211952 was significantly downregulated. The RT‒qPCR results for these eight genes were consistent with the trend of transcriptome expression. In summary, the FAH gene can quickly regulate its expression via transcription within 24 h after salt stress to cope with tolerance to salt stress.
The expression levels of FAH genes in different periods after salt stress in Gossypium hirsutum. Heatmap showing gene expression in the leaf tissues of the CRI12 and XiangFZ031 varities. The colors range from red to white, indicating the differential gene expression levels from high to low. * indicates P ≤ 0.05, ** indicates P ≤ 0.01. Not labeled indicates no significant difference
Virus-induced gene silencing (VIGS) of FAH genes leads to increased sensitivity of plants to salt stress
Virus-induced gene silencing (VIGS) is an effective method for studying gene function. To explore the role of the FAH gene family in cotton, we constructed a VIGS vector for silencing Gohir.A03G045300 in upland cotton ‘Xiang FZ031’. Approximately two weeks after Agrobacterium infection, the true leaves of TRV2:CLA1 plants presented an albino phenotype (Fig. 6A), indicating that the VIGS vector was active and effective. In addition to the positive control plants, we used 200 mM salt solution to test the stress tolerance of the cotton plants and observed their phenotype after 14 days. The growth status of the TRV2:Gohir.A03G045300 plants was much slower than that of TRV2:00. We subsequently measured the shoot fresh weight and plant height of TRV2:Gohir.A03G045300 and TRV2:00 (Fig. 6B, D, E). Compared with those of the control plants, the plant height and fresh weight of the silenced plants were significantly lower. We obtained true leaves from TRV2:Gohir.A03G045300 seedlings for RNA extraction to verify the effect of gene silencing. The expression levels of these genes in the TRV2:Gohir.A03G045300 plants were significantly inhibited (Fig. 6C), which proved that gene silencing was successful. In summary, gene silencing experiments revealed that the TRV2:Gohir.A03G045300 weakened its tolerance to salt stress after silencing, suggesting that Gohir.A03G045300 may be involved in the response of cotton to salt stress.
VIGS experiment of the FAH gene family in Gossypium hirsutum. (A): The phenotype of the plant after 14 days of Agrobacterium infection. The left side is TRV2: CLA1 (positive control), the middle is TRV2: 00 (negative control), and the right side represents TRV2: Gohir.A03G045300 (the silenced plants). (B): Phenotypic comparison between negative control plants and the silenced plants. The right side is TRV2: 00 (negative control), and the left side is TRV2: Gohir.A03G045300 (the silenced plants). (C) Comparison of expression levels of Gohir.A03G045300 between the silenced plants and the negative control plants., (D) Comparison of plant height between the silenced plants and the negative control plants. (E) Comparison of shoot fresh weight between the silenced plants and the negative control plants The calculation results were analyzed via a t test. The asterisk indicates that there is a significant difference at the 0.05 probability level according to the t test. * indicates P ≤ 0.05, * indicates P ≤ 0.01
Discussion
The fatty acid hydroxylase (FAH) family is a family of proteins containing fatty acids, carotenoid hydroxylases, and sterol desaturases, which are widely present in bacteria, plants, and animals. These proteins have been widely studied for their roles in plant growth and development [19] and in regulating biotic stress [20] and abiotic stress [21]. However, there are no reports on the regulation of the cotton FAH gene after salt stress. In this study, bioinformatics methods were used to systematically identify the members, evolutionary relationships, and structural characteristics of FAH genes in cotton, laying the foundation for further studies of the specific mechanism by which FAH genes regulate salt stress.
The formation of gene families is caused by genome-wide replication events or the polyploidization of genomes. This is a large-scale chromosome doubling event, which simultaneously increases the number of all genes in the species, resulting in many chromosome doubling fragments remaining in the genome [22]. A group of these genes are similar in structure and function and encode similar proteins [23]. Allotetraploid cotton (AADD) G. hirsutum and G. darwinii have undergone genome polyploidization events during evolution, and studies have shown that the At genomes of G. hirsutum and G. darwinii are derived from G. arboreum and that the Dt genome is derived from G. raimondii [24]. In this study, 43 and 42 FAH genes were identified in the allotetraploid cotton strains Gossypium hirsutum and Gossypium darwinii, respectively. Twenty-two FAH members were identified in Gossypium arboreum, and 22 FAH members were identified in Gossypium raimondii. The number of FAH genes in tetraploid upland cotton is roughly equal to the sum of the number of FAH genes in its two diploid ancestors. These findings indicate that there was no significant gene loss in the FAH gene family during the evolution of allotetraploid cotton. This phenomenon is similar to the results of genome-wide analysis of another PFK family, where 26 genes were identified in Gossypium hirsutum and Gossypium barbadense, and 14 PFK genes were identified in the diploid ancestors Gossypium arboreum and Gossypium raimondii [25].
According to the distribution of FAH family members on chromosomes, FAH members with similar structures and close locations on chromosomes A03, D02, A06 and D06 of tetraploid cotton exist. This may have been a tandem duplication event during the evolution of the FAH gene. Tandem duplication mainly occurs in the chromosome recombination region. These genes are closely arranged on the same chromosome, with similar sequences and similar functions [26]. Notably, we found four FAH members on the A03 chromosome of Gossypium hirsutum, namely Gohir.A03G100100, Gohir.A03G100200, Gohir.A03G098000, Gohir.A03G100100, and the homologous genes of these four members are located on chromosome D02, which is similar to the situation in the tetraploid Gossypium darwinii, probably because of fragment replication between chromosomes A03 and D02. Chromosome tandem duplication and fragment duplication may play important roles in the large-scale expansion of cotton species. A similar situation has been reported in the study of the cotton ABC gene family [27].
Ka/Ks evolutionary selection analysis and intragenomic collinearity analysis can be used to study evolutionary selection and tandem repeat events in gene families and then to study the evolutionary process of the family. Ka/Ks and collinearity analyses were performed on the FAH family members of four different cotton species. We found that in tetraploid cotton, the collinear region of FAH is concentrated mainly between homologous chromosomes. According to the Ka/Ks results, most genes underwent purifying selection during evolution. This finding indicates that base substitution in the coding sequence of these genes does not change the composition of the protein; thus, they are not affected by natural selection, and the function of these genes is well preserved [28]. Similarly, we also found that FAH family members have many gene pairs with a linear relationship between chromosomes A03 and D02. The results of collinearity, Ka/Ks, were consistent with the results of chromosome localization, indicating that many chromosome recombination events occurred on the homologous chromosomes of tetraploid cotton and that large chromosome duplications occurred on A03 and D02. Moreover, FAH genes are conserved in chromosome doubling and evolution. Fragment duplication has been important in evolution because many plants have multiple repetitive chromosomal blocks [29]. Furthermore, Arabidopsis has undergone two genome-wide duplications (WGDs) [30], and some studies have shown that the GhAAI gene family [31] and GhARF gene family [32] of cotton have also expanded. We concluded that large fragment replication and genome polyploidization are the main methods for the formation of the cotton FAH gene family. These findings improve our understanding of chromosome interactions, information exchange, and genetic evolution in cotton.
Through the analysis of conserved motifs and gene structure members of the FAH gene family, it was found that FAH had different degrees of deletion in motifs but still had similar motif distribution patterns, especially among FAH members of the same subfamily. The number of introns contained in different subfamilies of the FAH gene was also quite different. It has been reported that introns may play a key role in the evolution of different species [33]. In the early stages of gene expansion, the introns of some genes are lost over time [34]; when introns are subjected to weak selection pressure, genes without introns may evolve rapidly, and when a gene has more introns, it will promote species evolution and survival advantages [35]. Therefore, we speculate that some FAH genes in cotton have gradually lost introns over time and changed their functions during evolution.
In this study, the cotton FAH family members were predicted by subcellular localization, and all the members were found to be located on the plasma membrane, which means that FAH may be responsible for material transport and signal transduction between cells. Studies have shown that phytosterols, which are mainly components of the cell membrane, affect the structure and function of the cell membrane by mediating the biosynthesis of brassinolides (BRs), thereby increasing the adaptability of plants to abiotic stresses [36]. The prediction of cis-acting elements revealed that many regulatory elements, such as MYB, MBS, HD-zip1 and ABRE, are related to stress upstream of the cotton FAH gene promoter. MYB transcription factors have been widely reported to be associated with abiotic stress regulation in plants. It is known for its highly conserved MYB domain at the N-terminus, and its conserved sequence is CAACCA. In A. thaliana, after overexpression of the MYB12 gene, flavonoids significantly accumulate in transgenic plants, resulting in increased antioxidant capacity and drought tolerance in A. thaliana [37]. In addition, MYB transcription factors can regulate the expression levels of ABA signaling and SOS2 to increase plant salt tolerance [38]. During the germination stage of A. thaliana seeds, MYB7 significantly increases the germination rate under salt stress through negative regulation, and MYB42 enhances the salt tolerance of A. thaliana seedlings by positively regulating the expression of SOS2 [39].
Furthermore, the cotton FAH gene also has several elements related to light reactions, such as ACA motifs, G boxes, and GA motifs. In the process of plant evolution, complex mechanisms can be formed to perceive light intensity and other factors and affect a series of developmental events, such as seed germination and seedling formation [40]. The existence of these stress-related regulatory elements may be one of the important reasons for the rapid and significant differential expression of the cotton FAH gene after salt stress, but the specific regulatory mechanism is still unclear. Additionally, we further explored the function of the cotton FAH family through VIGS experiments. We silenced Gohir.A03G045300 in XiangFZ031 cotton, and the results revealed that the growth of the silenced plants was significantly affected. Moreover, we identified the homologous gene AT1G07420 in A. thaliana, which contains a fatty acid hydroxylase domain, and studies have shown that AT1G07420 can regulate the growth and development of A. thaliana [41]. In summary, this FAH genome-wide analysis facilitates further exploration of the regulatory mechanisms of salt stress within this gene family and establishes a foundation for identifying high-yield, salt-tolerant germplasm resources in cotton.
Conclusions
Cotton is one of the most salt-tolerant crops, but excessive salt can cause ionic toxicity, osmotic stress and oxidative harm to cotton. In this study, a series of bioinformatics analyses were used to analyze the cotton FAH family, and the 129 FAH genes of four Gossypium species were divided into five subgroups. FAH genes located in the same subgroup presented similar gene structures and a consistent distribution of conserved motifs. Moreover, we used RT-qPCR to detect changes in the expression of FAH genes in cotton leaves after salt stress. Gene silencing experiments of Gohir.A03G045300 revealed that this gene may be involved in the response of cotton to salt stress.
Materials and methods
Plant materials
The salt-tolerant line XiangFZ031 and the salt-sensitive upland cotton parent CRI12 were used as experimental materials. XiangFZ031, also known as Nationally Registered Cotton Variety No. 20,210,031, was developed by crossing Gossypium darwinii, a tetraploid wild cotton species with excellent salt tolerance, with an upland cotton variety CRI12. This process was followed by three backcrosses to CRI12 and marker-assisted selection from the BC3F8 generation. Healthy seeds of this species were selected and planted in pots at a ratio of 3:1 of nutrient soil and vermiculite. It was cultivated at 25 °C in a 16 h:8 h light: dark environment. When the second cotyledon fully expanded, the cotton plants were divided into control and experimental groups. The experimental group was exposed to 200 mM NaCl solution once. After 24 h, the first true leaves of the two groups of cotton plants were collected for subsequent RNA extraction and RT‒qPCR experiments. For the VIGS experiment, the salt-tolerant strain XiangFZ031 was used as the material. Agrobacterium infection was performed when the two cotyledons of the plant were fully expanded. After the albino phenotype appeared, salt treatment was performed. Saline (200 mM) was added every two days, and the phenotype was measured after 14 days.
Identification analysis of FAH gene family members in cotton
The genomic data of the tetraploid cotton species Gossypium hirsutum (V2.1) and Gossypium darwinii (V1.0) and their common ancestor Gossypium raimondii (V2.1) were downloaded from the Phytozome V13 database (https://phytozome-next.jgi.doe.gov/), including CDSs, gene annotation files and protein sequences [42,43,44]. The G. arboreum CRI genome and protein sequence were obtained from the CottonGen genome database (https://www.cottongen.org/) [45]. The Pfam number PF04116 of the FAH gene family was searched through the Pfam database (http://pfam-legacy.xfam.org/) [46], and the hidden Markov model (HMM) of the FAH gene family was downloaded. The sequences containing FAH protein domains in the protein files of the four cotton species were searched via HMMER 3.0 software and BLASTP. The E value was set to 1e-10 to screen candidate protein sequences. The candidate protein sequences were uploaded to the SMART database (http://smart.embl.de/) [47], Pfam database (http://pfam-legacy.xfam.org/) and CDD search in the NCBI database (https://www.ncbi.nlm.nih.gov/cdd/) [48] for reidentification. The amino acid sequences of all members of the FAH gene family of the four cotton species were analyzed via ExPASy proteomics server software (http://www.expasy.org) [49], and the amino acid length and isoelectric point (PI) were calculated. In addition, the subcellular localization of FAH family members was predicted via the WoLF PSORT website (https://wolfpsort.hgc.jp/) [50].
Construction and evolution analysis of the FAH gene family
To further analyze the relationships between the evolution of FAH family members, the obtained protein sequences were subjected to multiple sequence alignment via MEGA7 [51] software, and multiple sequence alignment was performed via MEGA7 with the Alignment - Align by ClustalW option, with default parameters. A phylogenetic tree was constructed via the neighbor‒joining method with default parameters [52]. The online site Evolview (https://evolgenius.info//evolview-v2/#login) [53] was used to further modify and beautify the phylogenetic tree. This study also constructed a local index of the cotton FAH genome gene sequence, and compared the whole-CDS data with Blastp via the Blastall program with an E value of 1e-20, and the results of the genome comparison were obtained. The synonymous substitution rate (Ks), nonsynonymous substitution rate (Ka), and Ka/Ks of cotton FAH genes were calculated via Calculator 3.0 [54] to analyze the gene selection events of the FAH gene family during evolution and development.
Gene structure and conserved motif analysis
The MEME website (http://memesuite.org/) [55] was used to analyze the conserved motifs of FAH gene family members. The total number of parameters used to establish the search motifs was 10, the shortest motif length was 6 base pairs, and the longest motif length was 50. To analyze the structural information of FAH genes, the exons of FAH genes on chromosomes and the location information of the CDS, 3’-UTR and 5’-UTR were extracted. The online website GSDS (http://gsds.gao-lab.org/) [56] was subsequently used to analyze the structural information of the FAH gene family, and a gene structure map was drawn. Finally, TBtools software was used to merge and visualize the phylogenetic tree, gene structure and conserved motif images of the FAH genes.
Chromosomal localization and collinearity analysis
The position information of the FAH genes on the chromosomes was obtained from four cotton gene annotation files, and then the position distribution map of the FAH genes on the cotton chromosomes was drawn via MapChart [57]. By comparing the sequences of all the FAH proteins, multiple Collinearity Scan toolkit (MCSCANX) [58] software was used to determine and analyze the duplication doubling events and collinearity between the FAH gene family in the cotton genome.
Analysis of cis-acting elements
To explore the related functions of gene expression regulation, the promoter sequences of 1500 base pairs upstream of the start codon of FAH genes were obtained from the genomic files of four cotton species. The cis-acting elements of genes were identified and analyzed via the PlantCARE database (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) [59], and the results were visualized via the GSDS online website (http://gsds.gao-lab.org/).
Expression analysis of the FAH gene family under salt stress
Through the online G. hirsutum transcriptome public database, the expression of FAH gene family members after salt treatment at different time points was obtained, and the unit was FPKM. TBtools software was used to construct the expression heatmap. Ten genes were randomly selected from the FAH gene family members of upland cotton for real-time quantitative PCR (RT‒qPCR) analysis. RT‒qPCR primers for the FAH gene family were designed via Primer Premier 5 software (Supplementary Table S4). For the RT‒qPCR experiments, a ChamQ SYBR qPCR Master Mix (LowROX Premixed) kit was used. The reaction volume was 20 µL. The amplification procedure was as follows: predenaturation at 95 °C for 30 s; denaturation at 95 °C for 10 s; and annealing at 60 °C for 30 s for 40 cycles. GhUBQ was used as an internal reference gene, and the relative expression level of the gene was quantified via the 2−ΔΔCt method [60]. Three biological replicates and four technical replicates were used in the experiment. Finally, the GraphPad Prism software was used to visualize the RT‒qPCR results.
Virus-induced gene silencing (VIGS) experiment
The upland cotton line XiangFZ031 was selected as the material for VIGS. Healthy seeds were soaked in carbendazim, sterilized, and then planted in pots mixed with nutrient soil and vermiculite (3:1). The climate room temperature was maintained at 25 °C, and the ratio of light to darkness was 16 h/8 h. When the cotyledons of the cotton plants had fully expanded and before the first true leaf tips had just appeared, the VIGS experiment was carried out. The primers designed by Primer Premier 5 were used for the VIGS reaction and ligated into the pTRV2 vector to obtain the recombinant expression vector. Gohir.A03G045300 was amplified with an upstream primer (5’-AGTAGTCTGCCATTTCCATCCT-3’) and a downstream primer (5’-CAATGTGCCTCAACTGTCTCTA-3’). The primer sequence was inserted into the pTRV vector and treated with the BamHI-I and SacI enzymes to construct pTRV2: Gohir.A03G045300. The resulting plasmid was subsequently transformed into Agrobacterium tumefaciens (GV3101). After positive clones were selected, the bacterial mixture was injected into the cotyledons of the cotton plants via a sterile syringe. After 24 h of dark treatment, the plants were transferred to the artificial climate chamber, and the positive control presented an albino phenotype after approximately two weeks. After that, the plants were treated with 200 mMof saline for 14 days, after which the plant height and shoot fresh weight were measured.
Data availability
All the data generated in the current study are included in this published article and its supplementary information files.
Abbreviations
- FAH:
-
Fatty acid hydroxylases
- RT-qPCR:
-
Quantitative real-time polymerase chain reaction
- GSDS:
-
Gene structure display server
- HMM:
-
Hidden markov model
- MCScanX:
-
Multiple collinearity scan toolkit
- VIGS:
-
Virus-induced gene silencing
References
Toth TN, Chukhutsina V, Domonkos I, et al. Carotenoids are essential for the assembly of cyanobacterial photosynthetic complexes. BBA-Bioenergetics. 2015;1847(10):1153–65.
Liang MH, Li XY. Involvement of transcription factors and regulatory proteins in the regulation of carotenoid accumulation in plants and algae. J Agric Food Chem. 2023;71(48):18660–73.
Bendich A, Olson JA. Biological actions of carotenoids. FASEB J. 1989;3(8):1927–32.
Gupta HS, Hossain F, Muthusamy V. Biofortification of maize: an Indian perspective. Indian J Genet Plant Breed. 2015;75(1):1–22.
Schaub P, Rodriguez-Franco M, Cazzonelli CI, et al. Establishment of an Arabidopsis callus system to study the interrelations of biosynthesis, degradation and accumulation of carotenoids. PLoS ONE. 2018;13(2):e0192158.
Kang L, Ji CY, Kim SH, Ke Q, Park SC, Kim HS. Suppression of the β-carotene hydroxylase gene increases β-carotene content and tolerance to abiotic stress in Transgenic Sweetpotato plants. Plant Physiol Biochem. 2017;117:24–33.
Du H, Wang N, Cui F, et al. Characterization of the β-Carotene hydroxylase gene DSM2 conferring drought and oxidative stress resistance by increasing xanthophylls and abscisic acid synthesis in rice. Plant Physiol. 2010;154(3):1304–18.
Bard M, Bruner DA, Pierson CA, et al. Cloning and characterization of ERG25, the saccharomyces cerevisiae gene encoding C-4 sterol Methyl oxidase. PNAS. 1996;93(1):186–90.
Husselstein T, Schaller H, Gachotte D, et al. Delta7-sterol-C5-desaturase: molecular characterization and functional expression of wild-type and mutant alleles. Plant Mol Biol. 1999;39(5):891–906.
Kunst L, Samuels AL. Biosynthesis and secretion of plant cuticular wax. Prog Lipid Res. 2003;42(1):51–80.
Zhu G, Koszelak-Rosenblum M, Connelly SM, et al. The crystal structure of an integral membrane fatty acid α-Hydroxylase. J Biol Chem. 2015;290(50):29820–33.
Du Y, Fu X, Chu Y, Wu P, Liu Y, Ma L, Tian H, Zhu B. Biosynthesis and the roles of plant sterols in development and stress responses. Int J Mol Sci. 2022;23(4):2332.
Beck JG, Mathieu D, Loudet C, Buchoux S, Dufourc EJ. Plant sterols in rafts: a better way to regulate membrane thermal shocks. FASEB J. 2007;21(8):1714–23.
Surjus A, Durand M. Lipid changes in soybean root membranes in response to salt treatment. J Exp Bot. 1996; 17–23.
Clouse SD, Sasse JM. Brassinosteroids: essential regulators of plant growth and development. Annu Rev Plant Biol. 1998;49:427–51.
Upchurch RG. Fatty acid unsaturation, mobilization, and regulation in the response of plants to stress. Biotechnol Lett. 2008;30(6):967–77.
Wei H, Movahedi A, Xu SZ, Zhang YY, Liu GY, Aghaei-Dargiri S, Zefrehei MG, Zhu S, Yu CM, Chen YH et al. Genome-wide characterization and expression analysis of fatty acid desaturase gene family in poplar. Int. J. Mol. Sci. 2022; 23(19).
Shi J, Cao Y, Fan X, et al. A rice microsomal delta-12 fatty acid desaturase can enhance resistance to cold stress in yeast and Oryza sativa. Mol Breed. 2012;29(3):743–57.
Singh AK, Fu DQ, El-Habbak M, et al. Silencing genes encoding omega-3 fatty acid desaturase alters seed size and accumulation of bean pod mottle virus in soybean. Mol Plant Microbe Interact. 2011;24(4):506–15.
Lai EC. Lipid rafts make for slippery platforms. J Cell Biol. 2003;162(3):365–70.
Tanaka K, Asami T, Yoshida S, et al. Brassinosteroid homeostasis in Arabidopsis is ensured by feedback expressions of multiple genes involved in its metabolism. Plant Physiol. 2005;138(2):1117–25.
Xu G, Guo C, Shan H, et al. Divergence of duplicate genes in exon–intron structure. PNAS. 2012;109(4):1187–92.
Yu J, Wang J, Lin W, et al. The genomes of Oryza sativa: a history of duplications. PLoS Biol. 2005;3(2):266–81.
Li F, Fan G, Lu C, et al. Genome sequence of cultivated upland cotton (Gossypium hirsutum TM-1) provides insights into genome evolution. Nat Biotechnol. 2015;33(5):524–30.
Mehari TG, Xu Y, Umer MJ, et al. Genome-wide identification and expression analysis elucidates the potential role of PFK gene family in drought stress tolerance and sugar metabolism in cotton. Front Genet. 2022;13:922024.
Holub EB. The arms race is ancient history in arabidopsis, the wildflower. Nat Rev Genet. 2001;2(7):516–27.
Malik WA, Afzal M, Chen X, et al. Systematic analysis and comparison of ABC proteins superfamily confer structural, functional and evolutionary insights into four cotton species. Ind Crops Prod. 2022;177:114433.
Hurst LD. The Ka/Ks ratio: diagnosing the form of sequence evolution. Trends Genet. 2002;18(9):486.
Cannon SB, Mitra A, Baumgarten A, et al. The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biol. 2004;4:10.
Wang X, Wang H, Wang J, et al. The genome of the mesopolyploid crop species Brassica rapa. Nat Genet. 2011;43(10):1035–9.
Qanmber G, Lu L, Liu Z, et al. Genome-wide identification of GhAAI genes reveals that GhAAI66 triggers a phase transition to induce early flowering. J Exp Bot. 2019;70(18):4721–35.
Xiao G, He P, Zhao P, et al. Genome-wide identification of the GhARF gene family reveals that GhARF2 and GhARF18 are involved in cotton fibre cell initiation. J Exp Bot. 2018;69(18):4323–37.
Rearick D, Prakash A, Mcsweeny A, et al. Critical association of NcRNA with introns. Nucleic Acids Res. 2011;39(6):2357–66.
Roy SW, Gilbert W. The evolution of spliceosomal introns: patterns, puzzles and progress. Nat Rev Genet. 2006;7(3):211–21.
Roy SW, Penny D. A very high fraction of unique intron positions in the intron-rich diatom Thalassiosira pseudonana indicates widespread intron gain. Mol Biol Evol. 2007;24(7):1447–57.
Tanaka K, Nakamura Y, Asami T, et al. Physiological roles of brassinosteroids in early growth of Arabidopsis: brassinosteroids have a synergistic relationship with Gibberellin as well as auxin in light-grown hypocotyl elongation. J Plant Growth Regul. 2003;22(3):259–71.
Wang F, Kong W, Wong G, et al. AtMYB12 regulates flavonoids accumulation and abiotic stress tolerance in Transgenic Arabidopsis thaliana. Mol Genet Genom. 2016;291(4):1545–59.
Wang X, Niu Y, Zheng Y. Multiple functions of MYB transcription factors in abiotic stress responses. Int J Mol Sci. 2021;22(11):6125.
Sun Y, Zhao J, Li X, et al. E2 conjugases UBC1 and UBC2 regulate MYB42-mediated SOS pathway in response to salt stress in Arabidopsis. New Phytol. 2020;227(2):455–72.
Jiao Y, Lau OS, Deng XW. Light-regulated transcriptional networks in higher plants. Nat Rev Genet. 2007;8(3):217–30.
Zhang X, Sun S, Nie X, Boutte Y, Grison M, Li P, Kuang S, Men S. Sterol Methyl oxidases affect embryo development via auxin-associated mechanisms. Plant Physiol. 2016;171(1):468–82.
Goodstein DM, Shu S, Howson R, et al. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012;40(D1):D1178–86.
Chen ZJ, Sreedasyam A, Ando A, et al. Genomic diversifications of five Gossypium allopolyploid species and their impact on cotton improvement. Nat Genet. 2020;52(5):525.
Paterson AH, Wendel JF, Gundlach H, et al. Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres. Nature. 2012;492(7429):423.
Yu J, Jung S, Cheng CH, et al. CottonGen: a genomics, genetics and breeding database for cotton research. Nucleic Acids Res. 2014;42(D1):D1229–36.
Bateman A, Coin L, Durbin R, et al. The Pfam protein families database. Nucleic Acids Res. 2004;32:D138–41.
Letunic I, Copley RR, Pils B, et al. SMART 5: domains in the context of genomes and networks. Nucleic Acids Res. 2006;34:D257–60.
Marchler-Bauer A, Anderson JB, Chitsaz F, et al. CDD: specific functional annotation with the conserved domain database. Nucleic Acids Res. 2009;37:D205–10.
Gasteiger E, Gattiker A, Hoogland C, et al. ExPASy: the proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res. 2003;31(13):3784–8.
Horton P, Park KJ, Obayashi T, et al. WoLF PSORT: protein localization predictor. Nucleic Acids Res. 2007;35:W585–7.
Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33(7):1870–4.
Mello B. Estimating timetrees with MEGA and the timetree resource. Mol Biol Evol. 2018;35(9):2334–42.
Subramanian B, Gao S, Lercher MJ, et al. Evolview v3: a webserver for visualization, annotation, and management of phylogenetic trees. Nucleic Acids Res. 2019;47(W1):W270–5.
Zhang Z. KaKs_Calculator 3.0: calculating selective pressure on coding and non-coding sequences. GPB. 2022;20(3):536–40.
Bailey TL, Boden M, Buske FA, et al. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res. 2009;37:W202–8.
Hu B, Jin J, Guo AY, et al. GSDS 2.0: an upgraded gene feature visualization server. Bioinformatics. 2015;31(8):1296–7.
Voorrips RE. MapChart: software for the graphical presentation of linkage maps and QTLs. J Hered. 2002;93(1):77–8.
Wang Y, Tang H, Debarry JD, et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012;40(7):e49.
Lescot M, Dehais P, Thijs G, et al. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in Silico analysis of promoter sequences. Nucleic Acids Res. 2002;30(1):325–7.
Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Volume 25. Calif: Methods San Diego; 2001. pp. 402–8. 4.
Acknowledgements
The authors acknowledge School of Life Sciences, Nantong university for providing the laboratory and input facility for this experiment.
Funding
We appreciate financial help from Key Research and Development Project of Jiangsu Province, China (Modern Agriculture, BE2022364); Natural Science Foundation Project of Xinjiang Production and Construction Corps (2024DA027); State Key Laboratory of Cotton Bio-breeding and Integrated Utilization Open Fund (CB2024A06); The Practice Innovation Training Program Projects for College Students (202410304022Z, 2024131); High School Talent Student Training Program of Jiangsu Province (Su Qing Ke Fa [2024] No. 20).
Author information
Authors and Affiliations
Contributions
HG, WF and TGM performed most of the experiments and data analysis. YW, ZW, YX, YZ, JT, KZ, ZZ and WW helped in sample preparation and data analysis. RZ, JW and BW designed the experiments and edited the manuscript. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
All the seeds used for planting materials during the experiment were provided by our school and all the procedures were performed in accordance with international guidelines.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Supplementary Material 1:
Fig. S1: Subcellular localization prediction distribution map of the FAH family in Gossypium hirsutum (A), Gossypium darwinii (B), Gossypium arboreum (C), and Gossypium raimondii (D). Note that the color and size of the circles indicate the reliability of the prediction results
Supplementary Material 2:
Fig. S2: Phylogenetic relationships of FAH gene family members in Gossypium hirsutum, Gossypium darwinii, Gossypium arboreum and Gossypium raimondii. Each color represents a subgroup. Note: The phylogenetic tree was constructed via MEGA 7 neighbor-joining (NJ) with 1000 bootstrap replicates. The symbols in the figure represent different species of cotton. The star represents Gossypium hirsutum, the circle represents Gossypium darwinii, the triangle represents Gossypium arboreum, and the checkmark represents Gossypium raimondii. The numbers represent bootstrap values, ranging from 0 to 1.
Supplementary Material 3:
Fig. S3: Phylogenetic tree, gene structure and conserved motif analysis of the FAH gene family in Gossypium hirsutum (A), Gossypium darwinii (B), Gossypium arboreum (C), and Gossypium raimondii (D). Note: The phylogenetic tree was constructed via MEGA 7 neighbor-joining (NJ) with 1000 bootstrap replicates. The conserved motifs in FAH proteins were identified via MEME software. The gray lines indicate nonconserved sequences, and each motif is indicated by a colored box. The lengths of the motifs in each protein are presented to scale. The exon‒intron structure of the FAH gene is based on evolutionary relationships. Yellow rectangles indicate exons, and gray lines indicate introns
Supplementary Material 4:
Fig. S4: Motif Sequence Logos in Gossypium hirsutum. The size of each letter at each position reflects the degree of conservation in the analyzed sequences: larger letters indicate greater conservation and smaller letters indicate greater variability at that position
Supplementary Material 5:
Fig. S5: Motif Sequence Logos in Gossypium darwinii
Supplementary Material 6:
Fig. S6: Motif Sequence Logos in Gossypium arboretum
Supplementary Material 7:
Fig. S7: Motif Sequence Logos in Gossypium raimondii
Supplementary Material 8:
Table S1: Physical and chemical properties of FAH gene family members
Supplementary Material 9:
Table S2: Evolutionary selection analysis of FAH gene family members
Supplementary Material 10:
Table S3: FAH gene accession numbers from NCBI in Gossypium hirsutum
Supplementary Material 11:
Table S4: Primers used for RT‒qPCR analysis in Gossypium hirsutum
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Gu, H., Feng, W., Mehari, T.G. et al. Genome-wide analysis and functional validation of the cotton FAH gene family for salt stress. BMC Genomics 26, 271 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12864-025-11450-y
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12864-025-11450-y