Journal of Proteomics & Bioinformatics

Journal of Proteomics & Bioinformatics
Open Access

ISSN: 0974-276X

Research Article - (2024)Volume 17, Issue 2

A Comprehensive Study of ZmFBN Gene Family and their Biotic and Abiotic Stress Response in Maize

Salma Bashir, Babar Farid*, Muhammad Zeshan Zafar, Shoaib Ur Rehman, Muhammad Rauf Shah and Zunaira Atta
 
*Correspondence: Babar Farid, Department of Plant Breeding and Biotechnology, Muhammad Nawaz Sharif University of Agriculture, Multan, Punjab, Pakistan, Email:

Author info »

Abstract

The FBN gene family is essential to many biological processes, including plant development and response to a variety of biotic and abiotic stresses. However, there is currently no information about this gene family in maize. In this study, we performed a genome-wide study of the FBN gene family in maize, identified 6 ZmFBN genes by a different bioinformatics method, and further characterized them to understand their structure and function. For this purpose, genome-wide analysis is performed to identify the non-redundant FBN genes in the genome of maize. This research aims to explore the function of FBN genes in maize. Genomic and CDS sequences were used to reveal structural features. The evolutionary analysis was performed by constructing a phylogenetic tree. Synonymous and non-synonymous (Ka/Ks) ratios were also calculated for ZmFBN. Gene expression analysis was performed by using the NCBI GEO dataset and a heat map generated to identify the abiotic stress genes of FBN in maize. According to our results one out of six members ZmFBN3 (Zm00001d002384) of FBN gene family was regulated under stress condition, and supported the maize plant under abiotic condition.

Keywords

Maize; FBN gene; Grain yield; Abiotic stress; Gene expression

Introduction

The most frequently cultivated crop in the world is maize (Z. mays). Its reaction to numerous environmental stress factors is highly dynamic and complex in nature. On the other hand, climate change is predicted to intensify and spread biotic and abiotic stress factors more frequently. 80% of human food is produced by crops, with grains contributing to 50% of the world's food supply [1]. It provides the continually increasing human population by fulfilling their nutritional requirements either directly as a food for human consumption or indirectly as feed for livestock. Furthermore, due to its ability to produce bio-ethanol, maize, referred to as corn in the United States, serves as an environmentally friendly substitute to fossil fuels [2]. Because changing climatic conditions increase variability in maize production, it is essential to evaluate the overall impact of climate change on the growth and development of this staple crop in order to estimate its consumption. Temperature extremes, such as drought, nutrient shortage and minerals are among the various abiotic stresses that are predicted to have the biggest effect on the yield of maize overall. The two most significant climate variables, based on recent research, temperature and precipitation, with radiation additionally performing a significant role in yield. However, the current climate of droughts, temperature extremes and waterlogging has significantly lowered maize development thus decreasing its yield [3]. Due to drought or waterlogging, every year loses in maize yield [4].

Members of the fibrillin family play a role in a variety of functions including growth, development and resistance against abiotic stress. The most common type of protein found in chloroplast Plastoglobules (PGs) is a fibrillin (FBN) protein, which is plastid lipid-associated and highly conserved [5]. In plastid stability, plant growth and development, and stress response, FBNs have essential regulatory functions [6]. Widely present in photosynthetic species, including cyanobacteria and plants, FBN proteins have a conserved Plastid-lipids Associated Protein (PAP) domain [7,8].

In many processes including plastid structure stabilization, organ development, stress response and hormonal signal transmission, the FBN gene family has demonstrated important functions [9].

Many biological functions, including stress response, are significantly regulated by genes of the FBN family. FBN family genes in maize, yet, remain unknown and no comprehensive study of this family in maize has been published. In this study, bioinformatics tools were used to analyze the phylogenetic relationship, conserved domain, collinearity and cis-acting element in the ZmFBN gene family promoter regions. The results of this study, helps in future research into the biological function of the FBN gene during various phases of maize development.

Materials and Methods

ZmFBN gene family identification in maize and other crop

To identify all sequences related to FBN family, from TAIR database reference gene AtFBN was downloaded. Identified domain was BLAST in maize Genomic Database (GDB) to get putative genes of FBN gene family in maize. The domain was also blasted to find the putative FBN genes in A. thaliana, T. aestivum, G. hirsutum, A. comosus, B. rapa, G. max and P. trichocarpa. All the sequences were downloaded from Phytozome v13.

Protein motif analysis of ZmFBN

The motif analysis will be performed by using ZmFBN protein sequences on MEME. It will generate an XML file as result. The graphical presentation and sequence logos of each motif will appear on the result page. The graphic representation of motifs will also be downloaded TBtools software by using a MEME XML file.

Conserved domain analysis of ZmFBN

In each sequence, conserved domains were identified by submitting ZmFBN peptide sequences on the Conserved Domains Database (CDD) tool of National Center for Biotechnology Information (NCBI) and the generated CDD file used as input in TBtool software to generate its graphical structure. The CDD tool help in determining the molecular function of the protein [10,11].

Gene structure analysis of ZmFBN

Genomic and Complementary DNA Sequences (CDS) will be retrieved from https://gamma.maizegdb.org and the obtained sequences will be subjected to Gene Structure Display (GSD) server 2.0 for the development of the gene structure of ZmFBN.

Sequence logos and phylogenetic analysis

The peptide sequences of Arabidopsis thaliana and Z. mays were aligned by MEGA11. By using CLustal W all sequences were aligned and the structure was constructed using the TBtool software. For comparative phylogenetic and evolutionary analyses of FBN genes, downloaded and aligned by CLustal W, the FBN peptide sequences of maize and other crops were used to create a phylogenetic tree that used the neighbor-joining method to estimate the evolutionary history [12].

Chromosomal distribution, prediction of Ka and Ks ration and synteny analysis

To create the chromosomal length file, the maize genome assembly file will be used. Physicochemical properties data were used to perfumed chromosomal distribution analysis. The online tool PhenoGram will be used for chromosomal analysis. It will produce plots that depict where genes are located on chromosomes. Website selected is http://visualization.ritchielab.org/phenograms/plot as it is a versatile and user-friendly tool. The created pairings were processed through the TBtool program to calculate the rates of synonymous and non-synonymous substitution (Ka). The type of codon selection that resulted during evolution was further determined using the Ks/Ka ratio [13]. Further, using the formula T=Ks/2 and assuming a clock rate of (2*6.1*10^-9) substitutions/synonymous site/year for maize, the approximate period of duplication event was calculated. For collinearity analysis, the genome and General Feature Format (GFF) 3 files of maize and rice were used. Required files generated using one step MCscan. The results were visualized using TBtool software [14].

Sub-cellular localization, physicochemical properties and cis acting elements analysis of ZmFBN genes

By CELLO, Subcellular Localization Predictive System (SLPS) and WoLF PSORT subcellular localization was predicted. The physicochemical properties of ZmFBN genes were also identified by using Phytozome v13 and the Expasy Protparam tool. It describes the chromosome number, base pairs of peptides, CDS, transcript and genomic sequence, the molecular weight of protein and the number of introns and exons. For cis-element analysis of FBN genes, promoter regions ~ 2 kb upstream sequence region of the start codon were retrieved from Phytozome v13 and subjected to the PlantCare database for the identification of binding sites for the known transcription factor [15].

Protein-protein interaction network of ZmFBN

The structure prediction analysis has performed by using ZmFBNs protein sequences. Structures of all the FBNs were predicted through the online website https://yanglab.nankai.edu.cn/trRosetta/. To predict the Protein-Protein Interaction (PPI) of ZmFBN genes, protein sequences were subjected to the STRING search portal for the determination of PPI. A biological database and online resource for protein-protein interactions, known as STRING.

Gene expression profile of ZmFBN genes

For in-silico expression analysis of ZmFBN genes at different stages, transcriptional data of the genes were obtained from NCBI GEO DataSet. A heat map was produced by using TBtool software to identify the yield and quality controlling genes. All the analysis helped in the identification and characterization of the FBN gene family in maize.

Results

Identification and classification of ZmFBN genes

Through the use of the BLAST method, 6 ZmFBN genes were found in maize in the present research. The results provided basic information on the gene familial, CDS lengths, protein lengths, amino acid counts and molecular weights as well as their equipotential points, aliphatic indexes and GRAVY values as shown in Table 1. The protein length varied from 528 bp to 5857 bp in ZmFBN gene family, and number of exons in genes ZmFBN1 to ZmFBN6 ranges from 1 to 10. ZmFBN6 are hydrophobic proteins, according to the expected average hydrophilic coefficient (GRAVY), while others are hydrophilic proteins. Proteins found in the ZmFBN gees ranged from 176 bp to 361 bp amino acids in length from ZmFBN1 to ZmFBN6. Gene characteristics of ZmFBN genes, including Coding Sequence (CDS) length varied from 528 bp to 1084 bp, the molecular weight predicted for these proteins ranged from 18828.75 to 39237.95 kDa in ZmFBN genes respectively (Table 1).

Gene ID Gene name Genomic sequence Transcription sequence CDS bp Peptide bp PI Gravity Molecular weight  Amino acid
Zm00001d021351 ZmFBN1 5857 1458 1084 361 8.7 -0.062 39237.95 360
Zm00001d025574 ZmFBN2 3591 1475 945 315 5.9 -0.141 33610.25 314
Zm00001d002384 ZmFBN3 1994 1495 750 250 9.2 -0.081 27194.41 249
Zm00001d003470 ZmFBN4 5260 1498 957 319 5.4 -0.142 34280.94 318
Zm00001d033666 ZmFBN5 3767 1278 864 288 9.9 -0.277 31115.86 287
Zm00001d018348 ZmFBN6 528 528 528 176 10 0.185 18828.75 175

Table 1: Basic information of FBNs gene family in Z. mays.

Conserved domain, protein motif, gene structure and cis acting elements of ZmFBN genes

Exon/intron distribution patterns of genes have a relationship with their biological functions and the evolutionary relations between different species of plants. The conserved gene structure and domain analysis was analyzed by GSDS server 2.0 and CDD tool of NCBI. The results showed that ZmFBN family is divided into two subfamilies with typical subfamily domain.  The PAP fibrillin (PF04755) domain was present in all motifs contained the fibrillin which is plastid-lipids associated protein that plant growth and development, as well as plant response to both abiotic and biotic stress. The results indicated that the PAP fibrillin domain was present in two genes and PAP fibrillin super family was present in all genes as shown in Figure 1 and number of exons in genes ZmFBN1 to ZmFBN6 ranges from 1 to 10. One gene has no introns (ZmFBN6). The five of the six genes analyzed in this study contain introns. The two genes (ZmFBN4 and ZmFBN2) contained the two intron and three exons, one gene (ZmFBN2) have three introns and four exons and one gene (ZmFBN1) have nine introns and ten exons as shown in Figure 2. MEME analysis was used to determine the putative motifs of the FBN family in maize and all members contained one different motif for each subfamily, verifying that they belonged to the same subfamily as shown in Figure 3. The results showed that most of the members that are closely linked share common patterns. Plant growth, development and acceptability to biotic and abiotic stress were predicted functions of the cis acting essential components in the maize region. such as ABRE (involved in abscisic acid) and GARE (involved in gibberellins) and AUXR (involved in auxin responsiveness) to environmental challenges would increase in part because such stresses would trigger the activation of expansion genes with their responsive cis-acting elements. The gibberellin responsive elements were most abundant in the promoters of the ZmFBN2 gene. The Light Response (LR) elements are highest in ZmFBN1 and AUX and MYB elements are lowest in this gene. The MYB element was present only in one gene (ZmFBN1). The AUXNR was lowest in ZmFBN1 and ZmFBN3 genes and MeJAR elements were not present in ZmFBN3 gene (Figures 1-4a).

domain

Figure 1: Conserved domain analysis of FBN genes in maize. Equation

protein

Figure 2: Protein motif analysis of ZmFBN genes. Conserved motifs in maize FBN are indicated by color boxes. Equation Equation

maize

Figure 3: Representation of gene structure of FBN genes in maize. Exons, introns and UTRs are indicated with yellow boxes, black lines and blue boxes, respectively. Equation

analysis

Figure 4: Promoter analysis of ZmFBN genes different elements represented by different color schemes in graphical representation. (A): Bar fraph showing cis-acting elements; (B): Pie-chart showing cis-acting elements. Equation

Phylogenetic analysis, sequence logo, sub-cellular localization and protein interaction analysis

The 6 member of FBN genes in maize and divided into two groups. The red color indicates group1 and blue color group2 as shown in Figure 5a. To better understand the evolutionary relationships of the FBN family, a phylogenetic tree was constructed for 90 members of this family in seven different species, including OsFBN, AtFBN, GhFBN, GmFBN, AcFBN, PtFBN and BrFBN proteins as shown in Figure 5b. According to the phylogenetic tree, FBN family members can be classified into five groups, including group 1, group 2, group 3, group 4 and group 5. The group 3 contained 3 members, group 4 with 1 and group 5 contained 2 members of ZmFBN gene family. The phylostarum analysis of FBN gene family identified the earliest plant lineage. Further, the FBN genes were present in G. hirsutum, G. max and P. trichocarpa, dicotyledons (B. rapa), A. comosus (angiosperm), monocotyledons (B. distachyon) dicots (A. thaliana and monocots (Z. mays). The results of this study showed that the FBN gene family originated in phylostratum, an early land plant and that FBN orthologous genes may exist through the plant kingdom. These findings showed that the genes for FBN came from early land plants called phylostratum and that the plant kingdom contains probable orthologous genes for FBN (Figures 5a and 5b and 6).

Phylogenetic

Figure 5: (A): Phylogenetic analysis of ZmFBN gene family; (B): The phylostarum analysis of FBN gene family.

colors

Figure 6: Phylogenetic analysis of FBN genes in different species and different colors indicated different groups.

The subcellular localization of proteins using CELLO (Subcellular Localization Predictive System) results predicted that ZmFBN1 is located in outer membrane with variability value (2.500), ZmFBN2 is located in outer membrane with variability value (2.625), ZmFBN3 is located in periplasmic with variability value (1.996), ZmFBN4 is located in outer membrane with variability value (2.462), ZmFBN5 is located in periplasmic with variability value (3.121) and ZmFBN6 is located in periplasmic with variability value (1.805) as shown in Figures 6 and 7. Using WoLF PSORT, proteins' subcellular localization (Protein Subcellular Localization Prediction) predicted that ZmFBN located in outer membrane and periplasmic membrane as shown in Table 2. Sequence logos provide more specific information regarding sequence similarities, significant alignment characteristics and patterns of sequence conservation. The sequence logos of Arabidopsis and maize are showing divergence. Our results showed that the FBN gene family among all species is not conserved (Figures 7 and 8 and Table 2).

localization

Figure 7: Heat map of ZmFBN gene family for subcellular localization.

logos

Figure 8: Sequences logos of A. thaliana, Z. mays and G. max.

Gene Ids Chloroplast Nucleus Mitochondria Extracellular matrix Sub-cellular localization (significant)
ZmFBN1 14  - Outer membrane
ZmFBN2 14  - Outer membrane
ZmFBN3 12 1 1 Periplasmic
ZmFBN4 13  - 1 Outer membrane
ZmFBN5 10  - 4 Periplasmic
ZmFBN6 7  - 7 Periplasmic

Table 2: Prediction of subcellular localization of FBN protein through Wolf PSORT.

To understand the functions and metabolic pathways of the ZmFBN members, the protein-protein interaction network was predicted with the online String database as shown in Figure 9. Analysis predicted that protein ZmFBN has interaction with other proteins. Protein structure of FBN gens was restrained Rosetta (trRosetta) online tool as shown in Figure 10. The protein ZmFBN4 showed interaction with APE1, ZmFBN1, and GRMZM5G359786. The ZmFBN2 showed highest interaction with ZmFBN4, GRMZM5G359786, GRMZM5G124473 and ZmFBN3. The protein structure was determined for six ZmFBNs representative proteins, which shared the 90% similarities with each other. This level of structural similarity was adequate for analysis protein structures (Figures 9 and 10).

interaction

Figure 9: Protein-protein interaction of ZmFBN genes with other genes.

structures

Figure 10: Predicted structures of ZmFBN genes by trRosetta.

Chromosomal distribution, prediction of Ka and Ks ration and synteny analysis

The 6 FBN genes were distributed on five maize chromosomes on based of genomic data. ZmFBN5 gene were located on the chromosome 1, ZmFBN3 and ZmFBN4 genes were located on the chromosome 2, ZmFBN6 on chromosome 5, ZmFBN1 was located on chromosome 7 and ZmFBN2 gene was located on chromosome 10 as shown in Figure 11. The ZmFBN phylogenetic tree revealed only 2 gene pair (ZmFBN1, ZmFBN6 and ZmFBN2, ZmFBN4). The synonymous and non-synonymous (Ka/Ks) value of this gene pair was calculated to determine the extent and nature of selection pressure on the duplicated gene pair. The calculated Ka/Ks value=0.24608463. In this study, we found that ZmFBN genes showed Ka/Ks values greater than 1. The Ka and Ks value revealed that for 1 pair of segmental gene was present in maize as shown in Table 3. The synteny between species is useful for understanding how and why specific gene families and functions have evolved over time. The FBN genes in maize share homologous pairs with FBN genes in other plants, suggesting that FBN may have been crucial to the evolution of Z. mays. There were collinear gene pairs in Z. mays and O. sativa. 1.0 represented collinearity between chromosome 1 in Z. mays (ZmFBN5) and chromosome 3 in O.sativa (OsFBN1). The 2.0 represented collinearity between chromosome 2 in Z. mays (ZmFBN3) and chromosome 4 in O. sativa (OsFBN2). The 7.0 represented collinearity between chromosome 7 in Z. mays (ZmFBN1) and chromosome 7 in O. sativa (OsFBN3) (Figures 11 and 12 and Table 3).

chromosomes

Figure 11: ZmFBN gene positions on the chromosomes of maize. Members are divided in two groups present on different chromosomes. Equation

Synteny

Figure 12: Synteny analysis for duplicated genes pairs in Z. mays and O. sativa. Chromosomal lines represented by red color. The FBN genes syntenic regions indicated.

Gene-1 Gene-2 Ka Ks Ka/ks Time (MYA) Duplication Selection pressure
ZmFBN2 ZmFBN4 0.05120049 0.2080604 0.24608463 4 Segmental Positive

Table 3: Synonymous (Ks) and non-synonymous (Ka) calculation.

Gene expression analysis of ZmFBN genes

For gene expression analysis Gene Expression Omnibus (GEO) dataset was used. The high throughput sequencing for expression profiling. A group of hormones known as Brassinosteroids (BRs) are unique to plants and serve important functions in their physiology as well as the control regarding the way they react to stress. The effects of 24-epibrassinolide (EBL), a synthetic BR, on hydroponically grown maize seedlings were studied in this research using a combined physiological and molecular approach. To further examine the effects of this chemical on shoot growth, a 48-hour exposure period and a treatment concentration of 1 nM EBL were used. RNA sequencing was used to study the root gravitropic response. Exogenous management of EBL during the seedling stage resulted in a considerable decrease in root and shoot length, according to our results. The highest concentrations (10 and 100 nM) severely inhibited primary root growth (10% and 25%, respectively) after 24 hours. After 24 hours, the lowest concentration (1 nM EBL) showed no effect on PR formation. However, after 48 hours of treatment, the impacts on root growth were visible even at the lowest dose (1 nM) [16]. The expression of all FBNs in both leaf control, leaf EBL, shoot control and shoot EBL samples on vegetative and reproductive stages. Gene expression analysis revealed that one out of six members ZmFBN3 gene was highly regulated under both leaf control, leaf EBL, shoot control and shoot EBL conditions and supported the maize plant under stress condition (Figure 13).

stress

Figure 13: Expression of ZmFBN genes in leaf and shoots against EBL stress conditions. The high expression level is indicated by the red color and blue color and the low expression level indicated by the black color.

Discussion

As a unique transcription factor family in plants, the FBN gene family is essential for controlling plant growth, physiological functions, and stress response. Fibrillins (FBNs) are named after fibrils since these proteins were discovered in fibrils in dog rose chromoplasts (Rosa rugosa) and bell pepper (Capsicum annuum) fruit [17]. The creation of lipoprotein structures, photosynthesis and reactions to both abiotic and biotic stress are only a few of the crucial biological roles that FBN proteins play.

Conserved domains, motif patterns, gene chromosomal distributions, phylogenetic relationships, gene structures, synteny analyses and protein-protein structure analysis have been studied with bioinformatics in the present study. ZmFBN proteins were found in the periplasmic and outer membranes of cells based on predicted subcellular locations, indicating that these genes function as transcription factors.

ZmFBN genes in the same subgroup typically have intron and exon positions that are comparable, indicating that their functions are similar. One gene has no introns (ZmFBN6). The two genes (ZmFBN4 and ZmFBN2) contained the two intron and three exons, one gene (ZmFBN2) have three introns and four exons and one gene (ZmFBN1) have nine introns and ten exons.

The majority of ZmFBN proteins belonging to the same subfamily typically include similar motifs, according to the conserved motif analysis. The domains that we found can be divided into two groups. For ZmFBN members, the PAP family and PAP superfamilies are two of the most important domain groups. These ZmFBN genes are important in the response to environmental stress, according to the promoter analysis. The FBN family has been analyzed for synteny between maize and rice, and it was shown that the collinearity blocks between FBN members of the PAP subfamily were the highest. These findings point to segmental duplication events in the evolution of the FBN family, which allowed for its further expansion. Furthermore, fibril-like structures can be produced in vitro using a mixture of bicyclic carotenoids, lipids and FBN protein [18]. The most common proteins in chloroplast PGs are fibrillin (FBN) proteins, which are plastid lipid-associated and highly conserved proteins that are encoded by nucleus genes. Plant growth and development, plastid stability, and stress response all showed significant regulatory roles for FBNs.

The results may provide some helpful guidelines for developing new maize varieties with high susceptibility to develop in stress condition and also provided the theoretical basis of FBN-domain for agricultural applications. Although, this study might make easier to understand how FBN genes work in maize and other crops.

Conclusion

In this study, 6 FBN gene family members were investigated. We analyzed ZmFBNs using bioinformatics techniques such as gene identification, sequence logo, gene expression analysis phylogenetic analysis, chromosomal placement, interaction protein network analysis, cis-regulatory elements and gene structure. Determining the mechanism of stress management is also challenging. In-silico analysis provides useful information for future functional investigations in stress biology considering the fact that stress control is a complex system. The expression profile data also indicate that ZmFBN were responsive to numerous biotic and abiotic stressors. As a result, this study provides functional information on FBN genes to help improve plant growth, stress tolerance, and to help us comprehend the many developmental processes in maize.

References

Author Info

Salma Bashir, Babar Farid*, Muhammad Zeshan Zafar, Shoaib Ur Rehman, Muhammad Rauf Shah and Zunaira Atta
 
Department of Plant Breeding and Biotechnology, Muhammad Nawaz Sharif University of Agriculture, Multan, Punjab, Pakistan
 

Citation: Bashir S, Farid B, Zafar MZ, Rehman SU, Shah MR, Atta Z (2024) A Comprehensive Study of ZmFBN Gene Family and their Biotic and Abiotic Stress Response in Maize. J Proteomics Bioinform.17:668

Received: 14-Aug-2023, Manuscript No. JPB-23-26079; Editor assigned: 16-Aug-2023, Pre QC No. JPB-23-26079 (PQ); Reviewed: 30-Aug-2023, QC No. JPB-23-26079; Revised: 06-Sep-2023, Manuscript No. JPB-23-26079 (R); Published: 24-Jun-2024 , DOI: 10.35248/0974-276X.24.17.668

Copyright: © 2024 Bashir S, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Top