Similar literature
20 similar documents found (search time: 31 ms)
2.
In the post-genomic era, various computational methods that predict protein-protein interactions at the genome level are available; however, each method has its own advantages and disadvantages, resulting in false predictions. Here we developed a unique integrated approach to identify interacting partner(s) of Semaphorin 5A (SEMA5A), beginning with seven proteins sharing similar ligand interacting residues as putative binding partners. The methods include Dwyer and Root-Bernstein/Dillon theories of protein evolution, hydropathic complementarity of protein structure, pattern of protein functions among molecules, information on domain-domain interactions, co-expression of genes and protein evolution. Among the set of seven proteins selected as putative SEMA5A interacting partners, we found the functions of Plexin B3 and Neuropilin-2 to be associated with SEMA5A. We modeled the semaphorin domain structure of Plexin B3 and found that it shares similarity with SEMA5A. Moreover, a virtual expression database search and RT-PCR analysis showed co-expression of SEMA5A and Plexin B3, and these proteins were found to have co-evolved. In addition, we confirmed the interaction of SEMA5A with Plexin B3 in co-immunoprecipitation studies. Overall, these studies demonstrate that an integrated method of prediction can be used at the genome level for discovering many unknown protein binding partners with known ligand binding domains.

3.
The unannotated regions of the Escherichia coli genome DNA sequence from the EcoSeq6 database, totaling 1,278 'intergenic' sequences with a combined length of 359,279 base pairs, were analyzed using computer-assisted methods with the aim of identifying putative unknown genes. The proposed strategy for finding new genes includes two key elements: i) prediction of expressed open reading frames (ORFs) using the GeneMark method based on Markov chain models for coding and non-coding regions of Escherichia coli DNA, and ii) search for protein sequence similarities using programs based on the BLAST algorithm and programs for motif identification. A total of 354 putative expressed ORFs were predicted by GeneMark. Using the BLASTX and TBLASTN programs, it was shown that 208 ORFs located in the unannotated regions of the E. coli chromosome are significantly similar to other protein sequences. Identification of 182 ORFs as probable genes was supported by both GeneMark and BLAST, comprising 51.4% of the GeneMark 'hits' and 87.5% of the BLAST 'hits'. 73 putative new genes, comprising 20.6% of the GeneMark predictions, belong to ancient conserved protein families that include both eubacterial and eukaryotic members. This value is close to the overall proportion of highly conserved sequences among eubacterial proteins, indicating that the majority of the putative expressed ORFs that are predicted by GeneMark, but have no significant BLAST hits, nevertheless are likely to be real genes. The majority of the putative genes identified by BLAST search have been described since the release of the EcoSeq6 database, but about 70 genes have not been detected so far. Among these new identifications are genes encoding proteins with a variety of predicted functions including dehydrogenases, kinases, several other metabolic enzymes, ATPases, rRNA methyltransferases, membrane proteins, and different types of regulatory proteins.
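
The percentages quoted in this abstract follow directly from the three counts it reports. A minimal sketch of that arithmetic is given here; the variable names and the `percent` helper are illustrative assumptions, and the "GeneMark only" and "BLAST only" figures are derived by subtraction rather than stated in the abstract.

```python
# Counts quoted in the abstract (names are illustrative, not from the paper).
genemark_orfs = 354  # putative expressed ORFs predicted by GeneMark
blast_orfs = 208     # ORFs with significant BLASTX/TBLASTN similarity
both = 182           # ORFs supported by both GeneMark and BLAST
conserved = 73       # putative new genes in ancient conserved families


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


# Shares reported in the abstract.
share_of_genemark = percent(both, genemark_orfs)        # overlap among GeneMark hits
share_of_blast = percent(both, blast_orfs)              # overlap among BLAST hits
conserved_share = percent(conserved, genemark_orfs)     # conserved families among GeneMark hits

# Derived by subtraction (assumption: not stated explicitly in the abstract).
genemark_only = genemark_orfs - both  # GeneMark predictions without a BLAST hit
blast_only = blast_orfs - both        # BLAST hits not predicted by GeneMark

print(share_of_genemark, share_of_blast, conserved_share)
print(genemark_only, blast_only)
```

The GeneMark-only group is the set the abstract argues is still likely to consist of real genes despite lacking significant BLAST hits.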

4.
Barley Mlo defines the founder of a novel class of plant integral membrane proteins. Lack of the wild type protein leads to broad spectrum disease resistance against the pathogenic powdery mildew fungus and deregulated leaf cell death. Scanning N-glycosylation mutagenesis and Mlo-Lep fusion proteins demonstrated that Mlo is membrane-anchored by 7 transmembrane (TM) helices such that the N terminus is located extracellularly and the C terminus intracellularly. Fractionation of leaf cells and immunoblotting localized the protein to the plant plasma membrane. A genome-wide search for Mlo sequence-related genes in Arabidopsis thaliana revealed approximately 35 family members, the only abundant gene family encoding 7 TM proteins in higher plants. The sequence variability of Mlo family members within a single species, their topology and subcellular localization are reminiscent of the most abundant class of metazoan 7 TM receptors, the G-protein-coupled receptors.

5.
A database search for similarities between sequenced parts of the Arabidopsis thaliana genome and known sulfurtransferase sequences from Escherichia coli and mammals was undertaken to obtain information about plant sulfurtransferase-like proteins. One gene and several homologous EST clones were identified. One of the EST clones was used for screening an Arabidopsis cDNA library. The isolated full-length clone consists of 1134 bp and encodes a 42.6 kDa protein that includes a putative transit peptide sequence of about 7.1 kDa. Sequence comparisons with known sulfurtransferases from different organisms confirmed high homology between them and the existence of several highly conserved regions. Results of a Southern blot performed with genomic Arabidopsis DNA showed the occurrence of at least two sulfurtransferase-like isozymes in Arabidopsis. Recombinant proteins with and without the putative transit peptide were expressed in E. coli with an N-terminal His6-tag, purified by affinity chromatography and tested for enzyme activity using different sulfur donors and acceptors. Both recombinant proteins catalyzed the formation of SCN- from thiosulfate and cyanide, as a rhodanese does by definition; however, both preferred 3-mercaptopyruvate to thiosulfate. A monospecific antibody produced by using the mature recombinant protein as an antigen recognized a single protein band in total extracts of Arabidopsis plants, corresponding to the size of the full-length protein. A single band corresponding to the size of the mature protein was detected in purified Arabidopsis mitochondria, but there was no antigenic reaction with any protein from chloroplasts. The function of the protein is still speculative. Tools are now available to elucidate the roles and substrates of this sulfurtransferase in higher plants.

6.
Plastids are actively involved in numerous plant processes critical to growth, development and adaptation. They play a primary role in photosynthesis, pigment and monoterpene synthesis, gravity sensing, starch and fatty acid synthesis, as well as oil and protein storage. We applied two complementary methods to analyze the recently published apple genome (Malus × domestica) to identify putative plastid-targeted proteins, the first using TargetP and the second using a custom workflow utilizing a set of predictive programs. Apple shares roughly 40% of its 10,492 putative plastid-targeted proteins with the Arabidopsis (Arabidopsis thaliana) plastid-targeted proteome as identified by the Chloroplast 2010 project, and ~57% of its entire proteome with Arabidopsis. This suggests that the plastid-targeted proteomes of apple and Arabidopsis are different, and interestingly alludes to the presence of differential targeting of homologs between the two species. Co-expression analysis of 2,224 genes encoding putative plastid-targeted apple proteins suggests that they play a role in plant developmental and intermediary metabolism. Further, an inter-specific comparison of Arabidopsis, Prunus persica (Peach), Malus × domestica (Apple), Populus trichocarpa (Black cottonwood), Fragaria vesca (Woodland Strawberry), Solanum lycopersicum (Tomato) and Vitis vinifera (Grapevine) also identified a large number of novel species-specific plastid-targeted proteins. This analysis also revealed the presence of alternatively targeted homologs across species. Two separate analyses revealed that a small subset of proteins, one representing 289 protein clusters and the other 737 unique protein sequences, are conserved between seven plastid-targeted angiosperm proteomes. The majority of the novel proteins were annotated to play roles in stress response, transport, catabolic processes, and cellular component organization. Our results suggest that the current state of knowledge regarding plastid biology, based preferentially on model systems, is deficient. New plant genomes are expected to enable the identification of potentially new plastid-targeted proteins that will aid in studying novel roles of plastids.

8.
The functions of approximately one-third of the proteins encoded by the Arabidopsis thaliana genome are completely unknown. Moreover, many annotations of the remainder of the genome supply tentative functions, at best. Knowing the ultimate localization of these proteins, as well as the pathways used for getting there, may provide clues as to their functions. The putative localization of most proteins currently relies on in silico-based bioinformatics approaches, which, unfortunately, often result in erroneous predictions. Emerging proteomics techniques coupled with other systems biology approaches now provide researchers with a plethora of methods for elucidating the final location of these proteins on a large scale, as well as the ability to dissect protein-sorting pathways in plants.

9.
Phosphoinositide-specific phospholipase Cs (PI-PLCs) are important enzymes in eukaryotes that catalyze the hydrolysis of phosphatidylinositol 4,5-bisphosphate into the two second messengers inositol 1,4,5-trisphosphate and diacylglycerol. The Arabidopsis genome contains nine putative PI-PLC genes. AtPLC4, an abiotic stress-induced gene, has been reported to encode an active PI-PLC isoform. However, the exact roles of putative AtPLC4 in plants remain to be elucidated. The first 108 amino acid residues of the N-terminus of AtPLC4, referred to as AtPLC4 N, were expressed as a recombinant protein in Escherichia coli and used as antigen to generate an antibody. Purified recombinant proteins including AtPLC1 to AtPLC5, AtPLC8, AtPLC9 and AtPLC4 N were transferred onto the same blot to test the specificity of the prepared antibody. Western blot results show that only AtPLC4 and AtPLC4 N are recognized by the antibody. The antibody recognized a protein of approximately 68 kDa in the plasma membrane and cytosolic fractions prepared from Arabidopsis thaliana plants. This corresponds very well with the calculated molecular weight of AtPLC4. The results suggest that AtPLC4 may encode a plasma membrane-associated protein.

14.
Tail-anchored (TA) proteins are a class of polypeptides that are integrated into the membrane by a C-terminally located hydrophobic sequence and are present in all three domains of life. Proteins of this class lack an N-terminal signal peptide and reach their destination within the cell by posttranslational mechanisms. TA proteins perform a variety of essential functions on the cytosolic face of cellular membranes and, in several cases, determine the organelle identity. Some TA proteins insert directly into the lipid bilayer without the help of molecular machinery, suggesting that they may be ancestral proteins able to recruit lipids, contributing to the formation of intracellular compartments during cell evolution. Relevant progress has been made in recent years on the identification of TA protein sorting and the posttranslational translocation machineries. Interestingly, membrane lipid components were also found to be involved in the insertion mechanism. A bioinformatic approach is used to produce a catalogue of putative TA proteins encoded by the Arabidopsis thaliana genome, and intracellular localization is predicted based on features of well-characterized TA proteins. A recent strategy aimed at improving the accumulation of recombinant proteins expressed in transgenic plants is also discussed.

15.
MOTIVATION: Arabidopsis thaliana has the best understood plant genome, yet approximately one-third of its genes still have no functional annotation at all from either MIPS or TAIR. We have applied our Data Mining Prediction (DMP) method to the problem of predicting the functional classes of these protein sequences. This method uses a hybrid machine-learning/data-mining approach to identify patterns in the bioinformatic data about sequences that are predictive of function. We use data about sequence, predicted secondary structure, predicted structural domain, InterPro patterns, sequence similarity profile and expression data. RESULTS: We predicted the functional class of a high percentage of the Arabidopsis genes with currently unknown function. These predictions are interpretable and have good test accuracies. We describe in detail seven of the rules produced.

16.
The Arabidopsis thaliana genome encodes about 386 proteins with coiled-coil domains of at least 50 amino acids in length. In mammalian systems, many coiled-coil proteins are part of various cytoskeletal networks including intermediate filament proteins, actin-binding proteins and microtubule-associated proteins (MAPs). Immunological evidence suggests that some of these cytoskeletal proteins, such as lamins, keratins and tropomyosins, may be conserved in Arabidopsis. However, coiled-coil proteins are of low complexity, and thus traditional sequence comparison algorithms, such as BLAST, may not detect homologies. Here, we use the PROPSEARCH algorithm to detect putative coiled-coil cytoskeletal protein homologues in Arabidopsis. This approach reveals putative intermediate filament protein homologues of filensin, lamin and keratin; putative actin-binding homologues of ERM (ezrin/radixin/moesin), periplakin, utrophin, tropomyosin and paramyosin; and putative MAP homologues of restin/CLIP-170 (cytoplasmic linker protein-170). We suggest that the AtFPP (Arabidopsis thaliana filament-like plant protein) and AtMAP70 (Arabidopsis microtubule-associated protein 70) families of coiled-coil proteins may, in fact, be related to lamins and function as intermediate filament proteins.

17.
Young meristematic plant cells contain a large number of small vacuoles, while the largest part of the vacuome in mature cells is composed of a large central vacuole, occupying 80% to 90% of the cell volume. Thus far, only a limited number of vacuolar membrane proteins have been identified and characterized. The proteomic approach is a powerful tool to identify new vacuolar membrane proteins. To analyze vacuoles from growing tissues, we isolated vacuoles from cauliflower (Brassica oleracea) buds, which consist of a large number of small cells but also contain expanding cells as well as fully expanded cells. Here we show that, using purified cauliflower vacuoles and different extraction procedures such as saline, NaOH, acetone, and chloroform/methanol, and analyzing the data against the Arabidopsis (Arabidopsis thaliana) database, 102 cauliflower integral proteins and 214 peripheral proteins could be identified. The vacuolar pyrophosphatase was the most prominent protein. Of the 102 identified proteins, 45 had already been described. Nine of these, corresponding to 46% of peptides detected, are known vacuolar proteins. We identified 57 proteins (55.9%) containing at least one membrane-spanning domain with unknown subcellular localization. A comparison of the newly identified proteins with expression profiles from in silico data revealed that most of them are highly expressed in young, developing tissues. To verify whether the newly identified proteins were indeed localized in the vacuole, we constructed and expressed green fluorescent protein fusion proteins for five putative vacuolar membrane proteins exhibiting three to 11 transmembrane domains. Four of them, a putative organic cation transporter, a nodulin N21 family protein, a membrane protein of unknown function, and a senescence-related membrane protein, were localized in the vacuolar membrane, while a white-brown ATP-binding cassette transporter homolog was shown to reside in the plasma membrane. These results demonstrate that proteomic analysis of highly purified vacuoles from specific tissues allows the identification of new vacuolar proteins and provides an additional view of tonoplastic proteins.

19.
To better understand the mechanisms governing cellular traffic, storage of various metabolites, and their ultimate degradation, Arabidopsis thaliana vacuole proteomes were established. To this end, a procedure was developed to prepare highly purified vacuoles from protoplasts isolated from Arabidopsis cell cultures using Ficoll density gradients. Based on the specific activity of the vacuolar marker alpha-mannosidase, the enrichment factor of the vacuoles was estimated at approximately 42-fold with an average yield of 2.1%. Absence of significant contamination by other cellular compartments was validated by Western blot using antibodies raised against specific markers of chloroplasts, mitochondria, plasma membrane, and endoplasmic reticulum. Based on these results, vacuole preparations showed the necessary degree of purity for proteomics study. Therefore, a proteomics approach was developed to identify the protein components present in both the membrane and soluble fractions of the Arabidopsis cell vacuoles. This approach includes the following: (i) a mild oxidation step leading to the transformation of cysteine residues into cysteic acid and methionine to methionine sulfoxide, (ii) an in-solution proteolytic digestion of very hydrophobic proteins, and (iii) a prefractionation of proteins by short migration by SDS-PAGE followed by analysis by liquid chromatography coupled to tandem mass spectrometry. This procedure allowed the identification of more than 650 proteins, two-thirds of which copurify with the membrane hydrophobic fraction and one-third of which copurify with the soluble fraction. Among the 416 proteins identified from the membrane fraction, 195 were considered integral membrane proteins based on the presence of one or more predicted transmembrane domains, and 110 transporters and related proteins were identified (91 putative transporters and 19 proteins related to the V-ATPase pump). With regard to function, about 20% of the proteins identified were known previously to be associated with vacuolar activities. The proteins identified are involved in ion and metabolite transport (26%), stress response (9%), signal transduction (7%), and metabolism (6%) or have been described to be involved in typical vacuolar activities, such as protein and sugar hydrolysis. The subcellular localization of several putative vacuolar proteins was confirmed by transient expression of green fluorescent protein fusion constructs.

20.
The prediction of transmembrane (TM) helix and topology provides important information about the structure and function of a membrane protein. Due to the experimental difficulties in obtaining a high-resolution model, computational methods are highly desirable. In this paper, we present a hierarchical classification method using support vector machines (SVMs) that integrates selected features by capturing the sequence-to-structure relationship and developing a new scoring function based on membrane protein folding. The proposed approach is evaluated on low- and high-resolution data sets with cross-validation, and the topology (sidedness) prediction accuracy reaches as high as 90%. Our method is also found to correctly predict both the location of TM helices and the topology for 69% of the low-resolution benchmark set. We also test our method for discrimination between soluble and membrane proteins and achieve very low overall false positive (0.5%) and false negative rates (0 to approximately 1.2%). Lastly, the analysis of the scoring function suggests that the topogeneses of single-spanning and multispanning TM proteins have different levels of complexity, and the consideration of interloop topogenic interactions for the latter is the key to achieving better predictions. This method can facilitate the annotation of membrane proteomes to extract useful structural and functional information. It is publicly available at http://bio-cluster.iis.sinica.edu.tw/~bioapp/SVMtop.
