共查询到20条相似文献,搜索用时 15 毫秒
1.
Background
Proteins are composed of domains, protein segments that fold independently from the rest of the protein and have a specific function. During evolution the arrangement of domains can change: domains are gained, lost or their order is rearranged. To facilitate the analysis of these changes we propose the use of multiple domain alignments.Results
We developed an alignment program, called MDAT, which aligns multiple domain arrangements. MDAT extends earlier programs which perform pairwise alignments of domain arrangements. MDAT uses a domain similarity matrix to score domain pairs and aligns the domain arrangements using a consistency supported progressive alignment method.Conclusion
MDAT will be useful for analysing changes in domain arrangements within and between protein families and will thus provide valuable insights into the evolution of proteins and their domains. MDAT is coded in C++, and the source code is freely available for download at http://www.bornberglab.org/pages/mdat.Electronic supplementary material
The online version of this article (doi:10.1186/s12859-014-0442-7) contains supplementary material, which is available to authorized users. 相似文献2.
Background
Plant resistance genes (R genes) exist in large families and usually contain both a nucleotide-binding site domain and a leucine-rich repeat domain, denoted NBS-LRR. The genome sequence of cassava (Manihot esculenta) is a valuable resource for analysing the genomic organization of resistance genes in this crop.Results
With searches for Pfam domains and manual curation of the cassava gene annotations, we identified 228 NBS-LRR type genes and 99 partial NBS genes. These represent almost 1% of the total predicted genes and show high sequence similarity to proteins from other plant species. Furthermore, 34 contained an N-terminal toll/interleukin (TIR)-like domain, and 128 contained an N-terminal coiled-coil (CC) domain. 63% of the 327 R genes occurred in 39 clusters on the chromosomes. These clusters are mostly homogeneous, containing NBS-LRRs derived from a recent common ancestor.Conclusions
This study provides insight into the evolution of NBS-LRR genes in the cassava genome; the phylogenetic and mapping information may aid efforts to further characterize the function of these predicted R genes.Electronic supplementary material
The online version of this article (doi:10.1186/s12864-015-1554-9) contains supplementary material, which is available to authorized users. 相似文献3.
4.
Background
Protein structural domains are evolutionary units whose relationships can be detected over long evolutionary distances. The evolutionary history of protein domains, including the origin of protein domains, the identification of domain loss, transfer, duplication and combination with other domains to form new proteins, and the formation of the entire protein domain repertoire, are of great interest.Methodology/Principal Findings
A methodology is presented for providing a parsimonious domain history based on gain, loss, vertical and horizontal transfer derived from the complete genomic domain assignments of 1015 organisms across the tree of life. When mapped to species trees the evolutionary history of domains and domain combinations is revealed, and the general evolutionary trend of domain and combination is analyzed.Conclusions/Significance
We show that this approach provides a powerful tool to study how new proteins and functions emerged and to study such processes as horizontal gene transfer among more distant species. 相似文献5.
Background
Visual and β-arrestins are scaffolding proteins involved in the regulation of receptor-dependent intracellular signaling and their trafficking. The arrestin superfamilly includes several arrestin domain-containing proteins and the structurally related protein Vps26. In Dictyostelium discoideum, the arrestin-domain containing proteins form a family of six members, namely AdcA to -F. In contrast to canonical arrestins, Dictyostelium Adc proteins show a more complex architecture, as they possess, in addition to the arrestin core, other domains, such as C2, FYVE, LIM, MIT and SAM, which potentially mediate selective interactions with either lipids or proteins.Methodology and Principal Findings
A detailed analysis of AdcA has been performed. AdcA extends on both sides of the arrestin core, in particular by a FYVE domain which mediates selective interactions with PI(3)P, as disclosed by intrinsic fluorescence measurements and lipid overlay assays. Localization studies showed an enrichment of tagged- and endogenous AdcA on the rim of early macropinosomes and phagosomes. This vesicular distribution relies on a functional FYVE domain. Our data also show that the arrestin core binds the ADP-ribosylation factor ArfA, the unique amoebal Arf member, in its GDP-bound conformation.Significance
This work describes one of the 6 arrestin domain-containing proteins of Dictyostelium, a novel and atypical member of the arrestin clan. It provides the basis for a better understanding of arrestin-related protein involvement in trafficking processes and for further studies on the expanding roles of arrestins in eukaryotes. 相似文献6.
Background and Aims
ADP-glucose pyrophosphorylase (AGPase) is a key enzyme of starch biosynthesis. In the green plant lineage, it is composed of two large (LSU) and two small (SSU) sub-units encoded by paralogous genes, as a consequence of several rounds of duplication. First, our aim was to detect specific patterns of molecular evolution following duplication events and the divergence between monocotyledons and dicotyledons. Secondly, we investigated coevolution between amino acids both within and between sub-units.Methods
A phylogeny of each AGPase sub-unit was built using all gymnosperm and angiosperm sequences available in databases. Accelerated evolution along specific branches was tested using the ratio of the non-synonymous to the synonymous substitution rate. Coevolution between amino acids was investigated taking into account compensatory changes between co-substitutions.Key Results
We showed that SSU paralogues evolved under high functional constraints during angiosperm radiation, with a significant level of coevolution between amino acids that participate in SSU major functions. In contrast, in the LSU paralogues, we identified residues under positive selection (1) following the first LSU duplication that gave rise to two paralogues mainly expressed in angiosperm source and sink tissues, respectively; and (2) following the emergence of grass-specific paralogues expressed in the endosperm. Finally, we found coevolution between residues that belong to the interaction domains of both sub-units.Conclusions
Our results support the view that coevolution among amino acid residues, especially those lying in the interaction domain of each sub-unit, played an important role in AGPase evolution. First, within SSU, coevolution allowed compensating mutations in a highly constrained context. Secondly, the LSU paralogues probably acquired tissue-specific expression and regulatory properties via the coevolution between sub-unit interacting domains. Finally, the pattern we observed during LSU evolution is consistent with repeated sub-functionalization under ‘Escape from Adaptive Conflict’, a model rarely illustrated in the literature. 相似文献7.
Thomas E. Edwards Isabelle Phan Jan Abendroth Shellie H. Dieterich Amir Masoudi Wenjin Guo Stephen N. Hewitt Angela Kelley David Leibly Mitch J. Brittnacher Bart L. Staker Samuel I. Miller Wesley C. Van Voorhis Peter J. Myler Lance J. Stewart 《PloS one》2010,5(9)
Background
Pathogenic bacteria adhere to the host cell surface using a family of outer membrane proteins called Trimeric Autotransporter Adhesins (TAAs). Although TAAs are highly divergent in sequence and domain structure, they are all conceptually comprised of a C-terminal membrane anchoring domain and an N-terminal passenger domain. Passenger domains consist of a secretion sequence, a head region that facilitates binding to the host cell surface, and a stalk region.Methodology/Principal Findings
Pathogenic species of Burkholderia contain an overabundance of TAAs, some of which have been shown to elicit an immune response in the host. To understand the structural basis for host cell adhesion, we solved a 1.35 Å resolution crystal structure of a BpaA TAA head domain from Burkholderia pseudomallei, the pathogen that causes melioidosis. The structure reveals a novel fold of an intricately intertwined trimer. The BpaA head is composed of structural elements that have been observed in other TAA head structures as well as several elements of previously unknown structure predicted from low sequence homology between TAAs. These elements are typically up to 40 amino acids long and are not domains, but rather modular structural elements that may be duplicated or omitted through evolution, creating molecular diversity among TAAs.Conclusions/Significance
The modular nature of BpaA, as demonstrated by its head domain crystal structure, and of TAAs in general provides insights into evolution of pathogen-host adhesion and may provide an avenue for diagnostics. 相似文献8.
Background
Bro1 domains are elongated, banana-shaped domains that were first identified in the yeast ESCRT pathway protein, Bro1p. Humans express three Bro1 domain-containing proteins: ALIX, BROX, and HD-PTP, which function in association with the ESCRT pathway to help mediate intraluminal vesicle formation at multivesicular bodies, the abscission stage of cytokinesis, and/or enveloped virus budding. Human Bro1 domains share the ability to bind the CHMP4 subset of ESCRT-III proteins, associate with the HIV-1 NCGag protein, and stimulate the budding of viral Gag proteins. The curved Bro1 domain structure has also been proposed to mediate membrane bending. To date, crystal structures have only been available for the related Bro1 domains from the Bro1p and ALIX proteins, and structures of additional family members should therefore aid in the identification of key structural and functional elements.Methodology/Principal Findings
We report the crystal structure of the human BROX protein, which comprises a single Bro1 domain. The Bro1 domains from BROX, Bro1p and ALIX adopt similar overall structures and share two common exposed hydrophobic surfaces. Surface 1 is located on the concave face and forms the CHMP4 binding site, whereas Surface 2 is located at the narrow end of the domain. The structures differ in that only ALIX has an extended loop that projects away from the convex face to expose the hydrophobic Phe105 side chain at its tip. Functional studies demonstrated that mutations in Surface 1, Surface 2, or Phe105 all impair the ability of ALIX to stimulate HIV-1 budding.Conclusions/Significance
Our studies reveal similarities in the overall folds and hydrophobic protein interaction sites of different Bro1 domains, and show that a unique extended loop contributes to the ability of ALIX to function in HIV-1 budding. 相似文献9.
Background
The PWWP domain was first identified as a structural motif of 100–130 amino acids in the WHSC1 protein and predicted to be a protein-protein interaction domain. It belongs to the Tudor domain ‘Royal Family’, which consists of Tudor, chromodomain, MBT and PWWP domains. While Tudor, chromodomain and MBT domains have long been known to bind methylated histones, PWWP was shown to exhibit histone binding ability only until recently.Methodology/Principal Findings
The PWWP domain has been shown to be a DNA binding domain, but sequence analysis and previous structural studies show that the PWWP domain exhibits significant similarity to other ‘Royal Family’ members, implying that the PWWP domain has the potential to bind histones. In order to further explore the function of the PWWP domain, we used the protein family approach to determine the crystal structures of the PWWP domains from seven different human proteins. Our fluorescence polarization binding studies show that PWWP domains have weak histone binding ability, which is also confirmed by our NMR titration experiments. Furthermore, we determined the crystal structures of the BRPF1 PWWP domain in complex with H3K36me3, and HDGF2 PWWP domain in complex with H3K79me3 and H4K20me3.Conclusions
PWWP proteins constitute a new family of methyl lysine histone binders. The PWWP domain consists of three motifs: a canonical β-barrel core, an insertion motif between the second and third β-strands and a C-terminal α-helix bundle. Both the canonical β-barrel core and the insertion motif are directly involved in histone binding. The PWWP domain has been previously shown to be a DNA binding domain. Therefore, the PWWP domain exhibits dual functions: binding both DNA and methyllysine histones.Enhanced version
This article can also be viewed as an enhanced version in which the text of the article is integrated with interactive 3D representations and animated transitions. Please note that a web plugin is required to access this enhanced functionality. Instructions for the installation and use of the web plugin are available in Text S1. 相似文献10.
Garima Agarwal Malligarjunan Rajavel Balasubramanian Gopal Narayanaswamy Srinivasan 《PloS one》2009,4(5)
Background
The members of cupin superfamily exhibit large variations in their sequences, functions, organization of domains, quaternary associations and the nature of bound metal ion, despite having a conserved β-barrel structural scaffold. Here, an attempt has been made to understand structure-function relationships among the members of this diverse superfamily and identify the principles governing functional diversity. The cupin superfamily also contains proteins for which the structures are available through world-wide structural genomics initiatives but characterized as “hypothetical”. We have explored the feasibility of obtaining clues to functions of such proteins by means of comparative analysis with cupins of known structure and function.Methodology/Principal Findings
A 3-D structure-based phylogenetic approach was undertaken. Interestingly, a dendrogram generated solely on the basis of structural dissimilarity measure at the level of domain folds was found to cluster functionally similar members. This clustering also reflects an independent evolution of the two domains in bicupins. Close examination of structural superposition of members across various functional clusters reveals structural variations in regions that not only form the active site pocket but are also involved in interaction with another domain in the same polypeptide or in the oligomer.Conclusions/Significance
Structure-based phylogeny of cupins can influence identification of functions of proteins of yet unknown function with cupin fold. This approach can be extended to other proteins with a common fold that show high evolutionary divergence. This approach is expected to have an influence on the function annotation in structural genomics initiatives. 相似文献11.
Phylogenetic analyses provide the first insights into the evolution of OVATE family proteins in land plants 总被引:1,自引:0,他引:1
Di Liu Wei Sun Yaowu Yuan Ning Zhang Alice Hayward Yongliang Liu Ying Wang 《Annals of botany》2014,113(7):1219-1233
Background and Aims
The OVATE gene encodes a nuclear-localized regulatory protein belonging to a distinct family of plant-specific proteins known as the OVATE family proteins (OFPs). OVATE was first identified as a key regulator of fruit shape in tomato, with nonsense mutants displaying pear-shaped fruits. However, the role of OFPs in plant development has been poorly characterized.Methods
Public databases were searched and a total of 265 putative OVATE protein sequences were identified from 13 sequenced plant genomes that represent the major evolutionary lineages of land plants. A phylogenetic analysis was conducted based on the alignment of the conserved OVATE domain from these 13 selected plant genomes. The expression patterns of tomato SlOFP genes were analysed via quantitative real-time PCR. The pattern of OVATE gene duplication resulting in the expansion of the gene family was determined in arabidopsis, rice and tomato.Key Results
Genes for OFPs were found to be present in all the sampled land plant genomes, including the early-diverged lineages, mosses and lycophytes. Phylogenetic analysis based on the amino acid sequences of the conserved OVATE domain defined 11 sub-groups of OFPs in angiosperms. Different evolutionary mechanisms are proposed for OVATE family evolution, namely conserved evolution and divergent expansion. Characterization of the AtOFP family in arabidopsis, the OsOFP family in rice and the SlOFP family in tomato provided further details regarding the evolutionary framework and revealed a major contribution of tandem and segmental duplications towards expansion of the OVATE gene family.Conclusions
This first genome-wide survey on OFPs provides new insights into the evolution of the OVATE protein family and establishes a solid base for future functional genomics studies on this important but poorly characterized regulatory protein family in plants. 相似文献12.
Olga Novokhatska Mykola Dergai Liudmyla Tsyba Inessa Skrypkina Valeriy Filonenko Jacques Moreau Alla Rynditch 《PloS one》2013,8(7)
Background
Scaffolding proteins of the intersectin (ITSN) family, ITSN1 and ITSN2, are crucial for the initiation stage of clathrin-mediated endocytosis. These proteins are closely related but have implications in distinct pathologies. To determine how these proteins could be separated in certain cell pathways we performed a comparative study of ITSNs.Methodology/Principal Findings
We have shown that endogenous ITSN1 and ITSN2 colocalize and form a complex in cells. A structural comparison of five SH3 domains, which mediated most ITSNs protein-protein interactions, demonstrated a similarity of their ligand-binding sites. We showed that the SH3 domains of ITSN2 bound well-established interactors of ITSN1 as well as newly identified ITSNs protein partners. A search for a novel interacting interface revealed multiple tyrosines that could be phosphorylated in ITSN2. Phosphorylation of ITSN2 isoforms but not ITSN1 short isoform was observed in various cell lines. EGF stimulation of HeLa cells enhanced tyrosine phosphorylation of ITSN2 isoforms and enabled their recognition by the SH2 domains of the Fyn, Fgr and Abl1 kinases, the regulatory subunit of PI3K, the adaptor proteins Grb2 and Crk, and phospholipase C gamma. The SH2 domains mentioned were unable to bind ITSN1 short isoform.Conclusions/Significance
Our results indicate that during evolution of vertebrates ITSN2 acquired a novel protein-interaction interface that allows its specific recognition by the SH2 domains of signaling proteins. We propose that these data could be important to understand the functional diversity of paralogous ITSN proteins. 相似文献13.
Background
Polyketides are a diverse group of biotechnologically important secondary metabolites that are produced by multi domain enzymes called polyketide synthases (PKS).Methodology/Principal Findings
We have estimated frequencies of type I PKS (PKS I) – a PKS subgroup – in natural environments by using Hidden-Markov-Models of eight domains to screen predicted proteins from six metagenomic shotgun data sets. As the complex PKS I have similarities to other multi-domain enzymes (like those for the fatty acid biosynthesis) we increased the reliability and resolution of the dataset by maximum-likelihood trees. The combined information of these trees was then used to discriminate true PKS I domains from evolutionary related but functionally different ones. We were able to identify numerous novel PKS I proteins, the highest density of which was found in Minnesota farm soil with 136 proteins out of 183,536 predicted genes. We also applied the protocol to UniRef database to improve the annotation of proteins with so far unknown function and identified some new instances of horizontal gene transfer.Conclusions/Significance
The screening approach proved powerful in identifying PKS I sequences in large sequence data sets and is applicable to many other protein families. 相似文献14.
Insulator function and topological domain border strength scale with architectural protein occupancy
Kevin Van Bortle Michael H Nichols Li Li Chin-Tong Ong Naomi Takenaka Zhaohui S Qin Victor G Corces 《Genome biology》2014,15(5):R82
Background
Chromosome conformation capture studies suggest that eukaryotic genomes are organized into structures called topologically associating domains. The borders of these domains are highly enriched for architectural proteins with characterized roles in insulator function. However, a majority of architectural protein binding sites localize within topological domains, suggesting sites associated with domain borders represent a functionally different subclass of these regulatory elements. How topologically associating domains are established and what differentiates border-associated from non-border architectural protein binding sites remain unanswered questions.Results
By mapping the genome-wide target sites for several Drosophila architectural proteins, including previously uncharacterized profiles for TFIIIC and SMC-containing condensin complexes, we uncover an extensive pattern of colocalization in which architectural proteins establish dense clusters at the borders of topological domains. Reporter-based enhancer-blocking insulator activity as well as endogenous domain border strength scale with the occupancy level of architectural protein binding sites, suggesting co-binding by architectural proteins underlies the functional potential of these loci. Analyses in mouse and human stem cells suggest that clustering of architectural proteins is a general feature of genome organization, and conserved architectural protein binding sites may underlie the tissue-invariant nature of topologically associating domains observed in mammals.Conclusions
We identify a spectrum of architectural protein occupancy that scales with the topological structure of chromosomes and the regulatory potential of these elements. Whereas high occupancy architectural protein binding sites associate with robust partitioning of topologically associating domains and robust insulator function, low occupancy sites appear reserved for gene-specific regulation within topological domains. 相似文献15.
Background
Genome variation is very high in influenza A viruses. However, viral evolution and spreading is strongly influenced by immunogenic features and capacity to bind host cells, depending in turn on the two major capsidic proteins. Therefore, such viruses are classified based on haemagglutinin and neuraminidase types, e.g. H5N1. Current analyses of viral evolution are based on serological and primary sequence comparison; however, comparative structural analysis of capsidic proteins can provide functional insights on surface regions possibly crucial to antigenicity and cell binding.Results
We performed extensive structural comparison of influenza virus haemagglutinins and of their domains and subregions to investigate type- and/or domain-specific variation. We found that structural closeness and primary sequence similarity are not always tightly related; moreover, type-specific features could be inferred when comparing surface properties of haemagglutinin subregions, monomers and trimers, in terms of electrostatics and hydropathy. Focusing on H5N1, we found that variation at the receptor binding domain surface intriguingly relates to branching of still circulating clades from those ones that are no longer circulating.Conclusions
Evidence from this work suggests that integrating phylogenetic and serological analyses by extensive structural comparison can help in understanding the ‘functional evolution’ of viral surface determinants. In particular, variation in electrostatic and hydropathy patches can provide molecular evolution markers: intriguing surface charge redistribution characterizing the haemagglutinin receptor binding domains from circulating H5N1 clades 2 and 7 might have contributed to antigenic escape hence to their evolutionary success and spreading.Electronic supplementary material
The online version of this article (doi:10.1186/s12859-014-0363-5) contains supplementary material, which is available to authorized users. 相似文献16.
17.
Harris JA Roy K Woo-Rasberry V Hamilton DJ Kansal R Qadri F Fleckenstein JM 《PLoS neglected tropical diseases》2011,5(12):e1428
Background
Enterotoxigenic Escherichia coli (ETEC) is a major diarrheal pathogen in developing countries, where it accounts for millions of infections and hundreds of thousands of deaths annually. While vaccine development to prevent diarrheal illness due to ETEC is feasible, extensive effort is needed to identify conserved antigenic targets. Pathogenic Escherichia coli, including ETEC, use the autotransporter (AT) secretion mechanism to export virulence factors. AT proteins are comprised of a highly conserved carboxy terminal outer membrane beta barrel and a surface-exposed amino terminal passenger domain. Recent immunoproteomic studies suggesting that multiple autotransporter passenger domains are recognized during ETEC infection prompted the present studies.Methodology
Available ETEC genomes were examined to identify AT coding sequences present in pathogenic isolates, but not in the commensal E. coli HS strain. Passenger domains of the corresponding autotransporters were cloned and expressed as recombinant antigens, and the immune response to these proteins was then examined using convalescent sera from patients and experimentally infected mice.Principal Findings
Potential AT genes shared by ETEC strains, but absent in the E. coli commensal HS strain were identified. Recombinant passenger domains derived from autotransporters, including Ag43 and an AT designated pAT, were recognized by antibodies from mice following intestinal challenge with , and both Ag43 and pAT were identified on the surface of ETEC by flow cytometry. Likewise, convalescent sera from patients with ETEC diarrhea recognized Ag43 and pAT, suggesting that these proteins are expressed during both experimental and naturally occurring ETEC infections and that they are immunogenic. Vaccination of mice with recombinant passenger domains from either pAT or Ag43 afforded protection against intestinal colonization with ETEC. H10407Conclusions
Passenger domains of conserved autotransporter proteins could contribute to protective immune responses that develop following infection with ETEC, and these antigens consequently represent potential targets to explore in vaccine development. 相似文献18.
Sushmita Roy Diego Martinez Harriett Platero Terran Lane Margaret Werner-Washburne 《PloS one》2009,4(11)
Background
Computational prediction of protein interactions typically use protein domains as classifier features because they capture conserved information of interaction surfaces. However, approaches relying on domains as features cannot be applied to proteins without any domain information. In this paper, we explore the contribution of pure amino acid composition (AAC) for protein interaction prediction. This simple feature, which is based on normalized counts of single or pairs of amino acids, is applicable to proteins from any sequenced organism and can be used to compensate for the lack of domain information.Results
AAC performed at par with protein interaction prediction based on domains on three yeast protein interaction datasets. Similar behavior was obtained using different classifiers, indicating that our results are a function of features and not of classifiers. In addition to yeast datasets, AAC performed comparably on worm and fly datasets. Prediction of interactions for the entire yeast proteome identified a large number of novel interactions, the majority of which co-localized or participated in the same processes. Our high confidence interaction network included both well-studied and uncharacterized proteins. Proteins with known function were involved in actin assembly and cell budding. Uncharacterized proteins interacted with proteins involved in reproduction and cell budding, thus providing putative biological roles for the uncharacterized proteins.Conclusion
AAC is a simple, yet powerful feature for predicting protein interactions, and can be used alone or in conjunction with protein domains to predict new and validate existing interactions. More importantly, AAC alone performs at par with existing, but more complex, features indicating the presence of sequence-level information that is predictive of interaction, but which is not necessarily restricted to domains. 相似文献19.
Background
Src homology 2 (SH2) domain is a conserved module involved in various biological processes. Tensin family member was reported to be involved in tumor suppression by interacting with DLC-1 (deleted-in-liver-cancer-1) via its SH2 domain. We explore here the important questions that what the structure of tensin2 SH2 domain is, and how it binds to DLC-1, which might reveal a novel binding mode.Principal Findings
Tensin2 SH2 domain adopts a conserved SH2 fold that mainly consists of five β-strands flanked by two α-helices. Most SH2 domains recognize phosphorylated ligands specifically. However, tensin2 SH2 domain was identified to interact with nonphosphorylated ligand (DLC-1) as well as phosphorylated ligand.Conclusions
We determined the solution structure of tensin2 SH2 domain using NMR spectroscopy, and revealed the interactions between tensin2 SH2 domain and its ligands in a phosphotyrosine-independent manner. 相似文献20.