首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.

Background

Proteins are composed of domains, protein segments that fold independently from the rest of the protein and have a specific function. During evolution the arrangement of domains can change: domains are gained, lost or their order is rearranged. To facilitate the analysis of these changes we propose the use of multiple domain alignments.

Results

We developed an alignment program, called MDAT, which aligns multiple domain arrangements. MDAT extends earlier programs which perform pairwise alignments of domain arrangements. MDAT uses a domain similarity matrix to score domain pairs and aligns the domain arrangements using a consistency supported progressive alignment method.

Conclusion

MDAT will be useful for analysing changes in domain arrangements within and between protein families and will thus provide valuable insights into the evolution of proteins and their domains. MDAT is coded in C++, and the source code is freely available for download at http://www.bornberglab.org/pages/mdat.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-014-0442-7) contains supplementary material, which is available to authorized users.  相似文献   

2.

Background

Plant resistance genes (R genes) exist in large families and usually contain both a nucleotide-binding site domain and a leucine-rich repeat domain, denoted NBS-LRR. The genome sequence of cassava (Manihot esculenta) is a valuable resource for analysing the genomic organization of resistance genes in this crop.

Results

With searches for Pfam domains and manual curation of the cassava gene annotations, we identified 228 NBS-LRR type genes and 99 partial NBS genes. These represent almost 1% of the total predicted genes and show high sequence similarity to proteins from other plant species. Furthermore, 34 contained an N-terminal toll/interleukin (TIR)-like domain, and 128 contained an N-terminal coiled-coil (CC) domain. 63% of the 327 R genes occurred in 39 clusters on the chromosomes. These clusters are mostly homogeneous, containing NBS-LRRs derived from a recent common ancestor.

Conclusions

This study provides insight into the evolution of NBS-LRR genes in the cassava genome; the phylogenetic and mapping information may aid efforts to further characterize the function of these predicted R genes.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1554-9) contains supplementary material, which is available to authorized users.  相似文献   

3.
4.

Background

Protein structural domains are evolutionary units whose relationships can be detected over long evolutionary distances. The evolutionary history of protein domains, including the origin of protein domains, the identification of domain loss, transfer, duplication and combination with other domains to form new proteins, and the formation of the entire protein domain repertoire, are of great interest.

Methodology/Principal Findings

A methodology is presented for providing a parsimonious domain history based on gain, loss, vertical and horizontal transfer derived from the complete genomic domain assignments of 1015 organisms across the tree of life. When mapped to species trees the evolutionary history of domains and domain combinations is revealed, and the general evolutionary trend of domain and combination is analyzed.

Conclusions/Significance

We show that this approach provides a powerful tool to study how new proteins and functions emerged and to study such processes as horizontal gene transfer among more distant species.  相似文献   

5.
Guetta D  Langou K  Grunwald D  Klein G  Aubry L 《PloS one》2010,5(12):e15249

Background

Visual and β-arrestins are scaffolding proteins involved in the regulation of receptor-dependent intracellular signaling and their trafficking. The arrestin superfamilly includes several arrestin domain-containing proteins and the structurally related protein Vps26. In Dictyostelium discoideum, the arrestin-domain containing proteins form a family of six members, namely AdcA to -F. In contrast to canonical arrestins, Dictyostelium Adc proteins show a more complex architecture, as they possess, in addition to the arrestin core, other domains, such as C2, FYVE, LIM, MIT and SAM, which potentially mediate selective interactions with either lipids or proteins.

Methodology and Principal Findings

A detailed analysis of AdcA has been performed. AdcA extends on both sides of the arrestin core, in particular by a FYVE domain which mediates selective interactions with PI(3)P, as disclosed by intrinsic fluorescence measurements and lipid overlay assays. Localization studies showed an enrichment of tagged- and endogenous AdcA on the rim of early macropinosomes and phagosomes. This vesicular distribution relies on a functional FYVE domain. Our data also show that the arrestin core binds the ADP-ribosylation factor ArfA, the unique amoebal Arf member, in its GDP-bound conformation.

Significance

This work describes one of the 6 arrestin domain-containing proteins of Dictyostelium, a novel and atypical member of the arrestin clan. It provides the basis for a better understanding of arrestin-related protein involvement in trafficking processes and for further studies on the expanding roles of arrestins in eukaryotes.  相似文献   

6.

Background and Aims

ADP-glucose pyrophosphorylase (AGPase) is a key enzyme of starch biosynthesis. In the green plant lineage, it is composed of two large (LSU) and two small (SSU) sub-units encoded by paralogous genes, as a consequence of several rounds of duplication. First, our aim was to detect specific patterns of molecular evolution following duplication events and the divergence between monocotyledons and dicotyledons. Secondly, we investigated coevolution between amino acids both within and between sub-units.

Methods

A phylogeny of each AGPase sub-unit was built using all gymnosperm and angiosperm sequences available in databases. Accelerated evolution along specific branches was tested using the ratio of the non-synonymous to the synonymous substitution rate. Coevolution between amino acids was investigated taking into account compensatory changes between co-substitutions.

Key Results

We showed that SSU paralogues evolved under high functional constraints during angiosperm radiation, with a significant level of coevolution between amino acids that participate in SSU major functions. In contrast, in the LSU paralogues, we identified residues under positive selection (1) following the first LSU duplication that gave rise to two paralogues mainly expressed in angiosperm source and sink tissues, respectively; and (2) following the emergence of grass-specific paralogues expressed in the endosperm. Finally, we found coevolution between residues that belong to the interaction domains of both sub-units.

Conclusions

Our results support the view that coevolution among amino acid residues, especially those lying in the interaction domain of each sub-unit, played an important role in AGPase evolution. First, within SSU, coevolution allowed compensating mutations in a highly constrained context. Secondly, the LSU paralogues probably acquired tissue-specific expression and regulatory properties via the coevolution between sub-unit interacting domains. Finally, the pattern we observed during LSU evolution is consistent with repeated sub-functionalization under ‘Escape from Adaptive Conflict’, a model rarely illustrated in the literature.  相似文献   

7.

Background

Pathogenic bacteria adhere to the host cell surface using a family of outer membrane proteins called Trimeric Autotransporter Adhesins (TAAs). Although TAAs are highly divergent in sequence and domain structure, they are all conceptually comprised of a C-terminal membrane anchoring domain and an N-terminal passenger domain. Passenger domains consist of a secretion sequence, a head region that facilitates binding to the host cell surface, and a stalk region.

Methodology/Principal Findings

Pathogenic species of Burkholderia contain an overabundance of TAAs, some of which have been shown to elicit an immune response in the host. To understand the structural basis for host cell adhesion, we solved a 1.35 Å resolution crystal structure of a BpaA TAA head domain from Burkholderia pseudomallei, the pathogen that causes melioidosis. The structure reveals a novel fold of an intricately intertwined trimer. The BpaA head is composed of structural elements that have been observed in other TAA head structures as well as several elements of previously unknown structure predicted from low sequence homology between TAAs. These elements are typically up to 40 amino acids long and are not domains, but rather modular structural elements that may be duplicated or omitted through evolution, creating molecular diversity among TAAs.

Conclusions/Significance

The modular nature of BpaA, as demonstrated by its head domain crystal structure, and of TAAs in general provides insights into evolution of pathogen-host adhesion and may provide an avenue for diagnostics.  相似文献   

8.

Background

Bro1 domains are elongated, banana-shaped domains that were first identified in the yeast ESCRT pathway protein, Bro1p. Humans express three Bro1 domain-containing proteins: ALIX, BROX, and HD-PTP, which function in association with the ESCRT pathway to help mediate intraluminal vesicle formation at multivesicular bodies, the abscission stage of cytokinesis, and/or enveloped virus budding. Human Bro1 domains share the ability to bind the CHMP4 subset of ESCRT-III proteins, associate with the HIV-1 NCGag protein, and stimulate the budding of viral Gag proteins. The curved Bro1 domain structure has also been proposed to mediate membrane bending. To date, crystal structures have only been available for the related Bro1 domains from the Bro1p and ALIX proteins, and structures of additional family members should therefore aid in the identification of key structural and functional elements.

Methodology/Principal Findings

We report the crystal structure of the human BROX protein, which comprises a single Bro1 domain. The Bro1 domains from BROX, Bro1p and ALIX adopt similar overall structures and share two common exposed hydrophobic surfaces. Surface 1 is located on the concave face and forms the CHMP4 binding site, whereas Surface 2 is located at the narrow end of the domain. The structures differ in that only ALIX has an extended loop that projects away from the convex face to expose the hydrophobic Phe105 side chain at its tip. Functional studies demonstrated that mutations in Surface 1, Surface 2, or Phe105 all impair the ability of ALIX to stimulate HIV-1 budding.

Conclusions/Significance

Our studies reveal similarities in the overall folds and hydrophobic protein interaction sites of different Bro1 domains, and show that a unique extended loop contributes to the ability of ALIX to function in HIV-1 budding.  相似文献   

9.
Wu H  Zeng H  Lam R  Tempel W  Amaya MF  Xu C  Dombrovski L  Qiu W  Wang Y  Min J 《PloS one》2011,6(6):e18919

Background

The PWWP domain was first identified as a structural motif of 100–130 amino acids in the WHSC1 protein and predicted to be a protein-protein interaction domain. It belongs to the Tudor domain ‘Royal Family’, which consists of Tudor, chromodomain, MBT and PWWP domains. While Tudor, chromodomain and MBT domains have long been known to bind methylated histones, PWWP was shown to exhibit histone binding ability only until recently.

Methodology/Principal Findings

The PWWP domain has been shown to be a DNA binding domain, but sequence analysis and previous structural studies show that the PWWP domain exhibits significant similarity to other ‘Royal Family’ members, implying that the PWWP domain has the potential to bind histones. In order to further explore the function of the PWWP domain, we used the protein family approach to determine the crystal structures of the PWWP domains from seven different human proteins. Our fluorescence polarization binding studies show that PWWP domains have weak histone binding ability, which is also confirmed by our NMR titration experiments. Furthermore, we determined the crystal structures of the BRPF1 PWWP domain in complex with H3K36me3, and HDGF2 PWWP domain in complex with H3K79me3 and H4K20me3.

Conclusions

PWWP proteins constitute a new family of methyl lysine histone binders. The PWWP domain consists of three motifs: a canonical β-barrel core, an insertion motif between the second and third β-strands and a C-terminal α-helix bundle. Both the canonical β-barrel core and the insertion motif are directly involved in histone binding. The PWWP domain has been previously shown to be a DNA binding domain. Therefore, the PWWP domain exhibits dual functions: binding both DNA and methyllysine histones.

Enhanced version

This article can also be viewed as an enhanced version in which the text of the article is integrated with interactive 3D representations and animated transitions. Please note that a web plugin is required to access this enhanced functionality. Instructions for the installation and use of the web plugin are available in Text S1.  相似文献   

10.

Background

The members of cupin superfamily exhibit large variations in their sequences, functions, organization of domains, quaternary associations and the nature of bound metal ion, despite having a conserved β-barrel structural scaffold. Here, an attempt has been made to understand structure-function relationships among the members of this diverse superfamily and identify the principles governing functional diversity. The cupin superfamily also contains proteins for which the structures are available through world-wide structural genomics initiatives but characterized as “hypothetical”. We have explored the feasibility of obtaining clues to functions of such proteins by means of comparative analysis with cupins of known structure and function.

Methodology/Principal Findings

A 3-D structure-based phylogenetic approach was undertaken. Interestingly, a dendrogram generated solely on the basis of structural dissimilarity measure at the level of domain folds was found to cluster functionally similar members. This clustering also reflects an independent evolution of the two domains in bicupins. Close examination of structural superposition of members across various functional clusters reveals structural variations in regions that not only form the active site pocket but are also involved in interaction with another domain in the same polypeptide or in the oligomer.

Conclusions/Significance

Structure-based phylogeny of cupins can influence identification of functions of proteins of yet unknown function with cupin fold. This approach can be extended to other proteins with a common fold that show high evolutionary divergence. This approach is expected to have an influence on the function annotation in structural genomics initiatives.  相似文献   

11.

Background and Aims

The OVATE gene encodes a nuclear-localized regulatory protein belonging to a distinct family of plant-specific proteins known as the OVATE family proteins (OFPs). OVATE was first identified as a key regulator of fruit shape in tomato, with nonsense mutants displaying pear-shaped fruits. However, the role of OFPs in plant development has been poorly characterized.

Methods

Public databases were searched and a total of 265 putative OVATE protein sequences were identified from 13 sequenced plant genomes that represent the major evolutionary lineages of land plants. A phylogenetic analysis was conducted based on the alignment of the conserved OVATE domain from these 13 selected plant genomes. The expression patterns of tomato SlOFP genes were analysed via quantitative real-time PCR. The pattern of OVATE gene duplication resulting in the expansion of the gene family was determined in arabidopsis, rice and tomato.

Key Results

Genes for OFPs were found to be present in all the sampled land plant genomes, including the early-diverged lineages, mosses and lycophytes. Phylogenetic analysis based on the amino acid sequences of the conserved OVATE domain defined 11 sub-groups of OFPs in angiosperms. Different evolutionary mechanisms are proposed for OVATE family evolution, namely conserved evolution and divergent expansion. Characterization of the AtOFP family in arabidopsis, the OsOFP family in rice and the SlOFP family in tomato provided further details regarding the evolutionary framework and revealed a major contribution of tandem and segmental duplications towards expansion of the OVATE gene family.

Conclusions

This first genome-wide survey on OFPs provides new insights into the evolution of the OVATE protein family and establishes a solid base for future functional genomics studies on this important but poorly characterized regulatory protein family in plants.  相似文献   

12.

Background

Scaffolding proteins of the intersectin (ITSN) family, ITSN1 and ITSN2, are crucial for the initiation stage of clathrin-mediated endocytosis. These proteins are closely related but have implications in distinct pathologies. To determine how these proteins could be separated in certain cell pathways we performed a comparative study of ITSNs.

Methodology/Principal Findings

We have shown that endogenous ITSN1 and ITSN2 colocalize and form a complex in cells. A structural comparison of five SH3 domains, which mediated most ITSNs protein-protein interactions, demonstrated a similarity of their ligand-binding sites. We showed that the SH3 domains of ITSN2 bound well-established interactors of ITSN1 as well as newly identified ITSNs protein partners. A search for a novel interacting interface revealed multiple tyrosines that could be phosphorylated in ITSN2. Phosphorylation of ITSN2 isoforms but not ITSN1 short isoform was observed in various cell lines. EGF stimulation of HeLa cells enhanced tyrosine phosphorylation of ITSN2 isoforms and enabled their recognition by the SH2 domains of the Fyn, Fgr and Abl1 kinases, the regulatory subunit of PI3K, the adaptor proteins Grb2 and Crk, and phospholipase C gamma. The SH2 domains mentioned were unable to bind ITSN1 short isoform.

Conclusions/Significance

Our results indicate that during evolution of vertebrates ITSN2 acquired a novel protein-interaction interface that allows its specific recognition by the SH2 domains of signaling proteins. We propose that these data could be important to understand the functional diversity of paralogous ITSN proteins.  相似文献   

13.

Background

Polyketides are a diverse group of biotechnologically important secondary metabolites that are produced by multi domain enzymes called polyketide synthases (PKS).

Methodology/Principal Findings

We have estimated frequencies of type I PKS (PKS I) – a PKS subgroup – in natural environments by using Hidden-Markov-Models of eight domains to screen predicted proteins from six metagenomic shotgun data sets. As the complex PKS I have similarities to other multi-domain enzymes (like those for the fatty acid biosynthesis) we increased the reliability and resolution of the dataset by maximum-likelihood trees. The combined information of these trees was then used to discriminate true PKS I domains from evolutionary related but functionally different ones. We were able to identify numerous novel PKS I proteins, the highest density of which was found in Minnesota farm soil with 136 proteins out of 183,536 predicted genes. We also applied the protocol to UniRef database to improve the annotation of proteins with so far unknown function and identified some new instances of horizontal gene transfer.

Conclusions/Significance

The screening approach proved powerful in identifying PKS I sequences in large sequence data sets and is applicable to many other protein families.  相似文献   

14.

Background

Chromosome conformation capture studies suggest that eukaryotic genomes are organized into structures called topologically associating domains. The borders of these domains are highly enriched for architectural proteins with characterized roles in insulator function. However, a majority of architectural protein binding sites localize within topological domains, suggesting sites associated with domain borders represent a functionally different subclass of these regulatory elements. How topologically associating domains are established and what differentiates border-associated from non-border architectural protein binding sites remain unanswered questions.

Results

By mapping the genome-wide target sites for several Drosophila architectural proteins, including previously uncharacterized profiles for TFIIIC and SMC-containing condensin complexes, we uncover an extensive pattern of colocalization in which architectural proteins establish dense clusters at the borders of topological domains. Reporter-based enhancer-blocking insulator activity as well as endogenous domain border strength scale with the occupancy level of architectural protein binding sites, suggesting co-binding by architectural proteins underlies the functional potential of these loci. Analyses in mouse and human stem cells suggest that clustering of architectural proteins is a general feature of genome organization, and conserved architectural protein binding sites may underlie the tissue-invariant nature of topologically associating domains observed in mammals.

Conclusions

We identify a spectrum of architectural protein occupancy that scales with the topological structure of chromosomes and the regulatory potential of these elements. Whereas high occupancy architectural protein binding sites associate with robust partitioning of topologically associating domains and robust insulator function, low occupancy sites appear reserved for gene-specific regulation within topological domains.  相似文献   

15.

Background

Genome variation is very high in influenza A viruses. However, viral evolution and spreading is strongly influenced by immunogenic features and capacity to bind host cells, depending in turn on the two major capsidic proteins. Therefore, such viruses are classified based on haemagglutinin and neuraminidase types, e.g. H5N1. Current analyses of viral evolution are based on serological and primary sequence comparison; however, comparative structural analysis of capsidic proteins can provide functional insights on surface regions possibly crucial to antigenicity and cell binding.

Results

We performed extensive structural comparison of influenza virus haemagglutinins and of their domains and subregions to investigate type- and/or domain-specific variation. We found that structural closeness and primary sequence similarity are not always tightly related; moreover, type-specific features could be inferred when comparing surface properties of haemagglutinin subregions, monomers and trimers, in terms of electrostatics and hydropathy. Focusing on H5N1, we found that variation at the receptor binding domain surface intriguingly relates to branching of still circulating clades from those ones that are no longer circulating.

Conclusions

Evidence from this work suggests that integrating phylogenetic and serological analyses by extensive structural comparison can help in understanding the ‘functional evolution’ of viral surface determinants. In particular, variation in electrostatic and hydropathy patches can provide molecular evolution markers: intriguing surface charge redistribution characterizing the haemagglutinin receptor binding domains from circulating H5N1 clades 2 and 7 might have contributed to antigenic escape hence to their evolutionary success and spreading.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-014-0363-5) contains supplementary material, which is available to authorized users.  相似文献   

16.
17.

Background

Enterotoxigenic Escherichia coli (ETEC) is a major diarrheal pathogen in developing countries, where it accounts for millions of infections and hundreds of thousands of deaths annually. While vaccine development to prevent diarrheal illness due to ETEC is feasible, extensive effort is needed to identify conserved antigenic targets. Pathogenic Escherichia coli, including ETEC, use the autotransporter (AT) secretion mechanism to export virulence factors. AT proteins are comprised of a highly conserved carboxy terminal outer membrane beta barrel and a surface-exposed amino terminal passenger domain. Recent immunoproteomic studies suggesting that multiple autotransporter passenger domains are recognized during ETEC infection prompted the present studies.

Methodology

Available ETEC genomes were examined to identify AT coding sequences present in pathogenic isolates, but not in the commensal E. coli HS strain. Passenger domains of the corresponding autotransporters were cloned and expressed as recombinant antigens, and the immune response to these proteins was then examined using convalescent sera from patients and experimentally infected mice.

Principal Findings

Potential AT genes shared by ETEC strains, but absent in the E. coli commensal HS strain were identified. Recombinant passenger domains derived from autotransporters, including Ag43 and an AT designated pAT, were recognized by antibodies from mice following intestinal challenge with H10407, and both Ag43 and pAT were identified on the surface of ETEC by flow cytometry. Likewise, convalescent sera from patients with ETEC diarrhea recognized Ag43 and pAT, suggesting that these proteins are expressed during both experimental and naturally occurring ETEC infections and that they are immunogenic. Vaccination of mice with recombinant passenger domains from either pAT or Ag43 afforded protection against intestinal colonization with ETEC.

Conclusions

Passenger domains of conserved autotransporter proteins could contribute to protective immune responses that develop following infection with ETEC, and these antigens consequently represent potential targets to explore in vaccine development.  相似文献   

18.

Background

Computational prediction of protein interactions typically use protein domains as classifier features because they capture conserved information of interaction surfaces. However, approaches relying on domains as features cannot be applied to proteins without any domain information. In this paper, we explore the contribution of pure amino acid composition (AAC) for protein interaction prediction. This simple feature, which is based on normalized counts of single or pairs of amino acids, is applicable to proteins from any sequenced organism and can be used to compensate for the lack of domain information.

Results

AAC performed at par with protein interaction prediction based on domains on three yeast protein interaction datasets. Similar behavior was obtained using different classifiers, indicating that our results are a function of features and not of classifiers. In addition to yeast datasets, AAC performed comparably on worm and fly datasets. Prediction of interactions for the entire yeast proteome identified a large number of novel interactions, the majority of which co-localized or participated in the same processes. Our high confidence interaction network included both well-studied and uncharacterized proteins. Proteins with known function were involved in actin assembly and cell budding. Uncharacterized proteins interacted with proteins involved in reproduction and cell budding, thus providing putative biological roles for the uncharacterized proteins.

Conclusion

AAC is a simple, yet powerful feature for predicting protein interactions, and can be used alone or in conjunction with protein domains to predict new and validate existing interactions. More importantly, AAC alone performs at par with existing, but more complex, features indicating the presence of sequence-level information that is predictive of interaction, but which is not necessarily restricted to domains.  相似文献   

19.

Background

Src homology 2 (SH2) domain is a conserved module involved in various biological processes. Tensin family member was reported to be involved in tumor suppression by interacting with DLC-1 (deleted-in-liver-cancer-1) via its SH2 domain. We explore here the important questions that what the structure of tensin2 SH2 domain is, and how it binds to DLC-1, which might reveal a novel binding mode.

Principal Findings

Tensin2 SH2 domain adopts a conserved SH2 fold that mainly consists of five β-strands flanked by two α-helices. Most SH2 domains recognize phosphorylated ligands specifically. However, tensin2 SH2 domain was identified to interact with nonphosphorylated ligand (DLC-1) as well as phosphorylated ligand.

Conclusions

We determined the solution structure of tensin2 SH2 domain using NMR spectroscopy, and revealed the interactions between tensin2 SH2 domain and its ligands in a phosphotyrosine-independent manner.  相似文献   

20.

Background

While many authors have discussed models and tools for studying protein evolution at the sequence level, molecular function is usually mediated by complex, higher order features such as independently folding domains and linear motifs that are based on or embedded in a particular arrangment of features such as secondary structure elements, transmembrane domains and regions with intrinsic disorder. This ‘protein architecture’ can, in its most simplistic representation, be visualized as domain organization cartoons that can be used to compare proteins in terms of the order of their mostly globular domains.

Methodology

Here, we describe a visual approach and a webserver for protein comparison that extend the domain organization cartoon concept. By developing an information-rich, compact visualization of different protein features above the sequence level, potentially related proteins can be compared at the level of propensities for secondary structure, transmembrane domains and intrinsic disorder, in addition to PFAM domains. A public Web server is available at www.proteinarchitect.net, while the code is provided at protarchitect.sourceforge.net.

Conclusions/Significance

Due to recent advances in sequencing technologies we are now flooded with millions of predicted proteins that await comparative analysis. In many cases, mature tools focused on revealing hits with considerable global or local similarity to well-characterized proteins will not be able to lead us to testable hypotheses about a protein''s function, or the function of a particular region. The visual comparison of different types of protein features with ProteinArchitect will be useful when assessing the relevance of similarity search hits, to discover subgroups in protein families and superfamilies, and to understand protein regions with conserved features outside globular regions. Therefore, this approach is likely to help researchers to develop testable hypotheses about a protein''s function even if is somewhat distant from the more characterized proteins, by facilitating the discovery of features that are conserved above the sequence level for comparison and further experimental investigation.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号