期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

The X-ray crystallographic structure and activity analysis of a Pseudomonas-specific subfamily of the HAD enzyme superfamily evidences a novel biochemical function

Peisach E Wang L Burroughs AM Aravind L Dunaway-Mariano D Allen KN 《Proteins》2008,70(1):197-207

The haloacid dehalogenase (HAD) superfamily is a large family of proteins dominated by phosphotransferases. Thirty-three sequence families within the HAD superfamily (HADSF) have been identified to assist in function assignment. One such family includes the enzyme phosphoacetaldehyde hydrolase (phosphonatase). Phosphonatase possesses the conserved Rossmanniod core domain and a C1-type cap domain. Other members of this family do not possess a cap domain and because the cap domain of phosphonatase plays an important role in active site desolvation and catalysis, the function of the capless family members must be unique. A representative of the capless subfamily, PSPTO_2114, from the plant pathogen Pseudomonas syringae, was targeted for catalytic activity and structure analyses. The X-ray structure of PSPTO_2114 reveals a capless homodimer that conserves some but not all of the intersubunit contacts contributed by the core domains of the phosphonatase homodimer. The region of the PSPTO_2114 that corresponds to the catalytic scaffold of phosphonatase (and other HAD phosphotransfereases) positions amino acid residues that are ill suited for Mg+2 cofactor binding and mediation of phosphoryl group transfer between donor and acceptor substrates. The absence of phosphotransferase activity in PSPTO_2114 was confirmed by kinetic assays. To explore PSPTO_2114 function, the conservation of sequence motifs extending outside of the HADSF catalytic scaffold was examined. The stringently conserved residues among PSPTO_2114 homologs were mapped onto the PSPTO_2114 three-dimensional structure to identify a surface region unique to the family members that do not possess a cap domain. The hypothesis that this region is used in protein-protein recognition is explored to define, for the first time, HADSF proteins which have acquired a function other than that of a catalyst. 相似文献

2.

Clustering of multi‐domain protein sequences

下载免费PDF全文

Prachi Mehrotra Vimla Kany G. Ami Narayanaswamy Srinivasan 《Proteins》2018,86(7):759-776

The overall function of a multi‐domain protein is determined by the functional and structural interplay of its constituent domains. Traditional sequence alignment‐based methods commonly utilize domain‐level information and provide classification only at the level of domains. Such methods are not capable of taking into account the contributions of other domains in the proteins, and domain‐linker regions and classify multi‐domain proteins. An alignment‐free protein sequence comparison tool, CLAP (CLAssification of Proteins) was previously developed in our laboratory to especially handle multi‐domain protein sequences without a requirement of defining domain boundaries and sequential order of domains. Through this method we aim to achieve a biologically meaningful classification scheme for multi‐domain protein sequences. In this article, CLAP‐based classification has been explored on 5 datasets of multi‐domain proteins and we present detailed analysis for proteins containing (1) Tyrosine phosphatase and (2) SH3 domain. At the domain‐level CLAP‐based classification scheme resulted in a clustering similar to that obtained from an alignment‐based method. CLAP‐based clusters obtained for full‐length datasets were shown to comprise of proteins with similar functions and domain architectures. Our study demonstrates that multi‐domain proteins could be classified effectively by considering full‐length sequences without a requirement of identification of domains in the sequence. 相似文献

3.

The X-ray crystal structures of human alpha-phosphomannomutase 1 reveal the structural basis of congenital disorder of glycosylation type 1a

Silvaggi NR Zhang C Lu Z Dai J Dunaway-Mariano D Allen KN 《The Journal of biological chemistry》2006,281(21):14918-14926

Congenital disorder of glycosylation type 1a (CDG-1a) is a congenital disease characterized by severe defects in nervous system development. It is caused by mutations in alpha-phosphomannomutase (of which there are two isozymes, alpha-PMM1 and alpha-PPM2). Here we report the x-ray crystal structures of human alpha-PMM1 in the open conformation, with and without the bound substrate, alpha-D-mannose 1-phosphate. Alpha-PMM1, like most haloalkanoic acid dehalogenase superfamily (HADSF) members, consists of two domains, the cap and core, which open to bind substrate and then close to provide a solvent-exclusive environment for catalysis. The substrate phosphate group is observed at a positively charged site of the cap domain, rather than at the core domain phosphoryl-transfer site defined by the Asp(19) nucleophile and Mg(2+) cofactor. This suggests that substrate binds first to the cap and then is swept into the active site upon cap closure. The orientation of the acid/base residue Asp(21) suggests that alpha-phosphomannomutase (alpha-PMM) uses a different method of protecting the aspartylphosphate from hydrolysis than the HADSF member beta-phosphoglucomutase. It is hypothesized that the electrostatic repulsion of positive charges at the interface of the cap and core domains stabilizes alpha-PMM1 in the open conformation and that the negatively charged substrate binds to the cap, thereby facilitating its closure over the core domain. The two isozymes, alpha-PMM1 and alpha-PMM2, are shown to have a conserved active-site structure and to display similar kinetic properties. Analysis of the known mutation sites in the context of the structures reveals the genotype-phenotype relationship underlying CDG-1a. 相似文献

4.

Adeno-associated virus type 2 VP2 capsid protein is nonessential and can tolerate large peptide insertions at its N terminus

Warrington KH Gorbatyuk OS Harrison JK Opie SR Zolotukhin S Muzyczka N 《Journal of virology》2004,78(12):6595-6609

Direct insertion of amino acid sequences into the adeno-associated virus type 2 (AAV) capsid open reading frame (cap ORF) is one strategy currently being developed for retargeting this prototypical gene therapy vector. While this approach has successfully resulted in the formation of AAV particles that have expanded or retargeted viral tropism, the inserted sequences have been relatively short, linear receptor binding ligands. Since many receptor-ligand interactions involve nonlinear, conformation-dependent binding domains, we investigated the insertion of full-length peptides into the AAV cap ORF. To minimize disruption of critical VP3 structural domains, we confined the insertions to residue 138 within the VP1-VP2 overlap, which has been shown to be on the surface of the particle following insertion of smaller epitopes. The insertion of coding sequences for the 8-kDa chemokine binding domain of rat fractalkine (CX3CL1), the 18-kDa human hormone leptin, and the 30-kDa green fluorescent protein (GFP) after residue 138 failed to lead to formation of particles due to the loss of VP3 expression. To test the ability to complement these insertions with the missing capsid proteins in trans, we designed a system for producing AAV vectors in which expression of one capsid protein is isolated and combined with the remaining two capsid proteins expressed separately. Such an approach allows for genetic modification of a specific capsid protein across its entire coding sequence leaving the remaining capsid proteins unaffected. An examination of particle formation from the individual components of the system revealed that genome-containing particles formed as long as the VP3 capsid protein was present and demonstrated that the VP2 capsid protein is nonessential for viral infectivity. Viable particles composed of all three capsid proteins were obtained from the capsid complementation groups regardless of which capsid proteins were supplied separately in trans. Significant overexpression of VP2 resulted in the formation of particles with altered capsid protein stoichiometry. The key finding was that by using this system we successfully obtained nearly wild-type levels of recombinant AAV-like particles with large ligands inserted after residue 138 in VP1 and VP2 or in VP2 exclusively. While insertions at residue 138 in VP1 significantly decreased infectivity, insertions at residue 138 that were exclusively in VP2 had a minimal effect on viral assembly or infectivity. Finally, insertion of GFP into VP1 and VP2 resulted in a particle whose trafficking could be temporally monitored by using confocal microscopy. Thus, we have demonstrated a method that can be used to insert large (up to 30-kDa) peptide ligands into the AAV particle. This system allows greater flexibility than current approaches in genetically manipulating the composition of the AAV particle and, in particular, may allow vector retargeting to alternative receptors requiring interaction with full-length conformation-dependent peptide ligands. 相似文献

5.

Ribosomal DNA in Drosophila melanogaster. II. Heteroduplex mapping of cloned and uncloned rDNA. 总被引：5，自引：0，他引：5

P K Wellauer I B Dawid 《Journal of molecular biology》1978,126(4):769-782

Sequences in the cloned Drosophila melanogaster rDNA fragments described by Dawid et al. (1978) were compared by heteroduplex mapping. The nontranscribed spacer regions in all fragments are homologous but vary in length. Deletion loops were observed at variable positions in the spacer region suggesting that spacers are internally repetitious.Many rDNA repeats in D. melanogaster have a 28 S gene interrupted by a region named the ribosomal insertion. Insertions of 0.5, 1 and 5 kb were found in repeat-length EcoRI fragments. These DNA regions, named type 1 insertions, are homologous at their right ends. Although 1 kb insertions are quite precisely twice as large as 0.5 kb insertions they do not represent a duplication of the shorter sequence. Some insertions have at least one EcoRI site and therefore yield EcoRI fragments which are only part of a repeat. The sequences in two cloned right-hand partial insertion sequences are homologous, but the sequences in two lefthand partial insertions are not. None of the EcoRI-restrictable insertion sequences has any homology to any part of type 1 insertions; they are thus grouped together as type 2. Evidence for insertion sequences of at least two types in uncloned rDNA was obtained by annealing a cloned fragment with a 1 kb insertion to genomic rDNA. About 15% of the rDNA repeats show substitution type loops between the 1 kb type 1 insertion derived from the cloned fragment and type 2 insertions in the rDNA. 相似文献

6.

TU elements: a heterogeneous family of modularly structured eucaryotic transposons. 总被引：7，自引：5，他引：2

下载免费PDF全文

B Hoffman-Liebermann D Liebermann L H Kedes S N Cohen 《Molecular and cellular biology》1985,5(5):991-1001

We describe here a family of foldback transposons found in the genome of the higher eucaryote, the sea urchin Strongylocentrotus purpuratus. Two major classes of TU elements have been identified by analysis of genomic DNA and TU element clones. One class consists of largely similar elements with long terminal inverted repeats (IVRs) containing outer and inner domains and sharing a common middle segment that can undergo deletions. Some of these elements contain insertions. The second class is highly heterogeneous, with many different middle segments nonhomologous to those of the first-class and variable-sized inverted repeats that contain only an outer domain. The middle and insertion segments of both classes carry sequences that also are found unassociated from the inverted repeats at many other genomic locations. We conclude that the TU elements are modular structures composed of inverted repeats plus other sequence domains that are themselves members of different families of dispersed repetitive sequences. Such modular elements may have a role in the dispersion and rearrangement of genomic DNA segments. 相似文献

7.

Diversity and evolution of the thyroglobulin type-1 domain superfamily 总被引：1，自引：0，他引：1

Novinec M Kordis D Turk V Lenarcic B 《Molecular biology and evolution》2006,23(4):744-755

Multidomain proteins are gaining increasing consideration for their puzzling, flexible utilization in nature. The presence of the characteristic thyroglobulin type-1 (Tg1) domain as a protein module in a variety of multicellular organisms suggests pivotal roles for this building block. To gain insight into the evolution of Tg1 domains, we performed searches of protein, expressed sequence tag, and genome databases. Tg1 domains were found to be Metazoa specific, and we retrieved a total of 170 Tg1 domain-containing protein sequences. Their architectures revealed a wide taxonomic distribution of proteins containing Tg1 domains followed or preceded by secreted protein, acidic, rich in cysteines (SPARC)-type extracellular calcium-binding domains. Other proteins contained lineage-specific domain combinations of peptidase inhibitory modules or domains with different biological functions. Phylogenetic analysis showed that Tg1 domains are highly conserved within protein structures, whereas insertion into novel proteins is followed by rapid diversification. Seven different basic types of protein architecture containing the Tg1 domain were identified in vertebrates. We examined the evolution of these protein groups by combining Tg1 domain phylogeny with additional analyses based on other characteristic domains. Testicans and secreted modular calcium binding protein (SMOCs) evolved from invertebrate homologs by introduction of vertebrate-specific domains, nidogen evolved by insertion of a Tg1 domain into a preexisting architecture, and the remaining four have unique architectures. Thyroglobulin, Trops, and the major histocompatibility complex class II-associated invariant chain are vertebrate specific, while an insulin-like growth factor-binding protein and nidogen were also identified in urochordates. Among vertebrates, we observed differences in protein repertoires, which result from gene duplication and domain duplication. Members of five groups have been characterized at the molecular level. All exhibit subtle differences in their specificities and function either as peptidase inhibitors (thyropins), substrates, or both. As far as the sequence is concerned, only a few conserved residues were identified. In combination with structural data, our analysis shows that the Tg1 domain fold is highly adaptive and comprises a relatively well-conserved core surrounded by highly variable loops that account for its multipurpose function in the animal kingdom. 相似文献

8.

Linking the functions of unrelated proteins using a novel directed evolution domain insertion method

Edwards WR Busse K Allemann RK Jones DD 《Nucleic acids research》2008,36(13):e78

We have successfully developed a new directed evolution method for generating integral protein fusions comprising of one domain inserted within another. Creating two connections between the insert and accepting parent domain can result in the inter-dependence of the separate protein activities, thus providing a general strategy for constructing molecular switches. Using an engineered transposon termed MuDel, contiguous trinucleotide sequences were removed at random positions from the bla gene encoding TEM-1 beta-lactamase. The deleted trinucleotide sequence was then replaced by a DNA cassette encoding cytochrome b(562) with differing linking sequences at each terminus and sampling all three reading frames. The result was a variety of chimeric genes encoding novel integral fusion proteins that retained TEM-1 activity. While most of the tolerated insertions were observed in loops, several also occurred close to the termini of alpha-helices and beta-strands. Several variants conferred a switching phenotype on Escherichia coli, with bacterial tolerance to ampicillin being dependent on the presence of haem in the growth medium. The magnitude of the switching phenotype ranged from 4- to 128-fold depending on the insertion position within TEM-1 and the linker sequences that join the two domains. 相似文献

9.

A novel pro-Arg motif recognized by WW domains

Bedford MT Sarbassova D Xu J Leder P Yaffe MB 《The Journal of biological chemistry》2000,275(14):10359-10369

WW domains mediate protein-protein interactions through binding to short proline-rich sequences. Two distinct sequence motifs, PPXY and PPLP, are recognized by different classes of WW domains, and another class binds to phospho-Ser-Pro sequences. We now describe a novel Pro-Arg sequence motif recognized by a different class of WW domains using data from oriented peptide library screening, expression cloning, and in vitro binding experiments. The prototype member of this group is the WW domain of formin-binding protein 30 (FBP30), a p53-regulated molecule whose WW domains bind to Pro-Arg-rich cellular proteins. This new Pro-Arg sequence motif re-classifies the organization of WW domains based on ligand specificity, and the Pro-Arg class now includes the WW domains of FBP21 and FE65. A structural model is presented which rationalizes the distinct motifs selected by the WW domains of YAP, Pin1, and FBP30. The Pro-Arg motif identified for WW domains often overlaps with SH3 domain motifs within protein sequences, suggesting that the same extended proline-rich sequence could form discrete SH3 or WW domain complexes to transduce distinct cellular signals. 相似文献

10.

Prediction and identification of a permissive epitope insertion site in the vesicular stomatitis virus glycoprotein

下载免费PDF全文

Schlehuber LD Rose JK 《Journal of virology》2004,78(10):5079-5087

We developed a rational approach to identify a site in the vesicular stomatitis virus (VSV) glycoprotein (G) that is exposed on the protein surface and tolerant of foreign epitope insertion. The foreign epitope inserted was the six-amino-acid sequence ELDKWA, a sequence in a neutralizing epitope from human immunodeficiency virus type 1. This sequence was inserted into six sites within the VSV G protein (Indiana serotype). Four sites were selected based on hydrophilicity and high sequence variability identified by sequence comparison with other vesiculovirus G proteins. The site showing the highest variability was fully tolerant of the foreign peptide insertion. G protein containing the insertion at this site folded correctly, was transported normally to the cell surface, had normal membrane fusion activity, and could reconstitute fully infectious VSV. The virus was neutralized by the human 2F5 monoclonal antibody that binds the ELDKWA epitope. Additional studies showed that this site in G protein tolerated insertion of at least 16 amino acids while retaining full infectivity. The three other insertions in somewhat less variable sequences interfered with VSV G folding and transport to the cell surface. Two additional insertions were made in a conserved sequence adjacent to a glycosylation site and near the transmembrane domain. The former blocked G-protein transport, while the latter allowed transport to the cell surface but blocked membrane fusion activity of G protein. Identification of an insertion-tolerant site in VSV G could be important in future vaccine and targeting studies, and the general principle might also be useful in other systems. 相似文献

11.

Characterization of left‐handed beta helix‐domains,and identification and functional annotation of proteins containing such domains

Anu Prabha Petety V. Balaji 《Proteins》2021,89(1):6-20

Only about 0.3% of the entries in UniProt database have manually curated annotation. Annotation at the molecular level often relies on low‐throughput one‐protein‐at‐a‐time approach. Computational methods bridge this gap by assigning function based on sequence and/or fold similarity. Left‐handed beta helix (LbH) consists of three repeating six‐stranded beta‐strands forming an 18‐mer turn of the helix. Analysis of LbH‐domains showed that variations are found in the number of residues in a beta‐strand (5‐7, 6 being the most common), number of turns (4–10) of the helix, insertions of one or more loops of variable length (0‐36 residues), and the location of loop insertion. An 18‐mer HMM profile was created which identifies LbH‐domain containing proteins using sequence as the only input; the number of false positives is zero when proteins tested were those with known 3D structures. 136 474 entries of TrEMBL database were found to contain LbH‐domain. Rules developed by analyzing LbH‐domain containing acyltransferases, gamma‐class carbonic anhydrases, and nucleotidyltransferases have led to the annotation of 17 389 TrEMBL entries which currently have no functional tag. 相似文献

12.

Domain insertions in protein structures

Aroul-Selvam R Hubbard T Sasidharan R 《Journal of molecular biology》2004,338(4):633-641

相似文献

13.

Sensory transducers of E. coli are composed of discrete structural and functional domains 总被引：70，自引：0，他引：70

A Krikos N Mutoh A Boyd M I Simon 《Cell》1983,33(2):615-622

The tar and tsr genes of E. coli encode functionally analogous transducer proteins that mediate two distinct classes of chemotactic response. The tap gene lies adjacent to tar, and is thought to encode another transducer protein. We present here the complete nucleotide sequence of the tar-tap region of the E. coli genome, together with a comparative analysis of the sequences of the Tar, Tap, and Tsr proteins. The proteins appear to have a simple transmembrane structure consisting of an extracytoplasmic amino-terminal domain, a membrane-spanning domain, and an intracellular carboxy-terminal domain. The carboxy-terminal domains of three proteins possess highly homologous sequences and contain sites of methylation involved in sensory adaptation, while the amino-terminal sequences are only distantly related to one another, consistent with their serving as chemoreceptor domains that have diverged functionally. 相似文献

14.

The modular structure of ThDP‐dependent enzymes

Constantin Vogel Jürgen Pleiss 《Proteins》2014,82(10):2523-2537

Thiamine diphosphate (ThDP)‐dependent enzymes form a diverse protein family which was classified into nine superfamilies. The cofactor ThDP is bound at the interface between two catalytic domains, the PYR and the PP domain. The nine superfamilies were assigned to five different structural architectures. Two superfamilies, the sulfopyruvate decarboxylases and α‐ketoacid dehydrogenases 2, consist of separate PYR and PP domains. The oxidoreductase superfamily is of the intra‐monomer/PYR‐PP type with an N‐terminal PYR and a subsequent PP domain. The active enzymes form homodimers with the ThDP cofactor bound at the interface between a PYR and a PP domain of the same monomer. Decarboxylases are of the inter‐monomer/PYR‐PP type with the cofactor bound between domains from different monomers. 1‐Deoxy‐d ‐xylulose‐5‐phosphate synthases are of the intra‐monomer/PP‐PYR type. The transketolases, α‐ketoglutarate dehydrogenases, and α‐ketoacid dehydrogenases 1 are of the inter‐monomer/PP‐PYR type. For the phosphonopyruvate decarboxylases, definitive assessment of the structural architecture is not possible due to lack of structure information. By applying a structure‐based domain alignment method, sequences of more than 62,000 PYR and PP domains were identified and aligned. Although the sequence similarity of the catalytic domains is low between different superfamilies, seven positions were identified to be highly conserved, including the cofactor binding GDGX_24,27N motif, the cofactor‐activating glutamic acid, and two structurally equivalent glycines in both the PYR and the PP domain. An evolutionary pathway of ThDP‐dependent enzymes is proposed which explains the sequence and structure diversity of this family by three basic evolutionary events: domain recruitment, domain linkage, and structural rearrangement of catalytic domains. Proteins 2014; 82:2523–2537. © 2014 Wiley Periodicals, Inc. 相似文献

15.

Altered Motor Activity of Alternative Splice Variants of the Mammalian Kinesin-3 Protein KIF1B

Masafumi Matsushita Ruri Yamamoto Keiji Mitsui Hiroshi Kanazawa 《Traffic (Copenhagen, Denmark)》2009,10(11):1647-1654

Several mammalian kinesin motor proteins exist as multiple isoforms that arise from alternative splicing of a single gene. However, the roles of many motor protein splice variants remain unclear. The kinesin-3 motor protein KIF1B has alternatively spliced isoforms distinguished by the presence or absence of insertion sequences in the conserved amino-terminal region of the protein. The insertions are located in the loop region containing the lysine-rich cluster, also known as the K-loop, and in the hinge region adjacent to the motor domain. To clarify the functions of these alternative splice variants of KIF1B, we examined the biochemical properties of recombinant KIF1B with and without insertion sequences. In a microtubule-dependent ATPase assay, KIF1B variants that contained both insertions had higher activity and affinity for microtubules than KIF1B variants that contained no insertions. Mutational analysis of the K-loop insertion revealed that variants with a longer insertion sequence at this site had higher activity. However, the velocity of movement in motility assays was similar between KIF1B with and without insertion sequences. Our results indicate that splicing isoforms of KIF1B that vary in their insertion sequences have different motor activities. 相似文献

16.

Genetic and physical analysis of the nodD3 region of Rhizobium meliloti. 总被引：6，自引：1，他引：5

下载免费PDF全文

B G Rushing M M Yelton S R Long 《Nucleic acids research》1991,19(4):921-927

相似文献

17.

Structural and mutational analysis of E2 trans-activating proteins of papillomaviruses reveals three distinct functional domains. 总被引：42，自引：10，他引：32

I Giri M Yaniv 《The EMBO journal》1988,7(9):2823-2829

相似文献

18.

Analysis of the role of brome mosaic virus 1a protein domains in RNA replication, using linker insertion mutagenesis 总被引：3，自引：23，他引：3

下载免费PDF全文

P A Kroner B M Young P Ahlquist 《Journal of virology》1990,64(12):6110-6120

Brome mosaic virus (BMV) belongs to a "superfamily" of plant and animal positive-strand RNA viruses that share, among other features, three large domains of conserved sequence in nonstructural proteins involved in RNA replication. Two of these domains reside in the 109-kDa BMV 1a protein. To examine the role of 1a, we used biologically active cDNA clones of BMV RNA1 to construct a series of linker insertion mutants bearing two-codon insertions dispersed throughout the 1a gene. The majority of these mutations blocked BMV RNA replication in protoplasts, indicating that both intervirally conserved domains function in RNA replication. Coinoculation tests with a large number of mutant combinations failed to reveal detectable complementation between mutations in the N- and C-terminal conserved domains, implying that these two domains either function in some directly interdependent fashion or must be present in the same protein. Four widely spaced mutations with temperature-sensitive (ts) defects in RNA replication were identified, including a strongly ts insertion near the nucleotide-binding consensus of the helicaselike C-terminal domain. Temperature shift experiments with this mutant show that 1a protein is required for continued accumulation of all classes of viral RNA (positive strand, negative strand, and subgenomic) and is required for at least the first 10 h of infection. ts mutations were also identified in the 3' noncoding region of RNA1, 5' to conserved sequences previously implicated in cis for replication. Under nonpermissive conditions, the cis-acting partial inhibition of RNA1 accumulation caused by these noncoding mutations was also associated with reduced levels of the other BMV genomic RNAs. Comparison with previous BMV mutant results suggests that RNA replication is more sensitive to reductions in expression of 1a than of 2a, the other BMV-encoded protein involved in replication. 相似文献

19.

Coiled‐coil length: Size does matter

下载免费PDF全文

Jaroslaw Surkont Yoan Diekmann Pearl V. Ryder Jose B. Pereira‐Leal 《Proteins》2015,83(12):2162-2169

Protein evolution is governed by processes that alter primary sequence but also the length of proteins. Protein length may change in different ways, but insertions, deletions and duplications are the most common. An optimal protein size is a trade‐off between sequence extension, which may change protein stability or lead to acquisition of a new function, and shrinkage that decreases metabolic cost of protein synthesis. Despite the general tendency for length conservation across orthologous proteins, the propensity to accept insertions and deletions is heterogeneous along the sequence. For example, protein regions rich in repetitive peptide motifs are well known to extensively vary their length across species. Here, we analyze length conservation of coiled‐coils, domains formed by an ubiquitous, repetitive peptide motif present in all domains of life, that frequently plays a structural role in the cell. We observed that, despite the repetitive nature, the length of coiled‐coil domains is generally highly conserved throughout the tree of life, even when the remaining parts of the protein change, including globular domains. Length conservation is independent of primary amino acid sequence variation, and represents a conservation of domain physical size. This suggests that the conservation of domain size is due to functional constraints. Proteins 2015; 83:2162–2169. © 2015 Wiley Periodicals, Inc. 相似文献

20.

Two‐domain analysis of JmjN‐JmjC and PHD‐JmjC lysine demethylases: Detecting an inter‐domain evolutionary stress

Patrick Slama 《Proteins》2018,86(1):3-12

Residues at different positions of a multiple sequence alignment sometimes evolve together, due to a correlated structural or functional stress at these positions. Co‐evolution has thus been evidenced computationally in multiple proteins or protein domains. Here, we wish to study whether an evolutionary stress is exerted on a sequence alignment across protein domains, i.e., on longer sequence separations than within a single protein domain. JmjC‐containing lysine demethylases were chosen for analysis, as a follow‐up to previous studies; these proteins are important multidomain epigenetic regulators. In these proteins, the JmjC domain is responsible for the demethylase activity, and surrounding domains interact with histones, DNA or partner proteins. This family of enzymes was analyzed at the sequence level, in order to determine whether the sequence of JmjC‐domains was affected by the presence of a neighboring JmjN domain or PHD finger in the protein. Multiple positions within JmjC sequences were shown to have their residue distributions significantly altered by the presence of the second domain. Structural considerations confirmed the relevance of the analysis for JmjN‐JmjC proteins, while among PHD‐JmjC proteins, the length of the linker region could be correlated to the residues observed at the most affected positions. The correlation of domain architecture with residue types at certain positions, as well as that of overall architecture with protein function, is discussed. The present results thus evidence the existence of an across‐domain evolutionary stress in JmjC‐containing demethylases, and provide further insights into the overall domain architecture of JmjC domain‐containing proteins. 相似文献