Similar literature
Found 20 similar documents.
1.
We examined sequences expressed in the brain and nervous system using EST data. A previous study including sequences thought to have neurological function found a deficiency of simple sequence within such sequences. This was despite many examples of neurodegenerative diseases, such as Huntington disease, which are thought to be caused by expansions of polyglutamine tracts within associated protein sequences. It may be that many of the sequences thought to have neurological function have other additional, non-neurological roles. For this reason, we examined sequences with specific expression in the brain and nervous system, using EST expression data to determine if they too are deficient in simple, repetitive sequences. Indeed, we find this class of sequences to be deficient. Unexpectedly, however, we find sequences expressed in the brain and nervous system to be consistently enriched for histidine-enriched simple sequence. Determining the function of these histidine-rich regions within brain-specific proteins requires more experimental data.

2.
Huntley MA  Golding GB 《Proteins》2002,48(1):134-140
Simple sequence is abundant in the proteins that have been sequenced to date. But unusual protein features, such as simple sequence, are not present in the same high frequency within structural databases. A subset of these simple sequences, a group with a highly repetitive nature, has been shown to be abundant in eukaryotes but not in prokaryotes. In this study, an examination of the eukaryotic proteins in the Protein Data Bank (PDB) has revealed a large deficiency of low complexity, highly repetitive protein repeats. Through simulated databases of similar samples of eukaryotic proteins taken from the National Center for Biotechnology Information (NCBI) database, it is shown that the PDB contains significantly less highly repetitive, simple sequence than artificial databases of similar composition randomly derived from NCBI. When the structural data for those few PDB sequences that did contain highly repetitive simple sequence is examined in detail, it is found that in most cases the tertiary structure is unknown for the regions consisting of simple sequence. This lack of simple sequence both in the PDB database and in the structural information suggests that this type of simple sequence may produce disordered structures that make structural characterization difficult.

3.
Amyloid fibrils often exhibit polymorphism. Polymorphs are formed when proteins or peptides with identical sequences self-assemble into fibrils containing substantially different arrangements of the β-strands. We used atomistic molecular-dynamics simulation to examine the thermodynamic stability of amyloid fibrils in different polymorphic forms by performing a systematic investigation of sequence and symmetry space for a series of peptides with a range of physicochemical properties. We show that the stability of fibrils depends on both sequence and symmetry because these factors determine the availability of favorable interactions between the peptide strands within a sheet and in intersheet packing. By performing a detailed analysis of these interactions as a function of symmetry, we obtained a series of simple design rules that can be used to determine which polymorphs of a given sequence are most likely to form thermodynamically stable fibrils. These rules can potentially be employed to design peptide sequences that aggregate into a preferred polymorphic form for nanotechnological purposes.

4.
Highly repetitive sequence within proteins is an abundant feature yet is considered by some to be the protein equivalent of "junk DNA." Homopolymer sequences, the most highly repetitive of this group, are typically encoded by trinucleotide repeats at the DNA level. It is thought that many of these sequences are produced by a replicative slippage mechanism. Recent studies suggest that these highly mutable regions within proteins may allow for rapid morphological evolution emerging from the increased variability afforded by such coding structures. However, in a homopolymer, it is difficult to determine if the repeated amino acid is due to slippage at the DNA level or due to selection at the protein level. Here we develop and test a model to detect cases for which the homopolymer tract has clearly been selected for, with no evidence of slippage at the DNA level. The polyserine tract within the phosphatidylserine receptor protein is used as an excellent example of one such case.
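
The distinction drawn in this abstract, slippage at the DNA level versus selection at the protein level, can be illustrated with a small sketch. It is not the authors' model, which the abstract does not describe. It only assumes that a tract produced by slippage tends to repeat one codon, whereas a tract encoded by a mix of synonymous codons shows little sign of it. The function name `codon_purity` and the example sequences are hypothetical.

```python
from collections import Counter


def codon_purity(tract_dna):
    """Summarize codon usage in the coding DNA of a homopolymer tract.

    Returns the number of codons, the number of distinct codons, and the
    length of the longest run of one identical codon. A long pure run is
    what slippage would be expected to leave behind (an assumption of
    this sketch, not a result from the abstract).
    """
    usable = len(tract_dna) - len(tract_dna) % 3  # ignore a trailing partial codon
    codons = [tract_dna[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(codons)

    longest = run = 0
    previous = None
    for codon in codons:
        run = run + 1 if codon == previous else 1
        longest = max(longest, run)
        previous = codon

    return {
        "codons": len(codons),
        "distinct": len(counts),
        "longest_pure_run": longest,
    }


# Hypothetical polyserine tracts: one pure AGC repeat, one using mixed serine codons.
pure_tract = codon_purity("AGCAGCAGCAGC")
mixed_tract = codon_purity("TCTAGCTCAAGTTCC")
```

Under this assumption, the pure repeat is consistent with slippage, while the mixed tract is the kind of case in which selection on the amino acid is the more plausible explanation.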

5.
We created artificial proteins that contained repeats of a short peptide motif, Asn-Gly-Asx. In nature this motif is repeated within shell proteins as an idiosyncratic domain, while in vitro it has been shown to suppress calcification. The motif was embedded within peptide sequences that did or did not have the ability to form secondary structures, which provided the motif with a variety of physicochemical properties. Although a short synthetic peptide containing the motif did not inhibit calcification in vitro, some of the artificial proteins carrying repeats of the motif did show robust suppression of calcification. Artificial proteins lacking the motif did not exhibit suppressive activity. Likewise, one construct containing multiple repeats of the motifs also did not exert an inhibitory effect on calcification. Apparently, carrying the Asn-Gly-Asx motif is not, by itself, sufficient for expression of its cryptic activity; instead, certain physicochemical properties of the polypeptides mediate its manifestation. We anticipate that syntheses using "motif programming", such as the one described here, will shed light on the origin of repetitive sequences as well as on the evolution of biomineralization proteins.

6.
Runs of identical amino acids encoded by triplet repeats (homopolymers) are components of numerous proteins, yet their role is poorly understood. Large numbers of homopolymers are present in the Drosophila melanogaster mastermind (mam) protein surrounding several unique charged amino acid clusters. Comparison of mam sequences from D. virilis and D. melanogaster reveals a high level of amino acid conservation in the charged clusters. In contrast, significant divergence is found in repetitive regions resulting from numerous amino acid replacements and large insertions and deletions. It appears that repetitive regions are under less selective pressure than unique regions, consistent with the idea that homopolymers act as flexible spacers separating functional domains in proteins. Notwithstanding extensive length variation in intervening homopolymers, there is extreme conservation of the amino acid spacing of specific charge clusters. The results support a model where homopolymer length variability is constrained by natural selection. Correspondence to: B. Yedvobnick
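
As a minimal illustration of what "runs of identical amino acids" means in practice, the sketch below locates homopolymer runs in a one-letter protein sequence. The function name `homopolymer_runs`, the default minimum length of 5, and the example sequence are assumptions made for illustration. The abstract does not state how runs were identified or what length threshold was used.

```python
import re


def homopolymer_runs(protein, min_len=5):
    """Return (residue, start_index, length) for each run of a single
    residue that is at least min_len long (0-based start index)."""
    # (.) captures one residue; \1{n,} requires at least n more copies of it.
    pattern = re.compile(r"(.)\1{%d,}" % (min_len - 1))
    return [
        (match.group(1), match.start(), len(match.group(0)))
        for match in pattern.finditer(protein)
    ]


# Hypothetical sequence with a polyglutamine run, a polyserine run,
# and a short histidine run that falls below the threshold.
example = "MK" + "Q" * 6 + "AG" + "S" * 7 + "TL" + "HHH" + "K"
runs = homopolymer_runs(example, min_len=5)
# runs == [("Q", 2, 6), ("S", 10, 7)]
```

Comparing the positions and lengths of such runs between orthologous sequences is one simple way to see the kind of length variation the abstract describes.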

7.
While tandem mass spectrometry (MS/MS) is routinely used to identify proteins from complex mixtures, certain types of proteins present unique challenges for MS/MS analyses. The major wheat gluten proteins, gliadins and glutenins, are particularly difficult to distinguish by MS/MS. Each of these groups contains many individual proteins with similar sequences that include repetitive motifs rich in proline and glutamine. These proteins have few cleavable tryptic sites, often resulting in only one or two tryptic peptides that may not provide sufficient information for identification. Additionally, there are fewer than 14,000 complete protein sequences from wheat in the current NCBInr release. In this paper, MS/MS methods were optimized for the identification of the wheat gluten proteins. Chymotrypsin and thermolysin as well as trypsin were used to digest the proteins and the collision energy was adjusted to improve fragmentation of chymotryptic and thermolytic peptides. Specialized databases were constructed that included protein sequences derived from contigs from several assemblies of wheat expressed sequence tags (ESTs), including contigs assembled from ESTs of the cultivar under study. Two different search algorithms were used to interrogate the database and the results were analyzed and displayed using a commercially available software package (Scaffold). We examined the effect of protein database content and size on the false discovery rate. We found that as database size increased above 30,000 sequences there was a decrease in the number of proteins identified. Also, the type of decoy database influenced the number of proteins identified. Using three enzymes, two search algorithms and a specialized database allowed us to greatly increase the number of detected peptides and distinguish proteins within each gluten protein group.
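
The abstract mentions a false discovery rate and decoy databases without saying how the rate was computed. One common convention, shown here only as a sketch under that assumption, estimates the rate at a score threshold as the number of decoy matches divided by the number of target matches at or above that threshold. The function name `decoy_fdr` and the example scores are hypothetical, and this is not a description of what Scaffold or either search algorithm does.

```python
def decoy_fdr(psms, threshold):
    """Estimate the false discovery rate at a score threshold.

    psms is a list of (score, is_decoy) pairs, one per peptide-spectrum
    match. The estimate is decoy hits divided by target hits among the
    matches scoring at or above the threshold (0.0 if no targets pass).
    """
    targets = sum(1 for score, is_decoy in psms if score >= threshold and not is_decoy)
    decoys = sum(1 for score, is_decoy in psms if score >= threshold and is_decoy)
    return decoys / targets if targets else 0.0


# Hypothetical search results: four target matches and two decoy matches.
matches = [(50, False), (45, False), (40, True), (35, False), (30, False), (20, True)]
fdr_loose = decoy_fdr(matches, threshold=30)   # 1 decoy / 4 targets
fdr_strict = decoy_fdr(matches, threshold=45)  # 0 decoys / 2 targets
```

Because the estimate depends on how many decoy sequences can match by chance, it is plausible that both the size of the database and the way the decoys are built would shift the number of proteins accepted, as the abstract reports.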

8.
By controlling the growth of inorganic crystals, macro-biomolecules, including proteins, play pivotal roles in modulating biomineralization. Natural proteins that promote biomineralization are often composed of simple repeats of peptide sequences; however, the relationship between these repetitive structures and their functions remains largely unknown. Here we show that an artificial protein containing a repeated peptide sequence allows NaCl, KCl, CuSO4 and sucrose to form a variety of macroscopic structures, as represented by their dendritic configurations. Mutational analyses revealed that the physicochemical characteristics of the protein, not the peptide sequence per se, were responsible for formation of the dendritic structures. This suggests that proteins that modulate crystal growth may have evolved as repeat-containing forms at a relatively high rate. These observations could serve as the basis for developing new genetic programming systems for creation of artificial proteins able to modulate crystal growth from inorganic compounds, and may thus provide a new tool for nano-biotechnology.

9.
Amyloid fibrils are fibrous beta-structures that derive from abnormal folding and assembly of peptides and proteins. Despite a wealth of structural studies on amyloids, the nature of the amyloid structure remains elusive; possible connections to natural, beta-structured fibrous motifs have been suggested. In this work we focus on understanding amyloid structure and formation from sequences of a natural, beta-structured fibrous protein. We show that short peptides (25 to 6 amino acids) corresponding to repetitive sequences from the adenovirus fiber shaft have an intrinsic capacity to form amyloid fibrils as judged by electron microscopy, Congo Red binding, infrared spectroscopy, and x-ray fiber diffraction. In the presence of the globular C-terminal domain of the protein that acts as a trimerization motif, the shaft sequences adopt a triple-stranded, beta-fibrous motif. We discuss the possible structure and arrangement of these sequences within the amyloid fibril, as compared with the one adopted within the native structure. A 6-amino acid peptide, corresponding to the last beta-strand of the shaft, was found to be sufficient to form amyloid fibrils. Structural analysis of these amyloid fibrils suggests that perpendicular stacking of beta-strand repeat units is an underlying common feature of amyloid formation.

10.
Oligopeptidases, and the emergence of the prolyl oligopeptidase family. (Total citations: 7; self-citations: 0; citations by others: 7)
Oligopeptidases are endopeptidases that are not proteinases in the strict sense, since they do not hydrolyse peptide bonds in proteins, but act only on smaller polypeptides or oligopeptides. These enzymes apparently perform important, specialized biological functions that include the modification or destruction of peptide messenger molecules. Oligopeptidases have few naturally occurring inhibitors, and their distinctive specificity prevents them from interacting with alpha 2-macroglobulin, unlike the great majority of endopeptidases. The specificity of these specialized endopeptidases doubtless depends upon the three-dimensional structure of the active site, but no crystallographic structure is yet available for an oligopeptidase. Study of the primary structure of prolyl oligopeptidase has recently shown that it is a member of a new family of serine-type peptidases, most of which are exopeptidases. The alignment of the sequences leads to the identification of some catalytic triad residues that have not yet been elucidated experimentally.

11.
We have isolated four repetitive DNA fragments from maize DNA. Only one of these sequences showed homology to sequences within the EMBL database, despite each having an estimated copy number of between 3 × 10^4 and 5 × 10^4 per haploid genome. Hybridization of the four repeats to maize mitotic chromosomes showed that the sequences are evenly dispersed throughout most, but not all, of the maize genome, whereas hybridization to yeast colonies containing random maize DNA fragments inserted into yeast artificial chromosomes (YACs) indicated that there was considerable clustering of the repeats at a local level. We have exploited the distribution of the repeats to produce repetitive sequence fingerprints of individual YAC clones. These fingerprints not only provide information about the occurrence and organization of the repetitive sequences within the maize genome, but they can also be used to determine the organization of overlapping maize YAC clones within a contiguous fragment (contigs). Key words: maize, repetitive DNA, YACs.

12.
Telomeres are specialized natural ends of eukaryotic chromosomes that, contrary to the ends of broken chromosomes, are stable and do not fuse with the ends of other chromosomes. In addition, telomeres protect chromosomal ends from degradation, facilitate completion of chromosomal DNA replication, and contribute to chromosome positioning within nuclei. Telomeric DNA consists of repetitive sequences and specific associated proteins, including the telomere repeat-binding factors TRF1 and TRF2. A lack of TRF2 enables end-to-end chromosome fusion. A structural disruption of telomeres not only causes chromosomal mechanical instability but also activates a programmed cell death cascade.

13.
Phage display technology provides a versatile tool for exploring the interactions between proteins, peptides and small molecule ligands. Quantitative analysis of peptide population sequence diversity and bias patterns has the power to significantly enhance the impact of these methods [1, 2]. We have developed a suite of computational tools for the analysis of peptide populations and made them accessible by integrating fifteen software programs for the analysis of combinatorial peptide sequences into the REceptor LIgand Contacts (RELIC) relational database and web-server. These programs have been developed for the analysis of statistical properties of peptide populations; identification of weak consensus sequences within these populations; and the comparison of these peptide sequences to those of naturally occurring proteins. RELIC is particularly suited to the analysis of peptide populations affinity selected with a small molecule ligand such as a drug or metabolite. Within this functional context, the ability to identify potential small molecule binding proteins using combinatorial peptide screening will accelerate as more ligands are screened and more genome sequences become available. The broader impact of this work is the addition of a novel means of analyzing peptide populations to the phage display community.

14.
D Tautz  M Renz 《Nucleic acids research》1984,12(10):4127-4138
Simple sequences are stretches of DNA which consist of only one, or a few tandemly repeated nucleotides, for example poly (dA) X poly (dT) or poly (dG-dT) X poly (dC-dA). These two types of simple sequence have been shown to be repetitive and interspersed in many eukaryotic genomes. Several other types have been found by sequencing eukaryotic DNA. In this report we have undertaken a systematic survey for simple sequences. We hybridized synthetic simple sequence DNA to genome blots of phylogenetically different organisms. We found that many, probably even all possible types of simple sequence are repetitive components of eukaryotic genomes. We propose therefore that they arise by common mechanisms, namely slippage replication and unequal crossover, and that they might have no general function with regard to gene expression. This latter inference is supported by the fact that we have detected simple sequences only in the metabolically inactive micronucleus of the protozoan Stylonychia, but not in the metabolically active macronucleus which is derived from the micronucleus by chromosome diminution.

15.
We investigated the sites of integration of exogenous DNA fragments introduced by DNA-mediated gene transfer. Mouse Ltk- cells were transformed with the herpes simplex virus thymidine kinase gene and pBR322 DNA by the calcium phosphate precipitation method. Some of the integrated exogenous DNA sequences were recovered from the stable tk+ transformants in the form of plasmids that were capable of propagation in bacteria. Four plasmids derived from two cloned cell lines were analyzed in detail by nucleotide sequencing and hybridization techniques. These plasmids contained a total of seven cellular-exogenous DNA junctions. In all cases, there was no sequence homology between the exogenous and cellular DNA sequences adjacent to the joining sites, and no specific exogenous or cellular sequences occurred at the junctions. Rearrangement or deletion of Ltk- DNA was always associated with the integration of exogenous DNA. All of the assignable cellular sequences at the junctions were repetitive sequences. Two of these sequences were from the MIF-1 repetitive sequence family, and a third consisted of a 40-base pair simple copolymer of alternating deoxyadenosine-deoxythymidine. Our results suggest that repetitive sequences are relatively favorable sites for the integration of exogenous DNA.

16.
Common fragile sites (CFSs) are inherently unstable genomic loci that are recurrently altered in human tumor cells. Despite their instability, CFSs are ubiquitous throughout the human genome and associated with large tumor suppressor genes or oncogenes. CFSs are enriched with repetitive DNA sequences, one feature postulated to explain why these loci are inherently difficult to replicate, and sensitive to replication stress. We have shown that specialized DNA polymerases (Pols) η and κ replicate CFS-derived sequences more efficiently than the replicative Pol δ. However, we lacked an understanding of how these enzymes cooperate to ensure efficient CFS replication. Here, we designed a model of lagging strand replication with RFC loaded PCNA that allows for maximal activity of the four-subunit human Pol δ holoenzyme, Pol η, and Pol κ in polymerase mixing assays. We discovered that Pol η and κ are both able to exchange with Pol δ stalled at repetitive CFS sequences, enhancing Normalized Replication Efficiency. We used this model to test the impact of PCNA mono-ubiquitination on polymerase exchange, and found no change in polymerase cooperativity in CFS replication compared with unmodified PCNA. Finally, we modeled replication stress in vitro using aphidicolin and found that Pol δ holoenzyme synthesis was significantly inhibited in a dose-dependent manner, preventing any replication past the CFS. Importantly, Pol η and κ were still proficient in rescuing this stalled Pol δ synthesis, which may explain, in part, the CFS instability phenotype of aphidicolin-treated Pol η and Pol κ-deficient cells. In total, our data support a model wherein Pol δ stalling at CFSs allows for free exchange with a specialized polymerase that is not driven by PCNA.

17.
The binding of nuclear proteins from Drosophila melanogaster embryos to simple homopolymeric DNA sequences was studied. Nuclear proteins were electrophoresed, transferred onto nitrocellulose and incubated with labelled synthetic homopolymers or a natural fragment containing simple sequences. Several protein bands were found in the 65-72 kDa region, which specifically bind both poly [(dG-dT).(dA-dC)] and a natural fragment containing 40 bp of this sequence. These proteins do not bind to homopolymers poly [(dA).(dT)] and poly [(dG-dA).(dC-dT)], or other foreign DNAs.

18.
19.
The role of primary amino acid sequences in protein pharmacokinetics, an issue of relevance in both basic knowledge and biotechnology, was addressed here using as a starting point two repetitive antigens from the hemoflagellate Trypanosoma cruzi that are known to stabilize their associated proteins in the bloodstream. A major drawback to their pharmacological application is that these repetitive sequences are highly immunogenic, so removing this characteristic is desirable. Based on sequence homology and epitope mapping analyses, an artificial repetitive sequence (PSTAD) was engineered. This motif was tested by genetic fusion to the C terminus of both the trypanosomal trans-sialidase and the rat tyrosine aminotransferase and found to produce a 4.5-6-fold increase in the half-life of the associated proteins in blood while displaying significantly lower immunogenicity. Residues involved in the stabilizing properties of the novel peptide were mapped by a site-directed mutagenesis approach, allowing us to successfully identify another two motifs. Searching databases for sequences displaying some homology, embedded in proline frameworks and associated with shed virulence factors from unrelated microorganisms, resulted in the identification of four other protein extensions. Remarkably, three of them (from Streptococcus pneumoniae, Actinomyces viscosus, and Escherichia coli) revealed similar pharmacokinetic features, suggesting therefore an analogous evolutionarily acquired mechanism to ensure the biodistribution of their corresponding proteins. Our findings indicate that the insertion of defined motifs into a proline-rich framework constitutes a suitable alternative to construct a chimeric protein with extended half-life in blood.

20.