Similar articles
Found 20 similar articles; search took 156 ms
1.
2.
Chemokines are small secreted proteins with important roles in immune responses. They consist of a conserved three-dimensional (3D) structure, so-called IL8-like chemokine fold, which is supported by disulfide bridges characteristic of this protein family. Sequence- and profile-based computational methods have been proficient in discovering novel chemokines by making use of their sequence-conserved cysteine patterns. However, it has been recently shown that some chemokines escaped annotation by these methods due to low sequence similarity to known chemokines and to different arrangement of cysteines in sequence and in 3D. Innovative methods overcoming the limitations of current techniques may allow the discovery of new remote homologs in the still functionally uncharacterized fraction of the human genome. We report a novel computational approach for proteome-wide identification of remote homologs of the chemokine family that uses fold recognition techniques in combination with a scaffold-based automatic mapping of disulfide bonds to define a 3D profile of the chemokine protein family. By applying our methodology to all currently uncharacterized human protein sequences, we have discovered two novel proteins that, without having significant sequence similarity to known chemokines or characteristic cysteine patterns, show strong structural resemblance to known anti-HIV chemokines. Detailed computational analysis and experimental structural investigations based on mass spectrometry and circular dichroism support our structural predictions and highlight several other chemokine-like features. The results obtained support their functional annotation as putative novel chemokines and encourage further experimental characterization. The identification of remote homologs of human chemokines may provide new insights into the molecular mechanisms causing pathologies such as cancer or AIDS, and may contribute to the development of novel treatments. 
Besides, the genome-wide applicability of our methodology based on 3D protein family profiles may open up new possibilities for improving and accelerating protein function annotation processes.

3.
Shih CH, Chang CM, Lin YS, Lo WC, Hwang JK. Proteins, 2012, 80(6): 1647-1657
The knowledge of conserved sequences in proteins is valuable in identifying functionally or structurally important residues. Generating the conservation profile of a sequence requires aligning families of homologous sequences and having knowledge of their evolutionary relationships. Here, we report that the conservation profile at the residue level can be quantitatively derived from a single protein structure with only backbone information. We found that the reciprocal packing density profiles of protein structures closely resemble their sequence conservation profiles. For a set of 554 nonhomologous enzymes, 74% (408/554) of the proteins have a correlation coefficient > 0.5 between these two profiles. Our results indicate that the three-dimensional structure, instead of being a mere scaffold for positioning amino acid residues, exerts such strong evolutionary constraints on the residues of the protein that its profile of sequence conservation essentially reflects that of its structural characteristics.
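
The comparison described in this abstract can be illustrated with a rough sketch: compute a per-residue packing density from C-alpha coordinates, take its reciprocal, and correlate the result with a conservation profile. The contact-count definition of packing density, the 10 Å cutoff, and all function names below are assumptions made for illustration; the abstract does not specify the authors' exact measure.

```python
import math


def packing_density(ca_coords, cutoff=10.0):
    """For each residue, count the other C-alpha atoms within `cutoff` angstroms.

    This simple contact count is an assumed stand-in for packing density.
    """
    counts = []
    for i, a in enumerate(ca_coords):
        n = sum(1 for j, b in enumerate(ca_coords)
                if j != i and math.dist(a, b) <= cutoff)
        counts.append(n)
    return counts


def reciprocal_profile(density):
    """Return 1/density per residue; an isolated residue has no defined value."""
    if any(d == 0 for d in density):
        raise ValueError("residue with zero neighbours; enlarge the cutoff")
    return [1.0 / d for d in density]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)
```

Given a list of C-alpha coordinates and a per-residue conservation score of the same length, `pearson(reciprocal_profile(packing_density(coords)), conservation)` yields the kind of correlation coefficient the abstract reports.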

4.
J C Badciong, J M Otto, G L Waring. Genetics, 2001, 159(3): 1089-1102
The Drosophila dec-1 gene encodes multiple proteins that are required for female fertility and proper eggshell morphogenesis. Genetic and immunolocalization data suggest that the different DEC-1 proteins are functionally distinct. To identify regions within the proteins with potential biological significance, we cloned and sequenced the D. yakuba and D. virilis dec-1 homologs. Interspecies comparisons of the predicted translation products revealed rapidly evolving sequences punctuated by blocks of conserved amino acids. Despite extensive amino acid variability, the proteins produced by the different dec-1 homologs were functionally interchangeable. The introduction of transgenes containing either the D. yakuba or the D. virilis dec-1 open reading frames into a D. melanogaster DEC-1 protein null mutant was sufficient to restore female fertility and wild-type eggshell morphology. Normal expression and extracellular processing of the DEC-1 proteins were correlated with the phenotypic rescue. The nature of the conserved features highlighted by the evolutionary comparison and the molecular resemblance of some of these features to those found in other extracellular proteins suggest functional correlates for some of the multiple DEC-1 derivatives.

5.
6.
7.
8.
The genus Rickettsia consists of intracellular bacteria that cause a variety of arthropod-vectored human diseases. I have examined the evolutionary processes that are generating variation in antigens that are potential vaccine candidates. The surface proteins rOmpA and rOmpB are subject to intense positive natural selection, causing rapid diversification of their amino acid sequences between species. The positively selected amino acids were mapped and cluster together in regions that may indicate the location of functionally important regions such as epitopes. In contrast to the rOmp antigens, there is no evidence of positive selection on the intracytoplasmic antigen PS120 despite low selective constraints on this gene. All three genes showed evidence of recombination between species, and certain sequences are clear chimeras of two parental sequences. However, recombination has been sufficiently infrequent that the phylogenies of the three genes are similar, although not identical. [Reviewing Editor: Dr. Willie J. Swanson]

9.
Obtaining novel zinc finger protein genes by RACE combined with cDNA library screening   Cited by: 6 (self-citations: 1; citations by others: 5)
Du Zhanwen, Liu Liren, Zhang Junwu. Hereditas (Beijing) [遗传], 2002, 24(3): 329-331
Most functionally important proteins contain corresponding functional domains composed of conserved amino acid sequences. In this work, degenerate primers were first designed from the conserved amino acid sequence of a protein functional domain, and gene EST sequences were amplified by PCR. A modified rapid amplification of cDNA ends (RACE) method was then used to amplify the nonhomologous portion of each gene from a cDNA library, and the nonhomologous sequence was used as a probe to screen the cDNA library. With this method, several novel full-length cDNAs that encode zinc finger proteins and represent the original ESTs were successfully cloned from a human bone marrow cDNA library. This strategy should also be applicable to screening for genes that encode proteins with other sequence-conserved functional domains.

Abstract: Most functionally important proteins contain corresponding functional domains that consist of conserved amino acid sequences. This study provides a method to identify novel genes that encode proteins containing functionally important domains with conserved sequences. First, primers were designed according to the sequence of the cDNA library vector and the ESTs that had been obtained using reverse PCR and degenerate primers encoding the zinc finger domain. The cDNA library DNA was used as the template for PCR amplification. The amplified fragment, which contains nonhomologous sequences of the cDNA, was inserted into the pGEM-T Easy vector. The fragment was recovered and used as a probe for screening the cDNA library. Several full-length cDNAs that encode proteins with a zinc finger domain and represent the original ESTs were successfully cloned from a human bone marrow cDNA library. This strategy can also be used to screen for genes that encode proteins containing other functional domains with conserved sequences.

10.
ADP-ribosylation factors (ARFs) are ~20-kDa guanine nucleotide-binding proteins that are allosteric activators of the NAD:arginine ADP-ribosyltransferase activity of cholera toxin and appear to play a role in intracellular vesicular trafficking. Although the physiological roles of these proteins have not been defined, it has been presumed that each has a specific intracellular function. To obtain genetic evidence that each ARF is under evolutionary pressure to maintain its structure, and presumably function, rat ARF cDNA clones were isolated and their nucleotide and deduced amino acid sequences were compared to those of other mammalian ARFs. Deduced amino acid sequences for rat ARFs 1, 2, 3, 5 and 6 were identical to those of the known cognate human and bovine ARFs; rat ARF4 was 96% identical to human ARF4. Nucleotide sequences of both the untranslated and the coding regions were highly conserved. These results indicate that the ARF proteins are, as a family, extraordinarily well conserved across mammalian species. The unusually high degree of conservation of the untranslated regions is consistent with these regions having important regulatory roles and with individual ARFs containing structurally unique elements required for specific functions.

11.
Signal-sequence-coding regions for protein export were selected from chromosomal Bacillus subtilis DNA. The number of different signals obtained was higher than expected on the basis of known exported proteins in B. subtilis.

Most of the selected regions showed the characteristics of typical signal sequences, including a basic N-terminal region followed by a hydrophobic core and a potential signal-peptidase cleavage site.

The signal-coding regions were functionally interchangeable between the B. licheniformis α-amylase and Escherichia coli TEM β-lactamase genes. In addition to the signal-sequence-coding regions, the nature of the host cells and the mature parts of the reporter proteins contributed to the amounts of protein secreted.


12.
Replication factor C (RFC) is a five-subunit DNA polymerase accessory protein that functions as a structure-specific, DNA-dependent ATPase. The ATPase function of RFC is activated by proliferating cell nuclear antigen. RFC was originally purified from human cells on the basis of its requirement for simian virus 40 DNA replication in vitro. A functionally homologous protein complex from Saccharomyces cerevisiae, called ScRFC, has been identified. Here we report the cloning, either by peptide sequencing or by sequence similarity to the human cDNAs, of the S. cerevisiae genes RFC1, RFC2, RFC3, RFC4, and RFC5. The amino acid sequences are highly similar to the sequences of the homologous human RFC 140-, 37-, 36-, 40-, and 38-kDa subunits, respectively, and also show amino acid sequence similarity to functionally homologous proteins from Escherichia coli and the phage T4 replication apparatus. All five subunits show conserved regions characteristic of ATP/GTP-binding proteins and also have a significant degree of similarity to one another. We have identified eight segments of conserved amino acid sequences that define a family of related proteins. Despite their high degree of sequence similarity, all five RFC genes are essential for cell proliferation in S. cerevisiae. RFC1 is identical to CDC44, a gene identified as a cell division cycle gene encoding a protein involved in DNA metabolism. CDC44/RFC1 is known to interact genetically with the gene encoding proliferating cell nuclear antigen, confirming previous biochemical evidence of their functional interaction in DNA replication.

13.
Shpakov AO. Tsitologiia, 2001, 43(5): 491-500
A mirror-type internal symmetry was found in the genes alg5, alg8 and swp1 of Saccharomyces cerevisiae, gpt of Schizosaccharomyces pombe, and the human gene alg6, all of which encode enzymes of the dolichol cycle. The symmetry was detected both in the complete nucleotide sequences and in the sequences formed by the first, second and third bases of codons. In the coding regions of the genes, the density of single- and double-point centres of internal symmetry was higher for the second-base sequences than for the first- and third-base sequences, whereas in the noncoding regions the degree of symmetry of the first-, second- and third-base sequences did not differ significantly. A clear positive correlation was found between the distribution of internal symmetry in the second-base sequences of the genes and that in the amino acid sequences they encode. The maximum internal symmetry of gene segments encoding functionally important regions of proteins was found at the level of the second-base sequences. These results corroborate the hypothesis that the second bases of codons play a determining role in encoding amino acid residues. This investigation of internal symmetry in nucleotide sequences shows for the first time that internal symmetry exists at the level of gene primary structure.
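
One possible reading of the analysis in this abstract is sketched below: split a coding sequence into its first-, second- and third-base sequences, then look for centres around which the sequence reads the same in both directions. Treating a "single-point" centre as one lying on a base and a "double-point" centre as one lying between two bases, as well as the `min_arm` threshold and every function name, are assumptions made for illustration; the abstract does not define these terms.

```python
def codon_position_sequence(cds, position):
    """Return the bases at codon position 1, 2 or 3 of an in-frame coding sequence."""
    if position not in (1, 2, 3):
        raise ValueError("position must be 1, 2 or 3")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return cds[position - 1:usable:3].upper()


def mirror_arm_length(seq, left, right):
    """Extend outward from (left, right) while the two flanking bases match."""
    arm = 0
    while left >= 0 and right < len(seq) and seq[left] == seq[right]:
        arm += 1
        left -= 1
        right += 1
    return arm


def symmetry_centres(seq, min_arm=3):
    """List (kind, index, arm) for centres with mirror arms of at least `min_arm` bases."""
    centres = []
    for i in range(len(seq)):
        arm = mirror_arm_length(seq, i - 1, i + 1)  # centre lies on base i
        if arm >= min_arm:
            centres.append(("single", i, arm))
        arm = mirror_arm_length(seq, i, i + 1)  # centre lies between i and i+1
        if arm >= min_arm:
            centres.append(("double", i, arm))
    return centres
```

Under these assumptions, dividing `len(symmetry_centres(s))` by `len(s)` for each codon-position sequence `s` would give a density that can be compared across the three positions.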

14.
Do myc, fos and E1A function as protein phosphatase inhibitors?   Cited by: 1 (self-citations: 0; citations by others: 1)
The oncogenic proteins myc, fos and E1A bear a striking resemblance to protein phosphatase inhibitors 1 and 2. Both sets of proteins possess several regions rich in proline (P), glutamic acid (E), serine (S) and threonine (T). In addition to PEST sequences, four of the five proteins contain clusters of arginine-arginine pairs. On the basis of these similarities, I suggest that myc, fos and E1A are protein phosphatase inhibitors.
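
The two sequence features named in this abstract can be screened for with a short sketch: windows rich in P, E, S and T, and arginine-arginine pairs. The window width, the 50% threshold, and the function names are illustrative assumptions; this is a simple composition count, not an established PEST-scoring algorithm.

```python
def pest_fraction(window):
    """Fraction of residues in `window` that are P, E, S or T."""
    return sum(window.count(aa) for aa in "PEST") / len(window)


def pest_rich_windows(seq, width=12, threshold=0.5):
    """Return start indices of windows whose P/E/S/T fraction is >= threshold."""
    seq = seq.upper()
    return [i for i in range(len(seq) - width + 1)
            if pest_fraction(seq[i:i + width]) >= threshold]


def count_rr_pairs(seq):
    """Count arginine-arginine dipeptides, allowing overlaps (RRR counts as 2)."""
    seq = seq.upper()
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == "RR")
```

Adjacent hits from `pest_rich_windows` can be merged into regions, and the positions of RR pairs inspected for clustering.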

15.
The cDNA clones encoding two chicken Deformed (Dfd) family homeobox-containing genes, Chox-1.4 and Chox-a, were isolated. Comparison of their amino acid sequences with another chicken Dfd family homeodomain protein and with those of mouse homologues revealed that strong homologies are located in the amino-terminal regions and around the homeodomains. Although homologies in other regions were relatively low, some short conserved sequences were also identified. E. coli-made full-length proteins were purified and used for the production of specific antibodies and for DNA binding studies. The binding profiles of these proteins to the 5'-leader and 5'-upstream sequences of the Chox-1.4 and Chox-a coding regions were analyzed by immunoprecipitation and DNase I footprint assays. These two Chox proteins bound to the same sites in the 5'-flanking sequences of their coding regions with various affinities, and their binding affinities to each site were nearly the same. The consensus sequences of the high and low affinity binding sites were TAATGA(C/G) and CTAATTTT, respectively. A clustered binding site was identified in the 5'-upstream region of the Chox-a gene, suggesting that this clustered binding site works as a cis-regulatory element for auto- and/or cross-regulation of Chox-a gene expression.
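
The two consensus sequences reported in this abstract, TAATGA(C/G) and CTAATTTT, translate directly into regular expressions, so a DNA sequence can be scanned for candidate sites as in the sketch below. Only the strand supplied is searched, and the function and constant names are illustrative assumptions; a match indicates a sequence hit, not demonstrated binding.

```python
import re

# Lookahead patterns so that overlapping occurrences are all reported.
HIGH_AFFINITY = re.compile(r"(?=(TAATGA[CG]))")
LOW_AFFINITY = re.compile(r"(?=(CTAATTTT))")


def find_sites(seq, pattern):
    """Return (start, matched_sequence) for every occurrence of `pattern` in `seq`."""
    seq = seq.upper()
    return [(m.start(), m.group(1)) for m in pattern.finditer(seq)]
```

For example, `find_sites(upstream, HIGH_AFFINITY)` lists 0-based start positions in a hypothetical `upstream` string; closely spaced hits would correspond to a clustered site of the kind the abstract describes.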

16.
With the advent of proteomics, there is an increasing need for tools for predicting the properties of large numbers of proteins by using the information provided by their amino acid sequences, even in the absence of the knowledge of their structures. One of the most important types of predictions concerns whether proteins will fold or aggregate. Here, we study the competition between these two processes by analyzing the relationship between the folding and aggregation propensity profiles for the human and Escherichia coli proteomes. These profiles are calculated, respectively, using the CamFold method, which we introduce in this work, and the Zyggregator method. Our results indicate that the kinetic behavior of proteins is, to a large extent, determined by the interplay between regions of low folding and high aggregation propensities.

17.
1. To determine if a relationship exists between vertebrate vitellogenins and mammalian plasma proteins, the EMBL and NBRF computer databases were searched with two partial amino acid sequences from Xenopus laevis and Gallus gallus vitellogenin. 2. A significant relationship was found between vitellogenin and human apolipoprotein B-100 genes, and confirmed using homology-determination programs. 3. Further analysis shows that unique multiple proline consensus regions found in apolipoprotein B-100 are significantly similar to proline-dominant regions in vitellogenin. 4. This work suggests that these proteins are functionally and structurally related and should be categorized as a functional group of hepatic lipid transport and metabolism proteins.

18.
In Escherichia coli, efficient mutagenesis by UV requires the umuDC operon. A deficiency in umuDC activity is believed to be responsible for the relatively weak UV mutability of Salmonella typhimurium LT2 compared with that of E. coli. To begin evaluating this hypothesis and the evolutionary relationships among umuDC-related sequences, we cloned and sequenced the S. typhimurium umuDC operon. S. typhimurium umuDC restored mutability to umuD and umuC mutants of E. coli. DNA sequence analysis of 2,497 base pairs (bp) identified two nonoverlapping open reading frames spanning 1,691 bp that were 67 and 72% identical at the nucleotide sequence level to the umuD and umuC sequences, respectively, from E. coli. The sequences encoded proteins whose deduced primary structures were 73 and 84% identical to the E. coli umuD and umuC gene products, respectively. The two bacterial umuDC sequences were more similar to each other than to mucAB, a plasmid-borne umuDC homolog. The umuD product retained the Cys-24--Gly-25, Ser-60, and Lys-97 amino acid residues believed to be critical for RecA-mediated proteolytic activation of UmuD. The presence of a LexA box 17 bp upstream from the UmuD initiation codon suggests that this operon is a member of an SOS regulon. Mu d-P22 inserts were used to locate the S. typhimurium umuDC operon to a region between 35.9 and 40 min on the S. typhimurium chromosome. In E. coli, umuDC is located at 26 min. The umuDC locus in S. typhimurium thus appears to be near one end of a chromosomal inversion that distinguishes gene order in the 25- to 35-min regions of the E. coli and S. typhimurium chromosomes. It is likely, therefore, that the umuDC operon was present in a common ancestor before S. typhimurium and E. coli diverged approximately 150 million years ago. These results provide new information for investigating the structure, function, and evolutionary origins of umuDC and for exploring the genetic basis for the mutability differences between S. typhimurium and E. coli.

19.
Profile comparison methods have been shown to be very powerful in creating accurate alignments of protein sequences, especially in the case of remotely related proteins (RRP). These methods take advantage of the observation that hydrophobic profiles are more conserved than the corresponding amino acid sequences. Here, we present the PROFALIGN algorithm, which allows one to perform a detailed comparative analysis, at both local and global levels, of two protein sequence profiles. The user can either choose among four different hydrophobic scales (Miyazawa-Jernigan, Eisenberg, Engelman-Steitz, and Kyte-Doolittle) or add a personal scale. The interface is designed for a wide range of users, including those who are not involved in protein research. It allows one to vary the alignment parameters (such as gap penalties, embedding, and profile smoothness). Secondary structure propensity is added as an optional alignment filter. Similar segments of two proteins are singled out on the basis of score. We have tested the algorithm with different Src homology 3 (SH3) domain fragments sharing low sequence homology but very similar three-dimensional (3D) structures. By using the Miyazawa-Jernigan hydrophobic scale, PROFALIGN was able to detect the strong correlation between the regions that are known to be crucial for SH3 transition state topology. PROFALIGN seems able to identify most of the mutual alignment of structures on the basis of their hydrophobic profiles, delimiting the regions containing the key determinants of folding. Therefore, the present methodology may be useful for the detection of the most structurally relevant positions inside remotely related proteins.
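
The basic idea behind hydrophobic profile comparison can be sketched as follows: map each residue to a hydropathy value, smooth with a sliding window, and correlate two profiles of equal length. This is not the PROFALIGN algorithm, which also handles gaps, embedding, and secondary structure filtering; it is a minimal, ungapped illustration using the Kyte-Doolittle scale, and the window size and function names are assumptions.

```python
import math

# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq, window=5):
    """Sliding-window mean hydropathy; one value per full window."""
    values = [KYTE_DOOLITTLE[aa] for aa in seq.upper()]
    return [sum(values[i:i + window]) / window
            for i in range(len(values) - window + 1)]


def profile_correlation(p, q):
    """Pearson correlation of two equal-length, non-constant profiles."""
    if len(p) != len(q):
        raise ValueError("profiles must have the same length")
    n = len(p)
    mp, mq = sum(p) / n, sum(q) / n
    cov = sum((a - mp) * (b - mq) for a, b in zip(p, q))
    sp = math.sqrt(sum((a - mp) ** 2 for a in p))
    sq = math.sqrt(sum((b - mq) ** 2 for b in q))
    return cov / (sp * sq)
```

Two sequences with little residue identity can still give a high `profile_correlation` when hydrophobic and polar stretches fall at the same positions, which is the observation the abstract builds on.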

20.
Bacterial populations common to healthy human guts may play important roles in human health. A new strategy for discovering genomic sequences as markers for these bacteria was developed using Enterobacterial Repetitive Intergenic Consensus (ERIC)-PCR fingerprinting. Structural features within microbial communities are compared with ERIC-PCR followed by DNA hybridization to identify genomic fragments shared by samples from healthy human individuals. ERIC-PCR profiles of fecal samples from 12 diseased or healthy human and piglet subjects demonstrated stable, unique banding patterns for each individual tested. Sequence homology of DNA fragments in bands of identical size was examined between samples by hybridization under high stringency conditions with DIG-labeled ERIC-PCR products derived from the fecal sample of one healthy child. Comparative analysis of the hybridization profiles with the original agarose fingerprints identified three predominant bands as signatures for populations associated with healthy human guts with sizes of 500, 800 and 1000 bp. Clone library profiling of the three bands produced 17 genome fragments, three of which showed high similarity only with regions of the Bacteroides thetaiotaomicron genome, while the remainder were orphan sequences. Association of these sequences with healthy guts was validated by sequence-selective PCR experiments, which showed that a single fragment was present in all 32 healthy humans and 13 healthy piglets tested. Two fragments were present in the healthy human group and in 18 children with non-infectious diarrhea but not in eight children with infectious diarrhea. Genome fragments identified with this novel strategy may be used as genome-specific markers for dynamic monitoring and sequence-guided isolation of functionally important bacterial populations in complex communities such as human gut microflora.
