期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

ApicoAlign: an alignment and sequence search tool for apicomplexan proteins

Ali J Paila U Ranjan A 《BMC genomics》2011,12(Z3):S6

相似文献

2.

A search for patterns in the nucleotide sequence of the MS2 genome

John W. Erickson Gary G. Altman 《Journal of mathematical biology》1979,7(3):219-230

Summary The nucleotide sequence of the RNA of the bacteriophage MS2 was examined by computer for internal patterns. We used a technique which analyzes a nucleotide sequence as a Markov chain. This led us to discover patterns within the translated and untranslated regions of the RNA in addition to those patterns formed by the codons. One of the more surprising results of this analysis was the discovery that the non-coding sequences in the genome are as highly ordered, although in a different sense, as the genes themselves. Also of interest was the discovery that the codon frequency distributions for the three genes are similar. 相似文献

3.

Parameters of proteome evolution from histograms of amino-acid sequence identities of paralogous proteins

Jacob Bock Axelsen Koon-Kiu Yan Sergei Maslov 《Biology direct》2007,2(1):32-19

Background

The evolution of the full repertoire of proteins encoded in a given genome is mostly driven by gene duplications, deletions, and sequence modifications of existing proteins. Indirect information about relative rates and other intrinsic parameters of these three basic processes is contained in the proteome-wide distribution of sequence identities of pairs of paralogous proteins. 相似文献

4.

Exploring the sequence patterns in the alpha-helices of proteins

Wang J Feng JA 《Protein engineering》2003,16(11):799-807

This paper reports an extensive sequence analysis of the alpha-helices of proteins. alpha-Helices were extracted from the Protein Data Bank (PDB) and were divided into groups according to their sizes. It was found that some amino acids had differential propensity values for adopting helical conformation in short, medium and long alpha-helices. Pro and Trp had a significantly higher propensity for helical conformation in short helices than in medium and long helices. Trp was the strongest helix conformer in short helices. Sequence patterns favoring helical conformation were derived from a neighbor-dependent sequence analysis of proteins, which calculated the effect of neighboring amino acid type on the propensity of residues for adopting a particular secondary structure in proteins. This method produced an enhanced statistical significance scale that allowed us to explore the positional preference of amino acids for alpha-helical conformations. It was shown that the amino acid pair preference for alpha-helix had a unique pattern and this pattern was not always predictable by assuming proportional contributions from the individual propensity values of the amino acids. Our analysis also yielded a series of amino acid dyads that showed preference for alpha-helix conformation. The data presented in this study, along with our previous study on loop sequences of proteins, should prove useful for developing potential 'codes' for recognizing sequence patterns that are favorable for specific secondary structural elements in proteins. 相似文献

5.

Motifer, a search tool for finding amino acid sequence patterns from nucleotide sequence databases.

H J?rnvall 《FEBS letters》1999,456(1):85-88

Motifer is a software tool able to find directly in nucleotide databases very distant homologues to an amino acid query sequence. It focuses searches on a specific amino acid pattern, scoring the matching and intervening residues as specified by the user. The program has been developed for searching databases of expressed sequence tags (ESTs), but it is also well suited to search genomic sequences. The query sequence can be a variable pattern with alternative amino acids or gaps and the sequences searched can contain introns or sequencing errors with accompanying frame shifts. Other features include options to generate a searchable output, set the maximal sequencing error frequency, limit searches to given species, or exclude already known matches. Motifer can find sequence homologues that other search algorithms would deem unrelated or would not find because of sequencing errors or a too large number of other homologues. The ability of Motifer to find relatives to a given sequence is exemplified by searches for members of the transforming growth factor-beta family and for proteins containing a WW-domain. The functions aimed at enhancing EST searches are illustrated by the 'in silico' cloning of a novel cytochrome P450 enzyme. 相似文献

6.

Gramicidin S: the sequence of the amino-acid residues

Consden R Gordon AH Martin AJ 《The Biochemical journal》1947,41(4):596-602

相似文献

7.

'Multifrequency' location and clustering of sequence patterns from proteins

Ollivier Emmanuelle; Soldano Henry; Viari Alain 《Bioinformatics (Oxford, England)》1991,7(1):31-38

In previous work, we have shown that a set of characteristics,defined as (code frequency) pairs, can be derived from a proteinfamily by the use of a signal-processing method. This methodenables the location and extraction of sequence patterns bytaking into account each (code frequency) pair individually.In the present paper, we propose to extend this method in orderto detect and visualize patterns by taking into account severalpairs simultaneously. Two ‘multifrequency’ methodsare described. The first one is based on a rewriting of thesequences with new symbols which summarize the frequency information.The second method is based on a clustering of the patterns associatedwith each pair. Both methods lead to the definition of significantconsensus sequences. Some results obtained with calcium-bindingproteins and serine proteases are also discussed. Received on March 6, 1990; accepted on September 24, 1990 相似文献

8.

A Bayesian system integrating expression data with sequence patterns for localizing proteins: comprehensive application to the yeast genome

Drawid A Gerstein M 《Journal of molecular biology》2000,301(4):1059-1075

We develop a probabilistic system for predicting the subcellular localization of proteins and estimating the relative population of the various compartments in yeast. Our system employs a Bayesian approach, updating a protein's probability of being in a compartment, based on a diverse range of 30 features. These range from specific motifs (e.g. signal sequences or the HDEL motif) to overall properties of a sequence (e.g. surface composition or isoelectric point) to whole-genome data (e.g. absolute mRNA expression levels or their fluctuations). The strength of our approach is the easy integration of many features, particularly the whole-genome expression data. We construct a training and testing set of approximately 1300 yeast proteins with an experimentally known localization from merging, filtering, and standardizing the annotation in the MIPS, Swiss-Prot and YPD databases, and we achieve 75 % accuracy on individual protein predictions using this dataset. Moreover, we are able to estimate the relative protein population of the various compartments without requiring a definite localization for every protein. This approach, which is based on an analogy to formalism in quantum mechanics, gives better accuracy in determining relative compartment populations than that obtained by simply tallying the localization predictions for individual proteins (on the yeast proteins with known localization, 92% versus 74%). Our training and testing also highlights which of the 30 features are informative and which are redundant (19 being particularly useful). After developing our system, we apply it to the 4700 yeast proteins with currently unknown localization and estimate the relative population of the various compartments in the entire yeast genome. An unbiased prior is essential to this extrapolated estimate; for this, we use the MIPS localization catalogue, and adapt recent results on the localization of yeast proteins obtained by Snyder and colleagues using a minitransposon system. Our final localizations for all approximately 6000 proteins in the yeast genome are available over the web at: http://bioinfo.mbb.yale. edu/genome/localize. 相似文献

9.

Using information theory to search for co-evolving residues in proteins 总被引：2，自引：0，他引：2

Martin LC Gloor GB Dunn SD Wahl LM 《Bioinformatics (Oxford, England)》2005,21(22):4116-4124

MOTIVATION: Some functionally important protein residues are easily detected since they correspond to conserved columns in a multiple sequence alignment (MSA). However important residues may also mutate, with compensatory mutations occurring elsewhere in the protein, which serve to preserve or restore functionality. It is difficult to distinguish these co-evolving sites from other non-conserved sites. RESULTS: We used Mutual Information (MI) to identify co-evolving positions. Using in silico evolved MSAs, we examined the effects of the number of sequences, the size of amino acid alphabet and the mutation rate on two sources of background MI: finite sample size effects and phylogenetic influence. We then assessed the performance of various normalizations of MI in enhancing detection of co-evolving positions and found that normalization by the pair entropy was optimal. Real protein alignments were analyzed and co-evolving isolated pairs were often found to be in contact with each other. AVAILABILITY: All data and program files can be found at http://www.biochem.uwo.ca/cgi-bin/CDD/index.cgi 相似文献

10.

A search tool for identification and analysis of conserved sequence patterns in Saccharomyces spp. orthologous promoter

Kohli DK Srikanth CV Bachhawat AK 《In silico biology》2004,4(4):411-415

We describe a web-based resource to identify, search and analyze sequence patterns conserved in the multiple sequence alignments of orthologous promoters from closely related / distant Saccharomyces spp. The webtool interfaces with a database where conserved sequence patterns (greater than 4 bp) have been previously extracted from genome-wide promoter alignments, allowing one to carry out user-defined genome-wide searches for conserved sequences to assist in the discovery of novel promoter elements based on comparative genomics. The web-based server can be accessed at http://www2.imtech.res.in/ anand/sacch_prom_pat.html. 相似文献

11.

Developing a mathematical method to search for latent periodicity in protein amino-acid sequences with deletions and insertions

E. V. Korotkov M. A. Korotkova 《Biophysics》2015,60(6):876-885

A mathematical method has been developed in order to search for latent periodicity in protein amino-acid and other symbolical sequences using dynamic programming and random matrices. The method allows the detection of the latent periodicity with insertions and deletions at positions that are unknown beforehand. The developed method has been applied to search for the periodicity in the amino-acid sequences of several proteins and in the euro/dollar exchange rate since 2001. The presence of a long period with insertions and deletions in amino-acid sequences is shown. The period length of seven amino acids is observed in the proteins that contain supercoiled regions (a coiled-coil structure) as well as of six, five, or more amino acids. The existence of the period length of 6 and 7 days, as well as 24 and 25 h in the analyzed financial time series is observed; note that this periodicity is detectable only for insertions and deletions. The causes that underlie the occurrence of the latent periodicity with insertions and deletions in amino-acid sequences and financial time series are discussed. 相似文献

12.

Metallothioneins: proteins in search of function 总被引：43，自引：0，他引：43

M Karin 《Cell》1985,41(1):9-10

相似文献

13.

Phosphofructokinase: complete amino-acid sequence of the enzyme from Bacillus stearothermophilus 总被引：3，自引：0，他引：3

E Kolb P J Hudson J I Harris 《European journal of biochemistry》1980,108(2):587-597

The entire amino acid sequence of the protein subunit of phosphofructokinase from Bacillus stearothermophilus has been established mainly by sequence analysis of cyanogen bromide fragments and of peptides derived from these fragments by further digestion with proteolytic enzymes. Overlaps of the cyanogen bromide fragments as well as peptide sequences necessary to complement and to confirm tentative assignments within the larger peptide fragments were obtained from the sequences of selected peptides isolated from tryptic and chymotryptic digests of the intact S-[14C]-carboxymethylated protein. Sequence information was also provided by automated sequence analysis of the intact protein subunit and of some of the larger peptide fragments. The sequence is as follows: (See Text). 相似文献

14.

Heat shock proteins: the search for functions 总被引：32，自引：4，他引：32

下载免费PDF全文

M J Schlesinger 《The Journal of cell biology》1986,103(2):321-325

相似文献

15.

The complete amino-acid sequence of the bilin-binding protein from Pieris brassicae and its similarity to a family of serum transport proteins like the retinol-binding proteins

F Suter H Kayser H Zuber 《Biological chemistry Hoppe-Seyler》1988,369(6):497-505

The amino-acid sequence from the bilin binding protein (BBP) of the butterfly Pieris brassicae has been determined. The apoprotein with a length of 173 amino-acid residues has a molecular mass of 19,676 Da. The sequence analysis was performed by automated Edman degradation of the intact apoprotein and of fragments as large as possible generated from different digestions. The 3-dimensional structure of BBP, determined by Huber et al. (Huber, R., Schneider, M., Epp, O., Mayr, I., Messerschmidt, A., Pflugrath, J. & Kayser, H. (1987) J. Mol. Biol. 195, 423-434 and Huber, R., Schneider, M., Mayr, I., Müller, R., Deutzmann, R., Suter, F., Zuber, H., Falk, H. & Kayser, H. (1987) J. Mol. Biol. 198, 499-513) down to 2-A resolution, exhibits a similar conformation to the human retinol binding protein. Sawyer (Sawyer, L. (1987) Nature (London) 327, 659) demonstrated that proteins from a wide variety of sources can be gathered into a "superfamily". Computer searches of data banks yielded in a new member of this superfamily, namely human alpha 1-acid glycoprotein. One of the functions of the listed proteins is to bind and transport small hydrophobic molecules in serum. 相似文献

16.

From patenting genes to proteins: the search for utility via function

Ilag LL Ilag LM Ilag LL 《Trends in biotechnology》2002,20(5):197-199

The debate regarding the patenting of genes has extended into the post-genome era. With only approximately 35000 genes deduced from the draft sequence of the human genome, there are fears that a few companies have already gained monopoly on the potential benefits from this knowledge. Nevertheless, it is accepted that proteins determine gene function and function is not readily predicted from gene sequence. Furthermore, genes can encode multiple proteins and a single protein can have multiple functions. Here, we argue that unraveling the intrinsic complexity of proteins and their functions is the key towards determining the utility requirement for patenting protein inventions and consider the possible socioeconomic impact. 相似文献

17.

FASH: A web application for nucleotides sequence search

Isana Veksler-Lublinksy Danny Barash Chai Avisar Einav Troim Paul Chew Klara Kedem 《Source code for biology and medicine》2008,3(1):9

FASH (Fourier Alignment Sequence Heuristics) is a web application, based on the Fast Fourier Transform, for finding remote homologs within a long nucleic acid sequence. Given a query sequence and a long text-sequence (e.g, the human genome), FASH detects subsequences within the text that are remotely-similar to the query. FASH offers an alternative approach to Blast/Fasta for querying long RNA/DNA sequences. FASH differs from these other approaches in that it does not depend on the existence of contiguous seed-sequences in its initial detection phase. The FASH web server is user friendly and very easy to operate. 相似文献

18.

Local sequence patterns of hydrophobicity and solvent accessibility in soluble globular proteins

D J Lipman R W Pastor B Lee 《Biopolymers》1987,26(1):17-26

We examined the variation in the solvent accessibility and hydrophobicity of the amino acids along the sequences of 58 soluble globular proteins with known tertiary structure. We found that there is a significant tendency for the accessibilities to run in clusters along the sequence but that the hydrophobicities are distributed without such nonrandom clusters. Theseresults suggest severe limitations on the power of sequence analysis tools that use average hydrophobicity scores of overlapping subsequences to predict accessibility. 相似文献

19.

The amino-acid sequence of beta-lactoglobulin II from horse colostrum (Equus caballus, Perissodactyla): beta-lactoglobulins are retinol-binding proteins 总被引：1，自引：0，他引：1

J Godovac-Zimmermann A Conti J Liberatori G Braunitzer 《Biological chemistry Hoppe-Seyler》1985,366(6):601-608

beta-Lactoglobulin isolated from horse colostrum is heterogeneous and contains two components: beta-lactoglobulin I and beta-lactoglobulin II. These two proteins are monomeric and show differences in their electrophoretic mobilities, chain lengths and primary structures. The complete amino-acid sequence of beta-lactoglobulin II was determined by automated Edman degradation of the intact protein and of the peptides derived from these by digestion with trypsin or chymotrypsin and by chemical cleavage with cyanogen bromide. Unlike other beta-lactoglobulins which contain 162 amino acids, horse beta-lactoglobulin II is unique in that it contains 166 amino acids. The additional four amino acids represent an insertion between positions 116 and 117 of other beta-lactoglobulins so far sequenced, including horse beta-lactoglobulin I. Sequence comparison of beta-lactoglobulins I and II from horse colostrum reveals 48 amino acid substitutions (30%). Such a diversity between members of the beta-lactoglobulin gene family has not been encountered before. Sequence comparison with bovine beta-lactoglobulin A shows 85 amino acid replacements accounting for 53% of the residues. The structural homology with human retinol-binding protein may reveal similar biological functions and clues to the origin of milk proteins. 相似文献

20.

Tandem repeats in proteins: from sequence to structure

Kajava AV 《Journal of structural biology》2012,179(3):279-288

The bioinformatics analysis of proteins containing tandem repeats requires special computer programs and databases, since the conventional approaches predominantly developed for globular domains have limited success. Here, I survey bioinformatics tools which have been developed recently for identification and proteome-wide analysis of protein repeats. The last few years have also been marked by an emergence of new 3D structures of these proteins. Appraisal of the known structures and their classification uncovers a straightforward relationship between their architecture and the length of the repetitive units. This relationship and the repetitive character of structural folds suggest rules for better prediction of the 3D structures of such proteins. Furthermore, bioinformatics approaches combined with low resolution structural data, from biophysical techniques, especially, the recently emerged cryo-electron microscopy, lead to reliable prediction of the protein repeat structures and their mode of binding with partners within molecular complexes. This hybrid approach can actively be used for structural and functional annotations of proteomes. 相似文献