期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

BIPAD: A web server for modeling bipartite sequence elements

Chengpeng Bi Peter K Rogan 《BMC bioinformatics》2006,7(1):76

Background

Many dimeric protein complexes bind cooperatively to families of bipartite nucleic acid sequence elements, which consist of pairs of conserved half-site sequences separated by intervening distances that vary among individual sites. 相似文献

2.

GASP: Gapped Ancestral Sequence Prediction for proteins

Richard?J?Edwards Email author Denis?C?Shields 《BMC bioinformatics》2004,5(1):123

Background

The prediction of ancestral protein sequences from multiple sequence alignments is useful for many bioinformatics analyses. Predicting ancestral sequences is not a simple procedure and relies on accurate alignments and phylogenies. Several algorithms exist based on Maximum Parsimony or Maximum Likelihood methods but many current implementations are unable to process residues with gaps, which may represent insertion/deletion (indel) events or sequence fragments. 相似文献

3.

VirulentPred: a SVM based prediction method for virulent proteins in bacterial pathogens

Aarti Garg Dinesh Gupta 《BMC bioinformatics》2008,9(1):62

Background

Prediction of bacterial virulent protein sequences has implications for identification and characterization of novel virulence-associated factors, finding novel drug/vaccine targets against proteins indispensable to pathogenicity, and understanding the complex virulence mechanism in pathogens. 相似文献

4.

IsoSVM – Distinguishing isoforms and paralogs on the protein level

Michael Spitzer Stefan Lorkowski Paul Cullen Alexander Sczyrba Georg Fuellen 《BMC bioinformatics》2006,7(1):110-14

Background

Recent progress in cDNA and EST sequencing is yielding a deluge of sequence data. Like database search results and proteome databases, this data gives rise to inferred protein sequences without ready access to the underlying genomic data. Analysis of this information (e.g. for EST clustering or phylogenetic reconstruction from proteome data) is hampered because it is not known if two protein sequences are isoforms (splice variants) or not (i.e. paralogs/orthologs). However, even without knowing the intron/exon structure, visual analysis of the pattern of similarity across the alignment of the two protein sequences is usually helpful since paralogs and orthologs feature substitutions with respect to each other, as opposed to isoforms, which do not. 相似文献

5.

Comparative genomics reveals 104 candidate structured RNAs from bacteria,archaea, and their metagenomes

Zasha Weinberg Joy X Wang Jarrod Bogue Jingying Yang Keith Corbino Ryan H Moy Ronald R Breaker 《Genome biology》2010,11(3):R31

Background

Structured noncoding RNAs perform many functions that are essential for protein synthesis, RNA processing, and gene regulation. Structured RNAs can be detected by comparative genomics, in which homologous sequences are identified and inspected for mutations that conserve RNA secondary structure. 相似文献

6.

A combinatorial optimization approach for diverse motif finding applications

Elena Zaslavsky Mona Singh 《Algorithms for molecular biology : AMB》2006,1(1):13-13

Background

Discovering approximately repeated patterns, or motifs, in biological sequences is an important and widely-studied problem in computational molecular biology. Most frequently, motif finding applications arise when identifying shared regulatory signals within DNA sequences or shared functional and structural elements within protein sequences. Due to the diversity of contexts in which motif finding is applied, several variations of the problem are commonly studied. 相似文献

7.

An analysis of the Sargasso Sea resource and the consequences for database composition

Michael L Tress Domenico Cozzetto Anna Tramontano Alfonso Valencia 《BMC bioinformatics》2006,7(1):213-13

Background

The environmental sequencing of the Sargasso Sea has introduced a huge new resource of genomic information. Unlike the protein sequences held in the current searchable databases, the Sargasso Sea sequences originate from a single marine environment and have been sequenced from species that are not easily obtainable by laboratory cultivation. The resource also contains very many fragments of whole protein sequences, a side effect of the shotgun sequencing method. 相似文献

8.

Reconstruction of ancestral protein sequences and its applications

Wei?Cai Jimin?Pei Nick?V?Grishin Email author 《BMC evolutionary biology》2004,4(1):33

Background

Modern-day proteins were selected during long evolutionary history as descendants of ancient life forms. In silico reconstruction of such ancestral protein sequences facilitates our understanding of evolutionary processes, protein classification and biological function. Additionally, reconstructed ancestral protein sequences could serve to fill in sequence space thus aiding remote homology inference. 相似文献

9.

Mosaic structure of intragenic repetitive elements in histone H1-like protein Hc2 varies within serovars of <Emphasis Type="Italic">Chlamydia trachomatis</Emphasis>

Markus Klint Mikael Thollesson Erik Bongcam-Rudloff Svend Birkelund Anders Nilsson Björn Herrmann 《BMC microbiology》2010,10(1):81

Background

The histone-like protein Hc2 binds DNA in Chlamydia trachomatis and is known to vary in size between 165 and 237 amino acids, which is caused by different numbers of lysine-rich pentamers. A more complex structure was seen in this study when sequences from 378 specimens covering the hctB gene, which encodes Hc2, were compared. 相似文献

10.

Evaluating the protein coding potential of exonized transposable element sequences

Jittima Piriyapongsa Mark T Rutledge Sanil Patel Mark Borodovsky I King Jordan 《Biology direct》2007,2(1):31-24

Background

Transposable element (TE) sequences, once thought to be merely selfish or parasitic members of the genomic community, have been shown to contribute a wide variety of functional sequences to their host genomes. Analysis of complete genome sequences have turned up numerous cases where TE sequences have been incorporated as exons into mRNAs, and it is widely assumed that such 'exonized' TEs encode protein sequences. However, the extent to which TE-derived sequences actually encode proteins is unknown and a matter of some controversy. We have tried to address this outstanding issue from two perspectives: i-by evaluating ascertainment biases related to the search methods used to uncover TE-derived protein coding sequences (CDS) and ii-through a probabilistic codon-frequency based analysis of the protein coding potential of TE-derived exons. 相似文献

11.

Discriminative motif discovery in DNA and protein sequences using the DEME algorithm

Emma Redhead Timothy L Bailey 《BMC bioinformatics》2007,8(1):385

相似文献

12.

Modular prediction of protein structural classes from sequences of twilight-zone identity with predicting sequences

Marcin J Mizianty Lukasz Kurgan 《BMC bioinformatics》2009,10(1):414-24

Background

Knowledge of structural class is used by numerous methods for identification of structural/functional characteristics of proteins and could be used for the detection of remote homologues, particularly for chains that share twilight-zone similarity. In contrast to existing sequence-based structural class predictors, which target four major classes and which are designed for high identity sequences, we predict seven classes from sequences that share twilight-zone identity with the training sequences. 相似文献

13.

Back-translation for discovering distant protein homologies in the presence of frameshift mutations

Marta G?rdea Laurent Noé Gregory Kucherov 《Algorithms for molecular biology : AMB》2010,5(1):6

Background

Frameshift mutations in protein-coding DNA sequences produce a drastic change in the resulting protein sequence, which prevents classic protein alignment methods from revealing the proteins' common origin. Moreover, when a large number of substitutions are additionally involved in the divergence, the homology detection becomes difficult even at the DNA level. 相似文献

14.

Detailed protein sequence alignment based on Spectral Similarity Score (SSS)

Kshitiz?Gupta Email author Dina?Thomas SV?Vidya KV?Venkatesh Email author S?Ramakumar 《BMC bioinformatics》2005,6(1):105

Background

The chemical property and biological function of a protein is a direct consequence of its primary structure. Several algorithms have been developed which determine alignment and similarity of primary protein sequences. However, character based similarity cannot provide insight into the structural aspects of a protein. We present a method based on spectral similarity to compare subsequences of amino acids that behave similarly but are not aligned well by considering amino acids as mere characters. This approach finds a similarity score between sequences based on any given attribute, like hydrophobicity of amino acids, on the basis of spectral information after partial conversion to the frequency domain. 相似文献

15.

The 3of5 web application for complex and comprehensive pattern matching in protein sequences

Markus Seiler Alexander Mehrle Annemarie Poustka Stefan Wiemann 《BMC bioinformatics》2006,7(1):144-12

Background

The identification of patterns in biological sequences is a key challenge in genome analysis and in proteomics. Frequently such patterns are complex and highly variable, especially in protein sequences. They are frequently described using terms of regular expressions (RegEx) because of the user-friendly terminology. Limitations arise for queries with the increasing complexity of patterns and are accompanied by requirements for enhanced capabilities. This is especially true for patterns containing ambiguous characters and positions and/or length ambiguities. 相似文献

16.

RIO: Analyzing proteomes by automated phylogenomics using resampled inference of orthologs

Christian M Zmasek Sean R Eddy 《BMC bioinformatics》2002,3(1):14-19

Background

When analyzing protein sequences using sequence similarity searches, orthologous sequences (that diverged by speciation) are more reliable predictors of a new protein's function than paralogous sequences (that diverged by gene duplication). The utility of phylogenetic information in high-throughput genome annotation ("phylogenomics") is widely recognized, but existing approaches are either manual or not explicitly based on phylogenetic trees. 相似文献

17.

Identification of protein functions using a machine-learning approach based on sequence-derived properties

Bum Ju Lee Moon Sun Shin Young Joon Oh Hae Seok Oh Keun Ho Ryu 《Proteome science》2009,7(1):27-19

Background

Predicting the function of an unknown protein is an essential goal in bioinformatics. Sequence similarity-based approaches are widely used for function prediction; however, they are often inadequate in the absence of similar sequences or when the sequence similarity among known protein sequences is statistically weak. This study aimed to develop an accurate prediction method for identifying protein function, irrespective of sequence and structural similarities. 相似文献

18.

Gene Composer: database software for protein construct design, codon engineering, and gene synthesis

Don Lorimer Amy Raymond John Walchli Mark Mixon Adrienne Barrow Ellen Wallace Rena Grice Alex Burgin Lance Stewart 《BMC biotechnology》2009,9(1):36-22

Background

To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. 相似文献

19.

Extension of the COG and arCOG databases by amino acid and nucleotide sequences

Florian Meereis Michael Kaufmann 《BMC bioinformatics》2008,9(1):479

Background

The current versions of the COG and arCOG databases, both excellent frameworks for studies in comparative and functional genomics, do not contain the nucleotide sequences corresponding to their protein or protein domain entries. 相似文献

20.

Predicting mostly disordered proteins by using structure-unknown protein data

Kana Shimizu Yoichi Muraoka Shuichi Hirose Kentaro Tomii Tamotsu Noguchi 《BMC bioinformatics》2007,8(1):78

Background

Predicting intrinsically disordered proteins is important in structural biology because they are thought to carry out various cellular functions even though they have no stable three-dimensional structure. We know the structures of far more ordered proteins than disordered proteins. The structural distribution of proteins in nature can therefore be inferred to differ from that of proteins whose structures have been determined experimentally. We know many more protein sequences than we do protein structures, and many of the known sequences can be expected to be those of disordered proteins. Thus it would be efficient to use the information of structure-unknown proteins in order to avoid training data sparseness. We propose a novel method for predicting which proteins are mostly disordered by using spectral graph transducer and training with a huge amount of structure-unknown sequences as well as structure-known sequences. 相似文献