
1.

The distance-profile representation and its application to detection of distantly related protein families

Chin-Jen Ku Golan Yona 《BMC bioinformatics》2005,6(1):282

Background

Detecting homology between remotely related protein families is an important problem in computational biology since the biological properties of uncharacterized proteins can often be inferred from those of homologous proteins. Many existing approaches address this problem by measuring the similarity between proteins through sequence or structural alignment. However, these methods do not exploit collective aspects of the protein space and the computed scores are often noisy and frequently fail to recognize distantly related protein families.

2.

Predicting conserved protein motifs with Sub-HMMs

Kevin Horan Christian R Shelton Thomas Girke 《BMC bioinformatics》2010,11(1):205

Background

Profile HMMs (hidden Markov models) provide effective methods for modeling the conserved regions of protein families. A limitation of the resulting domain models is the difficulty of pinpointing their much shorter functional sub-features, such as catalytically relevant sequence motifs in enzymes or ligand binding signatures of receptor proteins.

3.

Partially-supervised protein subclass discovery with simultaneous annotation of functional residues

Benjamin Georgi Jörg Schultz Alexander Schliep 《BMC structural biology》2009,9(1):68-14

Background

The study of functional subfamilies of protein domain families and the identification of the residues which determine substrate specificity is an important question in the analysis of protein domains. One way to address this question is the use of clustering methods for protein sequence data and approaches to predict functional residues based on such clusterings. The locations of putative functional residues in known protein structures provide insights into how different substrate specificities are reflected on the protein structure level.

4.

Prediction of protein-protein binding site by using core interface residue and support vector machine

Nan Li Zhonghua Sun Fan Jiang 《BMC bioinformatics》2008,9(1):553

Background

The prediction of protein-protein binding sites can provide structural annotation for the protein interaction data from proteomics studies. This is very important for the biological application of protein interaction data, which is increasing rapidly. Moreover, methods for predicting protein interaction sites can also provide crucial information for improving the speed and accuracy of protein docking methods.

5.

Ensemble approach to predict specificity determinants: benchmarking and validation

Saikat Chakrabarti Anna R Panchenko 《BMC bioinformatics》2009,10(1):207

Background

It is extremely important and challenging to identify the sites that are responsible for functional specification or diversification in protein families. In this study, a rigorous comparative benchmarking protocol was employed to provide a reliable evaluation of methods which predict the specificity determining sites. Subsequently, the three best-performing methods were applied to identify new potential specificity determining sites through an ensemble approach and common agreement of their prediction results.

6.

Estimates of statistical significance for comparison of individual positions in multiple sequence alignments

Ruslan I Sadreyev Nick V Grishin 《BMC bioinformatics》2004,5(1):106

Background

Profile-based analysis of multiple sequence alignments (MSA) allows for accurate comparison of protein families. Here, we address the problems of detecting statistically confident dissimilarities (1) between an MSA position and a set of predicted residue frequencies, and (2) between two MSA positions. These problems are important for (i) evaluation and optimization of methods predicting residue occurrence at protein positions; (ii) detection of potentially misaligned regions in automatically produced alignments and their further refinement; and (iii) detection of sites that determine functional or structural specificity in two related families.

7.

Kalign – an accurate and fast multiple sequence alignment algorithm

Timo Lassmann Erik LL Sonnhammer 《BMC bioinformatics》2005,6(1):298

Background

The alignment of multiple protein sequences is a fundamental step in the analysis of biological data. It has traditionally been applied to analyzing protein families for conserved motifs, phylogeny, structural properties, and to improve sensitivity in homology searching. The availability of complete genome sequences has increased the demands on multiple sequence alignment (MSA) programs. Current MSA methods suffer from being either too inaccurate or too computationally expensive to be applied effectively in large-scale comparative genomics.

8.

SplitTester: software to identify domains responsible for functional divergence in protein family

Xiang Gao Kent A Vander Velden Daniel F Voytas Xun Gu 《BMC bioinformatics》2005,6(1):137

Background

Many protein families have undergone functional divergence after gene duplications such that current subgroups of the family carry out overlapping but distinct biological roles. For the protein families with known functional subtypes (a functional split), we developed the software, SplitTester, to identify potential regions that are responsible for the observed distinct functional subtypes within the same protein family.

9.

Evolution by leaps: gene duplication in bacteria

Margrethe H Serres Alastair RW Kerr Thomas J McCormack Monica Riley 《Biology direct》2009,4(1):46-17

Background

Sequence-related families of genes and proteins are common in bacterial genomes. In Escherichia coli they constitute over half of the genome. The presence of families and superfamilies of proteins suggests a history of gene duplication and divergence during evolution. Genome-encoded protein families, their size and functional composition, reflect the metabolic potentials of the organisms they are found in. Comparing protein families of different organisms gives insight into functional differences and similarities.

10.

TMB-Hunt: An amino acid composition based method to screen proteomes for beta-barrel transmembrane proteins

Andrew G Garrow Alison Agnew David R Westhead 《BMC bioinformatics》2005,6(1):56

Background

Beta-barrel transmembrane (bbtm) proteins are a functionally important and diverse group of proteins expressed in the outer membranes of bacteria (both gram negative and acid fast gram positive), mitochondria and chloroplasts. Despite recent publications describing reasonable levels of accuracy for discriminating between bbtm proteins and other proteins, screening of entire genomes remains troublesome as these molecules only constitute a small fraction of the sequences screened. Therefore, novel methods capable of detecting new families of bbtm proteins in diverse genomes are still required.

11.

Towards a comprehensive structural coverage of completed genomes: a structural genomics viewpoint

Russell L Marsden Tony A Lewis Christine A Orengo 《BMC bioinformatics》2007,8(1):86

Background

Structural genomics initiatives were established with the aim of solving protein structures on a large scale. For many initiatives, such as the Protein Structure Initiative (PSI), the primary aim of target selection is to structurally characterise protein families which, so far, lack a structural representative. It is therefore of considerable interest to gain insights into the number and distribution of these families, and what efforts may be required to achieve a comprehensive structural coverage across all protein families.

12.

X-ray structures of two proteins belonging to Pfam DUF178 revealed unexpected structural similarity to the DUF191 Pfam family

Rajiv Tyagi Stephen K Burley Subramanyam Swaminathan 《BMC structural biology》2007,7(1):62

Background

Pfam is a comprehensive collection of protein domains and families, with a range of well-established information including genome annotation. Pfam has two large series of functionally uncharacterized families, known as Domains of Unknown Function (DUFs) and Uncharacterized Protein Families (UPFs).

13.

iQuantitator: A tool for protein expression inference using iTRAQ

John H Schwacke Elizabeth G Hill Edward L Krug Susana Comte-Walters Kevin L Schey 《BMC bioinformatics》2009,10(1):342

Background

Isobaric Tags for Relative and Absolute Quantitation (iTRAQ™) [Applied Biosystems] have seen increased application in differential protein expression analysis. To facilitate the growing need to analyze iTRAQ data, especially for cases involving multiple iTRAQ experiments, we have developed a modeling approach, statistical methods, and tools for estimating the relative changes in protein expression under various treatments and experimental conditions.

14.

LipocalinPred: a SVM-based method for prediction of lipocalins

Jayashree Ramana Dinesh Gupta 《BMC bioinformatics》2009,10(1):445

Background

Functional annotation of rapidly amassing nucleotide and protein sequences presents a challenging task for modern bioinformatics. This is particularly true for protein families sharing extremely low sequence identity, as for lipocalins, a family of proteins with varied functions and great diversity at the sequence level, yet conserved structures.

15.

A hybrid clustering approach to recognition of protein families in 114 microbial genomes

Timothy J Harlow J Peter Gogarten Mark A Ragan 《BMC bioinformatics》2004,5(1):45

Background

Grouping proteins into sequence-based clusters is a fundamental step in many bioinformatic analyses (e.g., homology-based prediction of structure or function). Standard clustering methods such as single-linkage clustering capture a history of cluster topologies as a function of threshold, but in practice their usefulness is limited because unrelated sequences join clusters before biologically meaningful families are fully constituted, e.g. as the result of matches to so-called promiscuous domains. Use of the Markov Cluster algorithm avoids this non-specificity, but does not preserve topological or threshold information about protein families.

16.

Inferring functional modules of protein families with probabilistic topic models

Sebastian GA Konietzny Laura Dietz Alice C McHardy 《BMC bioinformatics》2011,12(1):141

Background

Genome and metagenome studies have identified thousands of protein families whose functions are poorly understood and for which techniques for functional characterization provide only partial information. For such proteins, the genome context can give further information about their functional context.

17.

CMASA: an accurate algorithm for detecting local protein structural similarity and its application to enzyme catalytic site annotation

Gong-Hua Li Jing-Fei Huang 《BMC bioinformatics》2010,11(1):439

Background

The rapid development of structural genomics has resulted in many "unknown function" proteins being deposited in the Protein Data Bank (PDB); the functional prediction of these proteins has thus become a challenge for structural bioinformatics. Several sequence-based and structure-based methods have been developed to predict protein function, but their accuracy, sensitivity, and computational speed need further improvement. Here, an accurate algorithm, CMASA (Contact MAtrix based local Structural Alignment algorithm), has been developed to predict unknown functions of proteins based on local protein structural similarity. The algorithm has been evaluated on a purpose-built test set of 164 enzyme families and compared with other methods.

18.

SUPFAM: A database of sequence superfamilies of protein domains

Shashi B Pandit Rana Bhadra VS Gowri S Balaji B Anand N Srinivasan 《BMC bioinformatics》2004,5(1):28

Background

The SUPFAM database is a compilation of superfamily relationships between protein domain families of either known or unknown 3-D structure. In SUPFAM, sequence families from Pfam and structural families from SCOP are associated, using profile matching, to yield sequence superfamilies of known structure. Subsequently, all-against-all family profile matches are made to deduce a list of new potential superfamilies of as yet unknown structure.

19.

Taxonomic distribution and origins of the extended LHC (light-harvesting complex) antenna protein superfamily (Total citations: 1; self-citations: 0; citations by others: 1)

Johannes Engelken Henner Brinkmann Iwona Adamska 《BMC evolutionary biology》2010,10(1):233

Background

The extended light-harvesting complex (LHC) protein superfamily is a centerpiece of eukaryotic photosynthesis, comprising the LHC family and several families involved in photoprotection, like the LHC-like and the photosystem II subunit S (PSBS). The evolution of this complex superfamily has long remained elusive, partially due to previously missing families.

20.

BIPAD: A web server for modeling bipartite sequence elements

Chengpeng Bi Peter K Rogan 《BMC bioinformatics》2006,7(1):76

Background

Many dimeric protein complexes bind cooperatively to families of bipartite nucleic acid sequence elements, which consist of pairs of conserved half-site sequences separated by intervening distances that vary among individual sites.