期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

High-precision high-coverage functional inference from integrated data sources

Bolan Linghu Evan S Snitkin Dustin T Holloway Adam M Gustafson Yu Xia Charles DeLisi 《BMC bioinformatics》2008,9(1):119

Background

Information obtained from diverse data sources can be combined in a principled manner using various machine learning methods to increase the reliability and range of knowledge about protein function. The result is a weighted functional linkage network (FLN) in which linked neighbors share at least one function with high probability. Precision is, however, low. Aiming to provide precise functional annotation for as many proteins as possible, we explore and propose a two-step framework for functional annotation (1) construction of a high-coverage and reliable FLN via machine learning techniques (2) development of a decision rule for the constructed FLN to optimize functional annotation. 相似文献

2.

Prediction of protein-protein binding site by using core interface residue and support vector machine

Nan Li Zhonghua Sun Fan Jiang 《BMC bioinformatics》2008,9(1):553

Background

The prediction of protein-protein binding site can provide structural annotation to the protein interaction data from proteomics studies. This is very important for the biological application of the protein interaction data that is increasing rapidly. Moreover, methods for predicting protein interaction sites can also provide crucial information for improving the speed and accuracy of protein docking methods. 相似文献

3.

Prediction of enzyme function by combining sequence similarity and protein interactions

Jordi Espadaler Narayanan Eswar Enrique Querol Francesc X Avilés Andrej Sali Marc A Marti-Renom Baldomero Oliva 《BMC bioinformatics》2008,9(1):249

Background

A number of studies have used protein interaction data alone for protein function prediction. Here, we introduce a computational approach for annotation of enzymes, based on the observation that similar protein sequences are more likely to perform the same function if they share similar interacting partners. 相似文献

4.

Super paramagnetic clustering of protein sequences

Igor?V?Tetko Email author Axel?Facius Andreas?Ruepp Hans-Werner?Mewes 《BMC bioinformatics》2005,6(1):82

Background

Detection of sequence homologues represents a challenging task that is important for the discovery of protein families and the reliable application of automatic annotation methods. The presence of domains in protein families of diverse function, inhomogeneity and different sizes of protein families create considerable difficulties for the application of published clustering methods. 相似文献

5.

ProLoc-GO: Utilizing informative Gene Ontology terms for sequence-based prediction of protein subcellular localization

Wen-Lin Huang Chun-Wei Tung Shih-Wen Ho Shiow-Fen Hwang Shinn-Ying Ho 《BMC bioinformatics》2008,9(1):80

Background

Gene Ontology (GO) annotation, which describes the function of genes and gene products across species, has recently been used to predict protein subcellular and subnuclear localization. Existing GO-based prediction methods for protein subcellular localization use the known accession numbers of query proteins to obtain their annotated GO terms. An accurate prediction method for predicting subcellular localization of novel proteins without known accession numbers, using only the input sequence, is worth developing. 相似文献

6.

Automatic extraction of gene ontology annotation and its correlation with clusters in protein networks 总被引：1，自引：0，他引：1

Nikolai Daraselia Anton Yuryev Sergei Egorov Ilya Mazo Iaroslav Ispolatov 《BMC bioinformatics》2007,8(1):243

Background

Uncovering cellular roles of a protein is a task of tremendous importance and complexity that requires dedicated experimental work as well as often sophisticated data mining and processing tools. Protein functions, often referred to as its annotations, are believed to manifest themselves through topology of the networks of inter-proteins interactions. In particular, there is a growing body of evidence that proteins performing the same function are more likely to interact with each other than with proteins with other functions. However, since functional annotation and protein network topology are often studied separately, the direct relationship between them has not been comprehensively demonstrated. In addition to having the general biological significance, such demonstration would further validate the data extraction and processing methods used to compose protein annotation and protein-protein interactions datasets. 相似文献

7.

Effective transcription factor binding site prediction using a combination of optimization,a genetic algorithm and discriminant analysis to capture distant interactions

Victor G Levitsky Elena V Ignatieva Elena A Ananko Igor I Turnaev Tatyana I Merkulova Nikolay A Kolchanov TC Hodgman 《BMC bioinformatics》2007,8(1):481

相似文献

8.

CodingQuarry: highly accurate hidden Markov model gene prediction in fungal genomes using RNA-seq transcripts

Alison C Testa James K Hane Simon R Ellwood Richard P Oliver 《BMC genomics》2015,16(1)

相似文献

9.

AVID: An integrative framework for discovering functional relationships among proteins

Taijiao?Jiang Amy?E?Keating Email author 《BMC bioinformatics》2005,6(1):136

Background

Determining the functions of uncharacterized proteins is one of the most pressing problems in the post-genomic era. Large scale protein-protein interaction assays, global mRNA expression analyses and systematic protein localization studies provide experimental information that can be used for this purpose. The data from such experiments contain many false positives and false negatives, but can be processed using computational methods to provide reliable information about protein-protein relationships and protein function. An outstanding and important goal is to predict detailed functional annotation for all uncharacterized proteins that is reliable enough to effectively guide experiments. 相似文献

10.

Computational detection of genomic <Emphasis Type="Italic">cis-</Emphasis> regulatory modules applied to body patterning in the early <Emphasis Type="Italic">Drosophila</Emphasis> embryo

Nikolaus?Rajewsky Email author Massimo?Vergassola Ulrike?Gaul Eric?D?Siggia Email author 《BMC bioinformatics》2002,3(1):30

相似文献

11.

Protein subcellular localization prediction based on compartment-specific features and structure conservation

Emily Chia-Yu Su Hua-Sheng Chiu Allan Lo Jenn-Kang Hwang Ting-Yi Sung Wen-Lian Hsu 《BMC bioinformatics》2007,8(1):330

Background

Protein subcellular localization is crucial for genome annotation, protein function prediction, and drug discovery. Determination of subcellular localization using experimental approaches is time-consuming; thus, computational approaches become highly desirable. Extensive studies of localization prediction have led to the development of several methods including composition-based and homology-based methods. However, their performance might be significantly degraded if homologous sequences are not detected. Moreover, methods that integrate various features could suffer from the problem of low coverage in high-throughput proteomic analyses due to the lack of information to characterize unknown proteins. 相似文献

12.

IdentiCS – Identification of coding sequence and <Emphasis Type="Italic">in silico</Emphasis> reconstruction of the metabolic network directly from unannotated low-coverage bacterial genome sequence

Jibin?Sun An-Ping?Zeng Email author 《BMC bioinformatics》2004,5(1):112

Background

A necessary step for a genome level analysis of the cellular metabolism is the in silico reconstruction of the metabolic network from genome sequences. The available methods are mainly based on the annotation of genome sequences including two successive steps, the prediction of coding sequences (CDS) and their function assignment. The annotation process takes time. The available methods often encounter difficulties when dealing with unfinished error-containing genomic sequence. 相似文献

13.

Automatic detection of false annotations via binary property clustering

Noam?Kaplan Email author Michal?Linial 《BMC bioinformatics》2005,6(1):46

Background

Computational protein annotation methods occasionally introduce errors. False-positive (FP) errors are annotations that are mistakenly associated with a protein. Such false annotations introduce errors that may spread into databases through similarity with other proteins. Generally, methods used to minimize the chance for FPs result in decreased sensitivity or low throughput. We present a novel protein-clustering method that enables automatic separation of FP from true hits. The method quantifies the biological similarity between pairs of proteins by examining each protein's annotations, and then proceeds by clustering sets of proteins that received similar annotation into biological groups. 相似文献

14.

<Emphasis Type="SmallCaps">COM</Emphasis> e: the ontology of bioinorganic proteins

Kirill?Degtyarenko Email author Sergio?Contrino 《BMC structural biology》2004,4(1):3

Background

Many characterised proteins contain metal ions, small organic molecules or modified residues. In contrast, the huge amount of data generated by genome projects consists exclusively of sequences with almost no annotation. One of the goals of the structural genomics initiative is to provide representative three-dimensional (3-D) structures for as many protein/domain folds as possible to allow successful homology modelling. However, important functional features such as metal co-ordination or a type of prosthetic group are not always conserved in homologous proteins. So far, the problem of correct annotation of bioinorganic proteins has been largely ignored by the bioinformatics community and information on bioinorganic centres obtained by methods other than crystallography or NMR is only available in literature databases. 相似文献

15.

Exploring inconsistencies in genome-wide protein function annotations: a machine learning approach

Carson?Andorf Drena?Dobbs Vasant?Honavar Email author 《BMC bioinformatics》2007,8(1):284

Background

Incorrectly annotated sequence data are becoming more commonplace as databases increasingly rely on automated techniques for annotation. Hence, there is an urgent need for computational methods for checking consistency of such annotations against independent sources of evidence and detecting potential annotation errors. We show how a machine learning approach designed to automatically predict a protein's Gene Ontology (GO) functional class can be employed to identify potential gene annotation errors. 相似文献

16.

PreDisorder: ab initio sequence-based prediction of protein disordered regions

Xin Deng Jesse Eickholt Jianlin Cheng 《BMC bioinformatics》2009,10(1):436

Background

Disordered regions are segments of the protein chain which do not adopt stable structures. Such segments are often of interest because they have a close relationship with protein expression and functionality. As such, protein disorder prediction is important for protein structure prediction, structure determination and function annotation. 相似文献

17.

MimoSA: a system for minimotif annotation

Jay Vyas Ronald J Nowling Thomas Meusburger David Sargeant Krishna Kadaveru Michael R Gryk Vamsi Kundeti Sanguthevar Rajasekaran Martin R Schiller 《BMC bioinformatics》2010,11(1):328

Background

Minimotifs are short peptide sequences within one protein, which are recognized by other proteins or molecules. While there are now several minimotif databases, they are incomplete. There are reports of many minimotifs in the primary literature, which have yet to be annotated, while entirely novel minimotifs continue to be published on a weekly basis. Our recently proposed function and sequence syntax for minimotifs enables us to build a general tool that will facilitate structured annotation and management of minimotif data from the biomedical literature. 相似文献

18.

Adaptive diffusion kernel learning from biological networks for protein function prediction

Liang Sun Shuiwang Ji Jieping Ye 《BMC bioinformatics》2008,9(1):162

Background

Machine-learning tools have gained considerable attention during the last few years for analyzing biological networks for protein function prediction. Kernel methods are suitable for learning from graph-based data such as biological networks, as they only require the abstraction of the similarities between objects into the kernel matrix. One key issue in kernel methods is the selection of a good kernel function. Diffusion kernels, the discretization of the familiar Gaussian kernel of Euclidean space, are commonly used for graph-based data. 相似文献

19.

Genepi: a blackboard framework for genome annotation

Stéphane Descorps-Declère Danielle Ziébelin François Rechenmann Alain Viari 《BMC bioinformatics》2006,7(1):450-13

Background

Genome annotation can be viewed as an incremental, cooperative, data-driven, knowledge-based process that involves multiple methods to predict gene locations and structures. This process might have to be executed more than once and might be subjected to several revisions as the biological (new data) or methodological (new methods) knowledge evolves. In this context, although a lot of annotation platforms already exist, there is still a strong need for computer systems which take in charge, not only the primary annotation, but also the update and advance of the associated knowledge. In this paper, we propose to adopt a blackboard architecture for designing such a system 相似文献

20.

Analysis of superfamily specific profile-profile recognition accuracy

James?A?Casbon Mansoor?AS?Saqi Email author 《BMC bioinformatics》2004,5(1):200

Background

Annotation of sequences that share little similarity to sequences of known function remains a major obstacle in genome annotation. Some of the best methods of detecting remote relationships between protein sequences are based on matching sequence profiles. We analyse the superfamily specific performance of sequence profile-profile matching. Our benchmark consists of a set of 16 protein superfamilies that are highly diverse at the sequence level. We relate the performance to the number of sequences in the profiles, the profile diversity and the extent of structural conservation in the superfamily. 相似文献