1.

Exploring protein structural dissimilarity to facilitate structure classification

Pooja Jain Jonathan D Hirst 《BMC structural biology》2009,9(1):60-16

Background

Classification of newly resolved protein structures is important in understanding their architectural, evolutionary and functional relatedness to known protein structures. Among various efforts to improve the database of Structural Classification of Proteins (SCOP), automation has received particular attention. Herein, we predict the deepest SCOP structural level that an unclassified protein shares with classified proteins with an equal number of secondary structure elements (SSEs).

2.

MOTIPS: Automated Motif Analysis for Predicting Targets of Modular Protein Domains

Hugo YK Lam Philip M Kim Janine Mok Raffi Tonikian Sachdev S Sidhu Benjamin E Turk Michael Snyder Mark B Gerstein 《BMC bioinformatics》2010,11(1):243

Background

Many protein interactions, especially those involved in signaling, involve short linear motifs consisting of 5-10 amino acid residues that interact with modular protein domains such as the SH3 binding domains and the kinase catalytic domains. One straightforward way of identifying these interactions is by scanning for matches to the motif against all the sequences in a target proteome. However, predicting domain targets by motif sequence alone without considering other genomic and structural information has been shown to be lacking in accuracy.

3.

Support Vector Machines for predicting protein structural class

Yu-Dong Cai Xiao-Jun Liu Xue-biao Xu Guo-Ping Zhou 《BMC bioinformatics》2001,2(1):3-5

Background

We apply a new machine learning method, the so-called Support Vector Machine method, to predict the protein structural class. The Support Vector Machine method is applied to a database derived from SCOP, in which protein domains are classified based on known structures, their evolutionary relationships, and the principles that govern their 3-D structure.

4.

A topological algorithm for identification of structural domains of proteins

Frank Emmert-Streib Arcady Mushegian 《BMC bioinformatics》2007,8(1):237

Background

Identification of the structural domains of proteins is important for our understanding of the organizational principles and mechanisms of protein folding, and for insights into protein function and evolution. Algorithmic methods developed so far for dissecting proteins of known structure into domains are based on an examination of multiple geometrical, physical and topological features. Successful as many of these approaches are, they employ a lot of heuristics, and it is not clear whether they illuminate any deep underlying principles of protein domain organization. Other well-performing domain dissection methods rely on comparative sequence analysis. These methods are applicable to sequences with known and unknown structure alike, and their success highlights a fundamental principle of protein modularity, but this does not directly improve our understanding of protein spatial structure.

5.

Identification of similar regions of protein structures using integrated sequence and structure analysis tools

Brandon Peters Charles Moad Eunseog Youn Kris Buffington Randy Heiland Sean Mooney 《BMC structural biology》2006,6(1):4-8

Background

Understanding protein function from its structure is a challenging problem. Sequence-based approaches for finding homology have broad use for annotation of both structure and function. 3D structural information on protein domains and their interactions provides a view of structure-function relationships that is complementary to sequence information. We have developed a web site and an API of web services that enables users to submit protein structures and identify statistically significant neighbors and the underlying structural environments that make that match using a suite of sequence and structure analysis tools. To do this, we have integrated S-BLEST, PSI-BLAST and HMMer based superfamily predictions to give a unique integrated view to prediction of SCOP superfamilies, EC number, and GO term, as well as identification of the protein structural environments that are associated with that prediction. Additionally, we have extended UCSF Chimera and PyMOL to support our web services, so that users can characterize their own proteins of interest.

6.

Accurate and efficient gp120 V3 loop structure based models for the determination of HIV-1 co-receptor usage

Majid Masso Iosif I Vaisman 《BMC bioinformatics》2010,11(1):494

Background

HIV-1 targets human cells expressing both the CD4 receptor, which binds the viral envelope glycoprotein gp120, as well as either the CCR5 (R5) or CXCR4 (X4) co-receptors, which interact primarily with the third hypervariable loop (V3 loop) of gp120. Determination of HIV-1 affinity for either the R5 or X4 co-receptor on host cells facilitates the inclusion of co-receptor antagonists as a part of patient treatment strategies. A dataset of 1193 distinct gp120 V3 loop peptide sequences (989 R5-utilizing, 204 X4-capable) is utilized to train predictive classifiers based on implementations of random forest, support vector machine, boosted decision tree, and neural network machine learning algorithms. An in silico mutagenesis procedure employing multibody statistical potentials, computational geometry, and threading of variant V3 sequences onto an experimental structure, is used to generate a feature vector representation for each variant whose components measure environmental perturbations at corresponding structural positions.

7.

Improving protein structure similarity searches using domain boundaries based on conserved sequence information

Evan Kenneth Thompson Yanli Wang Tom Madej Stephen H Bryant 《BMC structural biology》2009,9(1):33-10

Background

The identification of protein domains plays an important role in protein structure comparison. Domain query size and composition are critical to structure similarity search algorithms such as the Vector Alignment Search Tool (VAST), the method employed for computing related protein structures in the NCBI Entrez system. Currently, domains identified on the basis of structural compactness are used for VAST computations. In this study, we have investigated how alternative definitions of domains derived from conserved sequence alignments in the Conserved Domain Database (CDD) would affect the domain comparisons and structure similarity search performance of VAST.

8.

SCOPmap: Automated assignment of protein structures to evolutionary superfamilies

Sara Cheek Yuan Qi S Sri Krishna Lisa N Kinch Nick V Grishin 《BMC bioinformatics》2004,5(1):197

Background

Inference of remote homology between proteins is very challenging and remains a prerogative of an expert. Thus a significant drawback to the use of evolutionary-based protein structure classifications is the difficulty in assigning new proteins to unique positions in the classification scheme with automatic methods. To address this issue, we have developed an algorithm to map protein domains to an existing structural classification scheme and have applied it to the SCOP database.

9.

PURE: A webserver for the prediction of domains in unassigned regions in proteins

Chilamakuri CS Reddy Khader Shameer Bernard O Offmann Ramanathan Sowdhamini 《BMC bioinformatics》2008,9(1):281

Background

Protein domains are the structural and functional units of proteins. The ability to parse proteins into different domains is important for effective classification, understanding of protein structure, function, and evolution and is hence biologically relevant. Several computational methods are available to identify domains in the sequence. Domain finding algorithms often employ stringent thresholds to recognize sequence domains. Identification of additional domains can be tedious involving intense computation and manual intervention but can lead to better understanding of overall biological function. In this context, the problem of identifying new domains in the unassigned regions of a protein sequence assumes a crucial importance.

10.

dConsensus: a tool for displaying domain assignments by multiple structure-based algorithms and for construction of a consensus assignment

Kieran Alden Stella Veretnik Philip E Bourne 《BMC bioinformatics》2010,11(1):310

Background

Partitioning of a protein into structural components, known as domains, is an important initial step in protein classification and for functional and evolutionary studies. While the systematic assignments of domains by human experts exist (CATH and SCOP), the introduction of high throughput technologies for structure determination threatens to overwhelm expert approaches. A variety of algorithmic methods have been developed to expedite this process, allowing almost instant structural decomposition into domains. The performance of algorithmic methods can approach 85% agreement on the number of domains with the consensus reached by experts. However, each algorithm takes a somewhat different conceptual approach, each with unique strengths and weaknesses. Currently there is no simple way to automatically compare assignments from different structure-based domain assignment methods, thereby providing a comprehensive understanding of possible structure partitioning as well as providing some insight into the tendencies of particular algorithms. Most importantly, a consensus assignment drawn from multiple assignment methods can provide a singular and presumably more accurate view.

11.

Ab initio and homology based prediction of protein domains by recursive neural networks

Ian Walsh Alberto JM Martin Catherine Mooney Enrico Rubagotti Alessandro Vullo Gianluca Pollastri 《BMC bioinformatics》2009,10(1):195-19

Background

Proteins, especially larger ones, are often composed of individual evolutionary units, domains, which have their own function and structural fold. Predicting domains is an important intermediate step in protein analyses, including the prediction of protein structures.

12.

Identification of putative domain linkers by a neural network – application to a large sequence database

Satoshi Miyazaki Yutaka Kuroda Shigeyuki Yokoyama 《BMC bioinformatics》2006,7(1):323

Background

The reliable dissection of large proteins into structural domains represents an important issue for structural genomics/proteomics projects. To provide a practical approach to this issue, we tested the ability of a neural network to identify domain linkers from the SWISSPROT database (101602 sequences).

13.

A comparison of random forest and its Gini importance with standard chemometric methods for the feature selection and classification of spectral data

Bjoern H Menze Michael B Kelm Ralf Masuch Uwe Himmelreich Peter Bachert Wolfgang Petrich Fred A Hamprecht 《BMC bioinformatics》2009,10(1):213

Background

Regularized regression methods such as principal component or partial least squares regression perform well in learning tasks on high dimensional spectral data, but cannot explicitly eliminate irrelevant features. The random forest classifier with its associated Gini feature importance, on the other hand, allows for an explicit feature elimination, but may not be optimally adapted to spectral data due to the topology of its constituent classification trees which are based on orthogonal splits in feature space.

14.

Comparison of molecular dynamics and superfamily spaces of protein domain deformation

Javier A Velázquez-Muriel Manuel Rueda Isabel Cuesta Alberto Pascual-Montano Modesto Orozco José-María Carazo 《BMC structural biology》2009,9(1):6-14

Background

The strong relationship between protein structure and flexibility, on the one hand, and biological protein function, on the other, is well known. Technically, protein flexibility exploration is an essential task in many applications, such as protein structure prediction and modeling. In this contribution we have compared two different approaches to explore the flexibility space of protein domains: i) molecular dynamics (MD-space), and ii) the study of the structural changes within a superfamily (SF-space).

15.

Structural assembly of two-domain proteins by rigid-body docking

Tammy MK Cheng Tom L Blundell Juan Fernandez-Recio 《BMC bioinformatics》2008,9(1):441

Background

Modelling proteins with multiple domains is one of the central challenges in Structural Biology. Although homology modelling has successfully been applied for prediction of protein structures, very often domain-domain interactions cannot be inferred from the structures of homologues and their prediction requires ab initio methods. Here we present a new structural prediction approach for modelling two-domain proteins based on rigid-body domain-domain docking.

16.

Domain selection combined with improved cloning strategy for high throughput expression of higher eukaryotic proteins

Yunjia Chen Shihong Qiu Chi-Hao Luan Ming Luo 《BMC biotechnology》2007,7(1):45

Background

Expression of higher eukaryotic genes as soluble, stable recombinant proteins is still a bottleneck step in biochemical and structural studies of novel proteins today. Correct identification of stable domains/fragments within the open reading frame (ORF), combined with proper cloning strategies, can greatly enhance the success rate when higher eukaryotic proteins are expressed as these domains/fragments. Furthermore, an HTP cloning pipeline incorporating bioinformatics domain/fragment selection methods will be beneficial to studies in structural and functional genomics/proteomics.

17.

ASH structure alignment package: Sensitivity and selectivity in domain classification

Daron M Standley Hiroyuki Toh Haruki Nakamura 《BMC bioinformatics》2007,8(1):116

Background

Structure alignment methods offer the possibility of measuring distant evolutionary relationships between proteins that are not visible by sequence-based analysis. However, the question of how structural differences and similarities ought to be quantified in this regard remains open. In this study we construct a training set of sequence-unique CATH and SCOP domains, from which we develop a scoring function that can reliably identify domains with the same CATH topology and SCOP fold classification. The score is implemented in the ASH structure alignment package, for which the source code and a web service are freely available from the PDBj website.

18.

Nh3D: A reference dataset of non-homologous protein structures

B Thiruv G Quon SA Saldanha B Steipe 《BMC structural biology》2005,5(1):12

19.

Development of an accurate classification system of proteins into structured and unstructured regions that uncovers novel structural domains: its application to human transcription factors

Satoshi Fukuchi Keiichi Homma Yoshiaki Minezaki Takashi Gojobori Ken Nishikawa 《BMC structural biology》2009,9(1):26

20.

High quality protein sequence alignment by combining structural profile prediction and profile alignment using SABERTOOTH

Florian Teichert Jonas Minning Ugo Bastolla Markus Porto 《BMC bioinformatics》2010,11(1):251

Background

Protein alignments are an essential tool for many bioinformatics analyses. While sequence alignments are accurate for proteins of high sequence similarity, they become unreliable as they approach the so-called 'twilight zone' where sequence similarity becomes indistinguishable from random. For such distant pairs, structure alignment is of much better quality. Nevertheless, sequence alignment is the only choice in the majority of cases where structural data is not available. This situation demands development of methods that extend the applicability of accurate sequence alignment to distantly related proteins.