首页 | 本学科首页   官方微博 | 高级检索  
   检索      


ProDomAs,protein domain assignment algorithm using center‐based clustering and independent dominating set
Authors:Elnaz Saberi Ansari  Changiz Eslahchi  Hamid Pezeshk  Mehdi Sadeghi
Institution:1. Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, , Tehran, Iran;2. Department of Computer Science, Shahid Beheshti University, G.C., , Tehran, Iran;3. School of Biological Science, Institute for Research in Fundamental Science (IPM), , Tehran, Iran;4. School of Mathematics, Statistics and Computer Science, College of Science, University of Tehran, , Tehran, Iran;5. National Institute of Genetic Engineering and Biotechnology, , Tehran, Iran
Abstract:Decomposition of structural domains is an essential task in classifying protein structures, predicting protein function, and many other proteomics problems. As the number of known protein structures in PDB grows exponentially, the need for accurate automatic domain decomposition methods becomes more essential. In this article, we introduce a bottom‐up algorithm for assigning protein domains using a graph theoretical approach. This algorithm is based on a center‐based clustering approach. For constructing initial clusters, members of an independent dominating set for the graph representation of a protein are considered as the centers. A distance matrix is then defined for these clusters. To obtain final domains, these clusters are merged using the compactness principle of domains and a method similar to the neighbor‐joining algorithm considering some thresholds. The thresholds are computed using a training set consisting of 50 protein chains. The algorithm is implemented using C++ language and is named ProDomAs. To assess the performance of ProDomAs, its results are compared with seven automatic methods, against five publicly available benchmarks. The results show that ProDomAs outperforms other methods applied on the mentioned benchmarks. The performance of ProDomAs is also evaluated against 6342 chains obtained from ASTRAL SCOP 1.71. ProDomAs is freely available at http://www.bioinf.cs.ipm.ir/software/prodomas . Proteins 2014; 82:1937–1946. © 2014 Wiley Periodicals, Inc.
Keywords:domain boundary detection  compactness principle of structural domains  discontinuous domains  bottom‐up algorithm  graph theoretical approach
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号