Motif-based protein sequence classification using neural networks.
Authors: Konstantinos Blekas, Dimitrios I. Fotiadis, Aristidis Likas
Institution: Department of Computer Science and Biomedical Research Institute-FORTH, University of Ioannina, GR-45110 Ioannina, Greece. kblekas@cs.uoi.gr
Abstract: We present a system for multi-class protein classification based on neural networks. The central issue in building a neural network system for protein classification is the encoding scheme used to turn a sequence into network input. To address it, we propose a method that maps a protein sequence into a numerical feature space using the matching scores of the sequence against groups of conserved patterns (called motifs) found in protein families. We consider two alternative ways of identifying the motifs used for feature generation and provide a comparative evaluation of the two schemes. We also evaluate how incorporating background features (2-grams) affects the performance of the neural system. Experimental results on real datasets indicate that the proposed method is highly efficient and outperforms other well-known methods for protein classification.
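
The encoding idea in the abstract can be illustrated with a minimal sketch. The abstract does not state the scoring function, the motif discovery procedure, or the network architecture, so everything below is an assumption made for illustration: the function names (`motif_score`, `two_gram_frequencies`, `encode`) are hypothetical, motifs are written as plain strings with `.` as a wildcard, and the matching score is a simple stand-in (the best fraction of matching positions over all windows), not the authors' scoring scheme.

```python
from itertools import product

# The 20 standard amino acids, used to enumerate all 400 possible 2-grams.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def motif_score(sequence, motif):
    """Stand-in matching score (an assumption, not the paper's scheme).

    Slides the motif along the sequence and returns the best fraction of
    positions that match; '.' in the motif matches any residue.
    """
    width = len(motif)
    if len(sequence) < width:
        return 0.0
    best = 0.0
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        hits = sum(1 for m, r in zip(motif, window) if m == "." or m == r)
        best = max(best, hits / width)
    return best


def two_gram_frequencies(sequence):
    """Relative frequency of each of the 400 amino-acid pairs (background features)."""
    pairs = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]
    counts = dict.fromkeys(pairs, 0)
    total = 0
    for i in range(len(sequence) - 1):
        pair = sequence[i:i + 2]
        if pair in counts:  # skip pairs containing non-standard residues
            counts[pair] += 1
            total += 1
    return [counts[p] / total if total else 0.0 for p in pairs]


def encode(sequence, motifs, use_two_grams=False):
    """Map a sequence to a fixed-length feature vector: one score per motif,
    optionally followed by the 400 2-gram frequencies."""
    features = [motif_score(sequence, m) for m in motifs]
    if use_two_grams:
        features += two_gram_frequencies(sequence)
    return features


if __name__ == "__main__":
    motifs = ["CDE", "C.E", "WWW"]  # hypothetical motifs for illustration
    print(encode("ACDEFG", motifs))
```

The resulting vector has a fixed length regardless of sequence length, which is what allows it to be fed to a neural network classifier; setting `use_two_grams=True` appends the background features whose effect the abstract says was evaluated.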