首页 | 本学科首页   官方微博 | 高级检索  
   检索      


A discriminative method for family-based protein remote homology detection that combines inductive logic programming and propositional models
Authors:Juliana S Bernardes  Alessandra Carbone  Gerson Zaverucha
Institution:1.COPPE, Programa de Engenharia de Sistemas e Computa??o,Universidade Federal do Rio de Janeiro,Rio de Janeiro,Brazil;2.Université Pierre et Marie Curie,UMR7238,Paris,France;3.CNRS, UMR7238,Laboratoire de Génomique des Microorganismes,Paris,France
Abstract:

Background  

Remote homology detection is a hard computational problem. Most approaches have trained computational models by using either full protein sequences or multiple sequence alignments (MSA), including all positions. However, when we deal with proteins in the "twilight zone" we can observe that only some segments of sequences (motifs) are conserved. We introduce a novel logical representation that allows us to represent physico-chemical properties of sequences, conserved amino acid positions and conserved physico-chemical positions in the MSA. From this, Inductive Logic Programming (ILP) finds the most frequent patterns (motifs) and uses them to train propositional models, such as decision trees and support vector machines (SVM).
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号