首页 | 本学科首页   官方微博 | 高级检索  
     


Extending the Horizon of Homology Detection with Coevolution-based Structure Prediction
Affiliation:Medical Research Council Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh EH4 2XU, UK
Abstract:Traditional sequence analysis algorithms fail to identify distant homologies when they lie beyond a detection horizon. In this review, we discuss how co-evolution-based contact and distance prediction methods are pushing back this homology detection horizon, thereby yielding new functional insights and experimentally testable hypotheses. Based on correlated substitutions, these methods divine three-dimensional constraints among amino acids in protein sequences that were previously devoid of all annotated domains and repeats. The new algorithms discern hidden structure in an otherwise featureless sequence landscape. Their revelatory impact promises to be as profound as the use, by archaeologists, of ground-penetrating radar to discern long-hidden, subterranean structures. As examples of this, we describe how triplicated structures reflecting longin domains in MON1A-like proteins, or UVR-like repeats in DISC1, emerge from their predicted contact and distance maps. These methods also help to resolve structures that do not conform to a “beads-on-a-string” model of protein domains. In one such example, we describe CFAP298 whose ubiquitin-like domain was previously challenging to perceive owing to a large sequence insertion within it. More generally, the new algorithms permit an easier appreciation of domain families and folds whose evolution involved structural insertion or rearrangement. As we exemplify with α1-antitrypsin, coevolution-based predicted contacts may also yield insights into protein dynamics and conformational change. This new combination of structure prediction (using innovative co-evolution based methods) and homology inference (using more traditional sequence analysis approaches) shows great promise for bringing into view a sea of evolutionary relationships that had hitherto lain far beyond the horizon of homology detection.
Keywords:remote homology  CFAP298  C21ORF59  DISC1  coevolution  BLAST"  },{"  #name"  :"  keyword"  ,"  $"  :{"  id"  :"  k0035"  },"  $$"  :[{"  #name"  :"  text"  ,"  _"  :"  Basic Local Alignment Search Tool  C21ORF59"  },{"  #name"  :"  keyword"  ,"  $"  :{"  id"  :"  k0045"  },"  $$"  :[{"  #name"  :"  text"  ,"  _"  :"  Chromosome 21 Open Reading Frame 59  CATH"  },{"  #name"  :"  keyword"  ,"  $"  :{"  id"  :"  k0055"  },"  $$"  :[{"  #name"  :"  text"  ,"  _"  :"  Class Architecture Topology Homologous  CFAP298"  },{"  #name"  :"  keyword"  ,"  $"  :{"  id"  :"  k0065"  },"  $$"  :[{"  #name"  :"  text"  ,"  _"  :"  Cilia- and flagella-associated protein 298  DISC1"  },{"  #name"  :"  keyword"  ,"  $"  :{"  id"  :"  k0075"  },"  $$"  :[{"  #name"  :"  text"  ,"  _"  :"  Disrupted in schizophrenia 1  MON1A"  },{"  #name"  :"  keyword"  ,"  $"  :{"  id"  :"  k0085"  },"  $$"  :[{"  #name"  :"  text"  ,"  _"  :"  Monensin sensitivity protein 1A  MSA"  },{"  #name"  :"  keyword"  ,"  $"  :{"  id"  :"  k0095"  },"  $$"  :[{"  #name"  :"  text"  ,"  _"  :"  Multiple sequence alignment  PDB"  },{"  #name"  :"  keyword"  ,"  $"  :{"  id"  :"  k0105"  },"  $$"  :[{"  #name"  :"  text"  ,"  _"  :"  Protein Data Bank  Pfam"  },{"  #name"  :"  keyword"  ,"  $"  :{"  id"  :"  k0115"  },"  $$"  :[{"  #name"  :"  text"  ,"  _"  :"  Protein Families Database  STRING"  },{"  #name"  :"  keyword"  ,"  $"  :{"  id"  :"  k0125"  },"  $$"  :[{"  #name"  :"  text"  ,"  _"  :"  Search Tool for the Retrieval of Interacting Genes/Proteins
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号