Applying negative rule mining to improve genome annotation |
| |
Authors: | Irena I Artamonova Goar Frishman Dmitrij Frishman |
| |
Affiliation: | 1.Institute for Bioinformatics,GSF – National Research Center for Environment and Health,Neuherberg,Germany;2.Group of Bioinformatics,Vavilov Institute of General Genetics RAS,Moscow,Russia;3.Department of Genome Oriented Bioinformatics,Technische Universit?t Munchen, Wissenschaftzentrum Weihenstephan,Freising,Germany |
| |
Abstract: | Background Unsupervised annotation of proteins by software pipelines suffers from very high error rates. Spurious functional assignments
are usually caused by unwarranted homology-based transfer of information from existing database entries to the new target
sequences. We have previously demonstrated that data mining in large sequence annotation databanks can help identify annotation
items that are strongly associated with each other, and that exceptions from strong positive association rules often point
to potential annotation errors. Here we investigate the applicability of negative association rule mining to revealing erroneously
assigned annotation items. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|