

Defining characteristics and conservation of poorly annotated genes in Caenorhabditis elegans using WormCat 2.0
Authors:Daniel P Higgins  Caroline M Weisman  Dominique S Lui  Frank A D'Agostino  Amy K Walker
Institution:Program in Molecular Medicine, UMASS Chan Medical School, Worcester, MA 01605, USA;Lewis-Sigler Institute for Quantitative Genomics, Princeton University, Princeton, NJ 08540, USA;Department of Applied Mathematics, Harvard University, Cambridge, MA 02138, USA
Abstract:Omics tools provide broad datasets for biological discovery. However, the computational tools for identifying important genes or pathways in RNA-seq, proteomics, or GWAS (Genome-Wide Association Study) data depend on Gene Ontology annotations and are biased toward well-described pathways. This limits their utility, because poorly annotated genes, which could have novel functions, are often passed over. Recently, we developed an annotation and category enrichment tool for Caenorhabditis elegans genomic data, WormCat, which provides an intuitive visualization output. Unlike Gene Ontology-based enrichment tools, which exclude genes with no annotation information, WormCat 2.0 retains these genes as a special UNASSIGNED category. Here, we show that genes in the enriched UNASSIGNED category exhibit tissue-specific expression patterns and can include genes with biological functions identified in published datasets. Poorly annotated genes are often considered to be potentially species-specific and thus of reduced interest to the biomedical community. Instead, we find that around 3% of the UNASSIGNED genes have human orthologs, including some linked to human diseases. These human orthologs themselves have little annotation information. A recently developed method that incorporates lineage relationships (abSENSE) indicates that the failure of BLAST to detect homology explains the apparent lineage specificity for many UNASSIGNED genes. This suggests that a larger subset could be related to human genes. WormCat provides an annotation strategy that allows the association of UNASSIGNED genes with specific phenotypes and known pathways. Building these associations in C. elegans, with its robust genetic tools, provides a path to further functional study and insight into these understudied genes.
Keywords:gene enrichment  function of unknown genes  Caenorhabditis elegans