Automatic identification of large collections of protein-coding or rRNA sequences期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

Automatic identification of large collections of protein-coding or rRNA sequences

Authors:	Arigon Anne-Muriel Perrière Guy Gouy Manolo

Institution:	Université de Lyon, UMR 5558, F-69622 Villeurbanne, France. arigon@biomserv.univ-lyon1.fr

Abstract:	The number of available genomic sequences is growing very fast, due to the development of massive sequencing techniques. Sequence identification is needed and contributes to the assessment of gene and species evolutionary relationships. Automated bioinformatics tools are thus necessary to carry out these identification operations in an accurate and fast way. We developed HoSeqI (Homologous Sequence Identification), a software environment allowing this kind of automated sequence identification using homologous gene family databases. HoSeqI is accessible through a Web interface (http://pbil.univ-lyon1.fr/software/HoSeqI/) allowing to identify one or several sequences and to visualize resulting alignments and phylogenetic trees. We also implemented another application, MultiHoSeqI, to quickly add a large set of sequences to a family database in order to identify them, to update the database, or to help automatic genome annotation. Lately, we developed an application, ChiSeqI (Chimeric Sequence Identification), to automate the processes of identification of bacterial 16S ribosomal RNA sequences and of detection of chimeric sequences.

Keywords:	Automatic identification Similarity Alignment Phylogeny Chimera
本文献已被 ScienceDirect PubMed 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏