共查询到20条相似文献,搜索用时 15 毫秒
1.
Background
Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs).Results
The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACON’s utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27 %, while the number of genes without any function assignment is reduced.Conclusions
We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/.Electronic supplementary material
The online version of this article (doi:10.1186/s12864-015-1826-4) contains supplementary material, which is available to authorized users. 相似文献2.
Simple Sequence Repeats (SSR), also called microsatellite, is very useful for genetic marker development and genome application.
The increasing whole sequences of more and more large genomes provide sources for SSR mining in silico. However currently
existing SSR mining tools can’t process large genomes efficiently and generate no or poor statistics. Genome-wide Microsatellite
Analyzing Tool (GMATo) is a novel tool for SSR mining and statistics at genome aspects. It is faster and more accurate than existed
tools SSR Locator and MISA. If a DNA sequence was too long, it was chunked to short segments at several Mb followed by motifs
generation and searching using Perl powerful pattern match function. Matched loci data from each chunk were then merged to
produce final SSR loci information. Only one input file is required which contains raw fasta DNA sequences and output files in
tabular format list all SSR loci information and statistical distribution at four classifications. GMATo was programmed in Java and
Perl with both graphic and command line interface, either executable alone in platform independent manner with full parameters
control. Software GMATo is a powerful tool for complete SSR characterization in genomes at any size.
Availability
The soft GMATo is freely available at http://sourceforge.net/projects/gmato/files/?source=navbar or on contact 相似文献3.
4.
5.
Mohit Midha Raja Polavarapu Potshangbam Angamba Meetei Hari Krishnan Krishnaveni Mohareer Vaibhav Vindal 《Bioinformation》2012,8(15):738-739
Identification of ortholog is one of the important tasks to understand a novel genome. It helps to assign functional annotations,
from one organism to another organism. To identify the putative ortholog, Reciprocal Best BLAST hit (RBBH) method is known to
be an efficient approach. OrFin makes use of the same approach to identify pair of orthologous proteins for a given set of sequences
of two species. It is a user-friendly web tool which works with user defined parameters to search RBBHs. Results are produced in
both html and text format.
Availability
This web tool is freely available at http://bifl.uohyd.ac.in/orfin 相似文献6.
Rommel Thiago Jucá Ramos Adriana R Carneiro Pablo H Caracciolo Vasco Azevedo Maria Paula C Schneider Debmalya Barh Artur Silva 《Bioinformation》2013,9(11):599-604
Genome assembly has always been complicated due to the inherent difficulties of sequencing technologies, as well the
computational methods used to process sequences. Although many of the problems for the generation of contigs from reads are
well known, especially those involving short reads, the orientation and ordination of contigs in the finishing stages is still very
challenging and time consuming, as it requires the manual curation of the contigs to guarantee correct identification them and
prevent misassembly. Due to the large numbers of sequences that are produced, especially from the reads produced by next
generation sequencers, this process demands considerable manual effort, and there are few software options available to facilitate
the process. To address this problem, we have developed the Graphic Contig Analyzer for All Sequencing Platforms (G4ALL): a
stand-alone multi-user tool that facilitates the editing of the contigs produced in the assembly process. Besides providing
information on the gene products contained in each contig, obtained through a search of the available biological databases, G4ALL
produces a scaffold of the genome, based on the overlap of the contigs after curation.
Availability
The software is available at: http://www.genoma.ufpa.br/rramos/softwares/g4all.xhtml 相似文献7.
P de Sá A Pinto RT Ramos N Coimbra R Baraúna H Dall'agnol A Carneiro A Ranieri A Valadares V Azevedo MP Schneider D Barh A Silva 《Bioinformation》2012,8(11):529-531
The vast amount of data produced by next-generation sequencing (NGS) has necessitated the development of computational tools to assist in understanding the myriad functions performed by the biological macromolecules involved in heredity. In this work, we developed the FunSys programme, a stand-alone tool with an user friendly interface that enables us to evaluate and correlate differential expression patterns from RNA sequencing and proteomics datasets. The FunSys generates charts and reports based on the results of the analysis of differential expression to aid the interpretation of the results. AVAILABILITY: The database is available for free at https://sourceforge.net/projects/funsysufpa/ 相似文献
8.
Gisel A Panetta M Grillo G Licciulli VF Liuni S Saccone C Pesole G 《Bioinformatics (Oxford, England)》2004,20(18):3676-3679
SUMMARY: DNAfan (DNA Feature ANalyzer) is a tool combining sequence-filtering and pattern searching. DNAfan automatically extracts user-defined sets of sequence fragments from large sequence sets. Fragments are defined by annotated gene feature keys and co- or non-occurring patterns within the feature or close to it. A gene feature parser and a pattern-based filter tool localizes and extracts the specific subset of sequences. The selected sequence data can subsequently be retrieved for analyses or further processed with DNAfan to find the occurrence of specific patterns or structural motifs. DNAfan is a powerful tool for pattern analysis. Its filter features restricts the pattern search to a well-defined set of sequences, allowing drastic reduction in false positive hits. AVAILABILITY: http://bighost.ba.itb.cnr.it:8080/Framework. 相似文献
9.
Microsatellites are ubiquitous short tandem repeats found in all known genomes and are known to play a very important role in various studies and fields including DNA fingerprinting, paternity studies, evolutionary studies, virulence and adaptation of certain bacteria and viruses etc. Due to the sequencing of several genomes and the availability of enormous amounts of sequence data during the past few years, computational studies of microsatellites are of interest for many researchers. In this context, we developed a software tool called Imperfect Microsatellite Extractor (IMEx), to extract perfect, imperfect and compound microsatellites from genome sequences along with their complete statistics. Recently we developed a user-friendly graphical-interface using JAVA for IMEx to be used as a stand-alone software named G-IMEx. G-IMEx takes a nucleotide sequence as an input and the results are produced in both html and text formats. The Linux version of G-IMEx can be downloaded for free from http://www.cdfd.org.in/imex. 相似文献
10.
Graham R. Wood Eugene V. Ryabov Jessica M. Fannon Jonathan D. Moore David J. Evans Nigel Burroughs 《Nucleic acids research》2014,42(16):e123
Viral recombination is a key evolutionary mechanism, aiding escape from host immunity, contributing to changes in tropism and possibly assisting transmission across species barriers. The ability to determine whether recombination has occurred and to locate associated specific recombination junctions is thus of major importance in understanding emerging diseases and pathogenesis. This paper describes a method for determining recombinant mosaics (and their proportions) originating from two parent genomes, using high-throughput sequence data. The method involves setting the problem geometrically and the use of appropriately constrained quadratic programming. Recombinants of the honeybee deformed wing virus and the Varroa destructor virus-1 are inferred to illustrate the method from both siRNAs and reads sampling the viral genome population (cDNA library); our results are confirmed experimentally. Matlab software (MosaicSolver) is available. 相似文献
11.
The use of Next-Generation Sequencing of mitochondrial DNA is becoming widespread in biological and clinical research. This, in turn, creates a need for a convenient tool that detects and analyzes heteroplasmy. Here we present MitoBamAnnotator, a user friendly web-based tool that allows maximum flexibility and control in heteroplasmy research. MitoBamAnnotator provides the user with a comprehensively annotated overview of mitochondrial genetic variation, allowing for an in-depth analysis with no prior knowledge in programming. 相似文献
12.
We offer a guide to de novo genome assembly1 using sequence data generated by the Illumina platform for biologists working with fungi or other organisms whose genomes are less than 100 Mb in size. The guide requires no familiarity with sequencing assembly technology or associated computer programs. It defines commonly used terms in genome sequencing and assembly; provides examples of assembling short-read genome sequence data for four strains of the fungus Grosmannia clavigera using four assembly programs; gives examples of protocols and software; and presents a commented flowchart that extends from DNA preparation for submission to a sequencing center, through to processing and assembly of the raw sequence reads using freely available operating systems and software. 相似文献
13.
Protein-protein ligand is one of the most detection methods used in Nano biosensors. Based on the advantage of specific docking between two special 3D structures, they have become a potent candidate in bioanalysis and Nanodiagnostic tools. These tools lease users to do a simple, fast, cost-effective, sensitive, and specific detection of molecular biomarkers in real samples. Recent advantages of using protein-protein ligand Nano-biosensors application is remarkable due to its special docking that refers to each protein unique 3D conformation. However, it challenges different problems such as low rate of docking and hard process for fixation on the basic layer. These challenges make developers to optimize the structure and functions of proteins. The process has different Nano scale calculation that could be done with algorithms and solutions are available as bioinformatics tools. This article aimed to have a short overview of the abilities of bioinformatics tools for modeling and optimization of physiochemical features of proteins in Nano scale. 相似文献
14.
Sindhu KV Rani V Gupta MK Ghaskadbi S Choudhury D Goswami SK 《Biochemical and biophysical research communications》2004,323(3):912-919
Enormity of the metazoan genomes and divergence in their regulation impose a serious constraint on the comprehensive understanding of context specific gene regulation. DNA elements located in the promoter, enhancer, and other regulatory regions of the genome dictate the temporal and spatial patterns of gene activities. However, owing to the diminutive and variable nature of the regulatory DNA elements, their identification and location remains a major challenge. We have developed an efficient strategy for isolating a repertoire of target sites for sequence specific DNA binding proteins from embryonic chick heart. A comprehensive library of such sequences was constructed and authenticated using various parameters including in silico determination of functional binding sites. This approach, therefore, for the first time, established an experimental and conceptual framework for defining the entire repertoire of functional DNA elements in any cellular context. 相似文献
15.
Suchismita Chandran Vladimir N Noskov Thomas H Segall-Shapiro Li Ma Caitlin Whiteis Carole Lartigue Joerg Jores Sanjay Vashee Ray-Yuan Chuang 《BMC genomics》2014,15(1)
Background
With the development of several new technologies using synthetic biology, it is possible to engineer genetically intractable organisms including Mycoplasma mycoides subspecies capri (Mmc), by cloning the intact bacterial genome in yeast, using the host yeast’s genetic tools to modify the cloned genome, and subsequently transplanting the modified genome into a recipient cell to obtain mutant cells encoded by the modified genome. The recently described tandem repeat coupled with endonuclease cleavage (TREC) method has been successfully used to generate seamless deletions and point mutations in the mycoplasma genome using the yeast DNA repair machinery. But, attempts to knock-in genes in some cases have encountered a high background of transformation due to maintenance of unwanted circularization of the transforming DNA, which contains possible autonomously replicating sequence (ARS) activity. To overcome this issue, we incorporated a split marker system into the TREC method, enabling seamless gene knock-in with high efficiency. The modified method is called TREC-assisted gene knock-in (TREC-IN). Since a gene to be knocked-in is delivered by a truncated non-functional marker, the background caused by an incomplete integration is essentially eliminated.Results
In this paper, we demonstrate applications of the TREC-IN method in gene complementation and genome minimization studies in Mmc. In the first example, the Mmc dnaA gene was seamlessly replaced by an orthologous gene, which shares a high degree of identity at the nucleotide level with the original Mmc gene, with high efficiency and low background. In the minimization example, we replaced an essential gene back into the genome that was present in the middle of a cluster of non-essential genes, while deleting the non-essential gene cluster, again with low backgrounds of transformation and high efficiency.Conclusion
Although we have demonstrated the feasibility of TREC-IN in gene complementation and genome minimization studies in Mmc, the applicability of TREC-IN ranges widely. This method proves to be a valuable genetic tool that can be extended for genomic engineering in other genetically intractable organisms, where it may be implemented in elucidating specific metabolic pathways and in rationale vaccine design.Electronic supplementary material
The online version of this article (doi:10.1186/1471-2164-15-1180) contains supplementary material, which is available to authorized users. 相似文献16.
Analysis for free: comparing programs for sequence analysis 总被引:4,自引:0,他引:4
Tippmann HF 《Briefings in bioinformatics》2004,5(1):82-87
Programs to import, manage and align sequences and to analyse the properties of DNA, RNA and proteins are essential for every biological laboratory. This review describes two different freeware (BioEdit and pDRAW for MS Windows) and a commercial program (Sequencher for MS Windows and Apple MacOS). Bioedit and Sequencher offer functions such as sequence alignment and editing plus reading of sequence trace files. pDRAW is a very comfortable visualisation tool with a variety of analysis functions. While Sequencher impresses with a very user-friendly interface and easy-to-use tools, BioEdit offers the largest and most customisable variety of tools. The strength of pDRAW is drawing and analysis of single sequences for priming and restriction sites and virtual cloning. It has a database function for user-specific oligonucleotides and restriction enzymes. 相似文献
17.
The relationship between artifact manufacture, use, and discard in the Developed Oldowan is complex. Here we use digital-image-analysis techniques to investigate the intensity of reduction in single-platform cores of the Developed Oldowan of the Okote Member, Koobi Fora Formation. Data suggest that this method provides a more accurate measure of reduction intensity than previous applications of a unifacial-scraper model. Assemblages of single-platform cores excavated from extensive lateral exposures of the Okote Member provide insights into the relationship between raw-material availability and discard patterns. Variation in reduction intensity suggests that tools are not always discarded in patterns that would be predicted by the availability of raw material. Further, it appears that hominin transport decisions involved an assessment of the potential use-life of certain forms. Many aspects of Developed Oldowan technology conform to previously developed models of curated technologies. 相似文献
18.
19.
20.
Quality control of probe sequences is a major concern in microarray technology. The presence of poor quality probes has a negative impact on the microarray data analysis process. The Microarray Manual Curation Tool (MMCT) is a web server application that provides computational and visual means to investigate the quality of individual probes for oligo microarrays. The MMCT quality metrics assess the free energy of hybridization and the secondary structure of duplexes formed by selected targets and probes, which are specific to various microarray platforms. AVAILABILITY: http://www.nrcbioinformatics.ca/mmct. 相似文献