共查询到20条相似文献,搜索用时 15 毫秒
1.
Rother K Potrzebowski W Puton T Rother M Wywial E Bujnicki JM 《Briefings in bioinformatics》2012,13(2):244-257
Creating useful software is a major activity of many scientists, including bioinformaticians. Nevertheless, software development in an academic setting is often unsystematic, which can lead to problems associated with maintenance and long-term availibility. Unfortunately, well-documented software development methodology is difficult to adopt, and technical measures that directly improve bioinformatic programming have not been described comprehensively. We have examined 22 software projects and have identified a set of practices for software development in an academic environment. We found them useful to plan a project, support the involvement of experts (e.g. experimentalists), and to promote higher quality and maintainability of the resulting programs. This article describes 12 techniques that facilitate a quick start into software engineering. We describe 3 of the 22 projects in detail and give many examples to illustrate the usage of particular techniques. We expect this toolbox to be useful for many bioinformatics programming projects and to the training of scientific programmers. 相似文献
2.
3.
Gilbert D 《Briefings in bioinformatics》2002,3(4):405-409
Pise is interface construction software for bioinformatics applications that run by command-line operations. It creates common, easy-to-use interfaces to these applications for the Web, or other uses. It is adaptable to new bioinformatics tools, and offers program chaining, Unix system batch and other controls, making it an attractive method for building and using your own bioinformatics web services. 相似文献
4.
5.
6.
Macintosh sequence analysis software 总被引:3,自引:0,他引:3
Jonathan P. Clewley 《Molecular biotechnology》1995,3(3):221-224
The analysis of information in nucleotide and amino acid sequence data from an investigator’s own laboratory, or from the
ever-growing worldwide databases, is critically dependent on well planned and written software. Although the most powerful
packages previously have been confined to workstations, there has been a dramatic increase over the last few years in the
sophistication of the programs available for personal computers, as the speed and power of these have increased. A wide choice
of software is available for the Macintosh, including the LaserGene suite of programs from DNAStar. This review assesses the
strengths and weaknesses of LaserGene and concludes that it provides a useful and comprehensive range of sequence analysis
tools. 相似文献
7.
Biskit is a modular, object-oriented python library that provides intuitive classes for many typical tasks of structural bioinformatics research. It facilitates the manipulation and analysis of macromolecular structures, protein complexes and molecular dynamics trajectories. At the same time, Biskit offers a software platform for the rapid integration of external programs and new algorithms into complex structural bioinformatics workflows. Calculations are thus often delegated to established programs like Xplor, Amber, Hex, Prosa, Hmmer and Modeller; interfaces to further software can be easily added. Moreover, Biskit simplifies the parallelization of time consuming calculations via PVM (Parallel Virtual Machine). AVAILABILITY: The latest snapshot of Biskit, documentation and examples are freely available under the GNU General Public License at http://biskit.sf.net (alternate url http://biskit.pasteur.fr). 相似文献
8.
Gordon Cannon 《Analytical biochemistry》1990,190(2):147-153
It is clear that a computer-aided data control system is required for even small laboratories generating nucleic acid data. While the molecular biologist at many universities and large research institutions has access to mainframe computers and nucleic acid sequence analysis software, many find it more convenient to perform sequence analysis on microcomputers that are typically located within the investigator's laboratory and totally dedicated to sequence storage and analysis, in essence giving the investigator more personal control of analysis activities than is sometimes possible with shared mini- or mainframe computers. New programs are being written and released at an increasing rate to perform increasingly more complex and specialized analyses using small computer-based systems. This trend will undoubtedly continue, fueled by the need to manage the ever increasing quantity of sequence data. 相似文献
9.
Quackenbush J 《Genome biology》2003,4(9):336
A report on the Wellcome Trust/Cold Spring Harbor Genome Informatics meeting, Cold Spring Harbor, USA, 7-11 May 2003. 相似文献
10.
Dmitry A. Filatov 《Molecular ecology resources》2002,2(4):621-624
proseq is an integrated user‐friendly windows based program for convenient sequence editing and evolutionary analysis. It is designed to simplify preparation and analysis of DNA sequence data sets in population genetic, phylogenetic and molecular ecology studies. Sequence editor features include editing of chromatogram files, contig assembly, sequence alignment, translation and other utilities. Analysis features include calculation of genetic diversity, divergence, population subdivision and gene flow with permutation‐based significance testing and various tests of neutrality. A tool for coalescent simulations implements models with intragenic recombination, population subdivision and population growth. 相似文献
11.
Theatre: A software tool for detailed comparative analysis and visualization of genomic sequence 总被引:1,自引:0,他引:1
Edwards YJ Carver TJ Vavouri T Frith M Bishop MJ Elgar G 《Nucleic acids research》2003,31(13):3510-3517
12.
The most common types of nucleotide sequence data analyses and handling can be done more conveniently and inexpensively on microcomputers than on large time-sharing systems. We present a package of computer programs for the analysis of DNA and RNA sequence data which overcomes many of the limitations imposed by microcomputers, while offering most of the features of programs commonly available on large computers, including sequence numbering and translation, restriction site and homology searches with dot-matrix plots, nucleotide distribution analysis, and graphic display of data. Most of the programs were written in Standard Pascal (on an Apple II computer) to facilitate portability to other micro-, mini-, and and mainframe computers. 相似文献
13.
Storage of sequence data is a big concern as the amount of data generated is exponential in nature at several locations. Therefore,there is a need to develop techniques to store data using compression algorithm. Here we describe optimal storage algorithm(OPTSDNA) for storing large amount of DNA sequences of varying length. This paper provides performance analysis of optimalstorage algorithm (OPTSDNA) of a distributed bioinformatics computing system for analysis of DNA sequences. OPTSDNAalgorithm is used for storing various sizes of DNA sequences into database. DNA sequences of different lengths were stored byusing this algorithm. These input DNA sequences are varied in size from very small to very large. Storage size is calculated by thisalgorithm. Response time is also calculated in this work. The efficiency and performance of the algorithm is high (in size calculationwith percentage) when compared with other known with sequential approach. 相似文献
14.
Bioconductor: open software development for computational biology and bioinformatics 总被引:3,自引:0,他引:3
Gentleman RC Carey VJ Bates DM Bolstad B Dettling M Dudoit S Ellis B Gautier L Ge Y Gentry J Hornik K Hothorn T Huber W Iacus S Irizarry R Leisch F Li C Maechler M Rossini AJ Sawitzki G Smith C Smyth G Tierney L Yang JY Zhang J 《Genome biology》2004,5(10):R80-16
The Bioconductor project is an initiative for the collaborative creation of extensible software for computational biology and bioinformatics. The goals of the project include: fostering collaborative development and widespread use of innovative software, reducing barriers to entry into interdisciplinary scientific research, and promoting the achievement of remote reproducibility of research results. We describe details of our aims and methods, identify current challenges, compare Bioconductor to other open bioinformatics projects, and provide working examples. 相似文献
15.
Background
Systems biologists work with many kinds of data, from many different sources, using a variety of software tools. Each of these tools typically excels at one type of analysis, such as of microarrays, of metabolic networks and of predicted protein structure. A crucial challenge is to combine the capabilities of these (and other forthcoming) data resources and tools to create a data exploration and analysis environment that does justice to the variety and complexity of systems biology data sets. A solution to this problem should recognize that data types, formats and software in this high throughput age of biology are constantly changing. 相似文献16.
Daniel N Frank 《BMC bioinformatics》2008,9(1):420
Background
Advances in automated DNA sequencing technology have accelerated the generation of metagenomic DNA sequences, especially environmental ribosomal RNA gene (rDNA) sequences. As the scale of rDNA-based studies of microbial ecology has expanded, need has arisen for software that is capable of managing, annotating, and analyzing the plethora of diverse data accumulated in these projects. 相似文献17.
18.
ARB: a software environment for sequence data 总被引:13,自引:2,他引:13
Ludwig W Strunk O Westram R Richter L Meier H Yadhukumar Buchner A Lai T Steppi S Jobb G Förster W Brettske I Gerber S Ginhart AW Gross O Grumann S Hermann S Jost R König A Liss T Lüssmann R May M Nonhoff B Reichel B Strehlow R Stamatakis A Stuckmann N Vilbig A Lenke M Ludwig T Bode A Schleifer KH 《Nucleic acids research》2004,32(4):1363-1371
19.
Jacob L. Steenwyk Thomas J. Buida III Yuanning Li Xing-Xing Shen Antonis Rokas 《PLoS biology》2020,18(12)
Highly divergent sites in multiple sequence alignments (MSAs), which can stem from erroneous inference of homology and saturation of substitutions, are thought to negatively impact phylogenetic inference. Thus, several different trimming strategies have been developed for identifying and removing these sites prior to phylogenetic inference. However, a recent study reported that doing so can worsen inference, underscoring the need for alternative alignment trimming strategies. Here, we introduce ClipKIT, an alignment trimming software that, rather than identifying and removing putatively phylogenetically uninformative sites, instead aims to identify and retain parsimony-informative sites, which are known to be phylogenetically informative. To test the efficacy of ClipKIT, we examined the accuracy and support of phylogenies inferred from 14 different alignment trimming strategies, including those implemented in ClipKIT, across nearly 140,000 alignments from a broad sampling of evolutionary histories. Phylogenies inferred from ClipKIT-trimmed alignments are accurate, robust, and time saving. Furthermore, ClipKIT consistently outperformed other trimming methods across diverse datasets, suggesting that strategies based on identifying and retaining parsimony-informative sites provide a robust framework for alignment trimming.Highly divergent sites in multiple sequence alignments are thought to negatively impact phylogenetic inference; trimming methods aim to remove these sites, but recent analysis suggests that doing so can worsen inference. This study introduces ClipKIT, a trimming method that instead aims to retain parsimony-informative sites; phylogenetic inference using ClipKIT-trimmed alignments is accurate, robust and time-saving. 相似文献
20.
Gisel A Panetta M Grillo G Licciulli VF Liuni S Saccone C Pesole G 《Bioinformatics (Oxford, England)》2004,20(18):3676-3679
SUMMARY: DNAfan (DNA Feature ANalyzer) is a tool combining sequence-filtering and pattern searching. DNAfan automatically extracts user-defined sets of sequence fragments from large sequence sets. Fragments are defined by annotated gene feature keys and co- or non-occurring patterns within the feature or close to it. A gene feature parser and a pattern-based filter tool localizes and extracts the specific subset of sequences. The selected sequence data can subsequently be retrieved for analyses or further processed with DNAfan to find the occurrence of specific patterns or structural motifs. DNAfan is a powerful tool for pattern analysis. Its filter features restricts the pattern search to a well-defined set of sequences, allowing drastic reduction in false positive hits. AVAILABILITY: http://bighost.ba.itb.cnr.it:8080/Framework. 相似文献