首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 140 毫秒
1.
gmconvert is a platform‐independent program provided in GUI (for Apple OS X and Windows XP) and command‐line versions (for other platforms). gmconvert allows rapid reformatting of microsatellite data from output files produced by Applied Biosystems genemapper software (version 3.x). The program will re‐array data into three formats commonly used in downstream analysis: genepop , cervus , and gerud . gmconvert will greatly increase the speed of data preparation prior to analysis and aid in reducing transpositional errors associated with manual re‐arraying and reformatting steps. gmconvert is available from http://gallus.forestry.uga.edu/software/ .  相似文献   

2.
There has been a great increase in both the number of population genetic analysis programs and the size of data sets being studied with them. Since the file formats required by the most popular and useful programs are variable, automated reformatting or conversion between them is desirable. formatomatic is an easy to use program that can read allelic data files in genepop , raw (csv ) or convert formats and create data files in nine formats: raw (csv ), arlequin , genepop , immanc /bayesass +, migrate , newhybrids , msvar , baps and structure . Use of formatomatic should greatly reduce time spent reformatting data sets and avoid unnecessary errors.  相似文献   

3.
4.
Mass spectrometry‐based proteomics starts with identifications of peptides and proteins, which provide the bases for forming the next‐level hypotheses whose “validations” are often employed for forming even higher level hypotheses and so forth. Scientifically meaningful conclusions are thus attainable only if the number of falsely identified peptides/proteins is accurately controlled. For this reason, RAId continued to be developed in the past decade. RAId employs rigorous statistics for peptides/proteins identification, hence assigning accurate P‐values/E‐values that can be used confidently to control the number of falsely identified peptides and proteins. The RAId web service is a versatile tool built to identify peptides and proteins from tandem mass spectrometry data. Not only recognizing various spectra file formats, the web service also allows four peptide scoring functions and choice of three statistical methods for assigning P‐values/E‐values to identified peptides. Users may upload their own protein database or use one of the available knowledge integrated organismal databases that contain annotated information such as single amino acid polymorphisms, post‐translational modifications, and their disease associations. The web service also provides a friendly interface to display, sort using different criteria, and download the identified peptides and proteins. RAId web service is freely available at https://www.ncbi.nlm.nih.gov/CBBresearch/Yu/raid  相似文献   

5.
SUMMARY: ACGT (a comparative genomics tool) is a genomic DNA sequence comparison viewer and analyzer. It can read a pair of DNA sequences in GenBank, Embl or Fasta formats, with or without a comparison file, and provide users with many options to view and analyze the similarities between the input sequences. It is written in Java and can be run on Unix, Linux and Windows platforms. AVAILABILITY: The ACGT program is freely available with documentation and examples at website: http://db.systemsbiology.net/projects/local/mhc/acgt/  相似文献   

6.
convert is a user‐friendly, 32‐bit Windows program that facilitates ready transfer of codominant, diploid genotypic data amongst commonly used population genetic software packages. convert reads input files in its own ‘standard’ data format, easily produced from an excel file of diploid, codominant marker data, and can convert these to the input formats of the following programs: gda , genepop , arlequin , popgene , microsat , phylip , and structure . convert can also read input files in genepop format. In addition, convert can produce a summary table of allele frequencies in which private alleles and the sample sizes at each locus are indicated.  相似文献   

7.
GOAnno: GO annotation based on multiple alignment   总被引:2,自引:0,他引:2  
SUMMARY: GOAnno is a web tool that automatically annotates proteins according to the Gene Ontology (GO) using evolutionary information available in hierarchized multiple alignments. GO terms present in the aligned functional subfamily can be cross-validated and propagated to obtain highly reliable predicted GO annotation based on the GOAnno algorithm. AVAILABILITY: The web tool and a reduced version for local installation are freely available at http://igbmc.u-strasbg.fr/GOAnno/GOAnno.html SUPPLEMENTARY INFORMATION: The website supplies a detailed explanation and illustration of the algorithm at http://igbmc.u-strasbg.fr/GOAnno/GOAnnoHelp.html.  相似文献   

8.
A motif is a short DNA or protein sequence that contributes to the biological function of the sequence in which it resides. Over the past several decades, many computational methods have been described for identifying, characterizing and searching with sequence motifs. Critical to nearly any motif-based sequence analysis pipeline is the ability to scan a sequence database for occurrences of a given motif described by a position-specific frequency matrix. RESULTS: We describe Find Individual Motif Occurrences (FIMO), a software tool for scanning DNA or protein sequences with motifs described as position-specific scoring matrices. The program computes a log-likelihood ratio score for each position in a given sequence database, uses established dynamic programming methods to convert this score to a P-value and then applies false discovery rate analysis to estimate a q-value for each position in the given sequence. FIMO provides output in a variety of formats, including HTML, XML and several Santa Cruz Genome Browser formats. The program is efficient, allowing for the scanning of DNA sequences at a rate of 3.5 Mb/s on a single CPU. Availability and Implementation: FIMO is part of the MEME Suite software toolkit. A web server and source code are available at http://meme.sdsc.edu.  相似文献   

9.
Sequencing whole genomes has become a standard research tool in many disciplines including Molecular Ecology, but the rapid technological advances in combination with several competing platforms have resulted in a confusing diversity of formats. This lack of standard formats causes several problems, such as undocumented preprocessing steps or the loss of information in downstream software tools, which do not account for the specifics of the different available formats. ReadTools is an open‐source Java toolkit designed to standardize and preprocess read data from different platforms. It manages FASTQ‐ and SAM‐formatted inputs while dealing with platform‐specific peculiarities and provides a standard SAM compliant output. The code and executable are available at https://github.com/magicDGS/ReadTools .  相似文献   

10.
11.
As molecular ecologists, we have by necessity become adept at working across computational platforms. A diverse community of scientists has developed a broad array of analytical resources spanning command line to graphical user interface across Linux, Mac, and Windows environments and a dizzying array of program‐specific input formats. In light of this, we often explore our data like free divers – filling our lungs with air and descending for a short period of time into one part of our data set before resurfacing, reformatting, and preparing for our next analysis. In this issue of Molecular Ecology Resources, Meirmans (2020) presents an updated version of GenoDive, a program with a toolkit that provides users with the opportunity to stay a while and delve deeper into the diverse portfolio of information provided by a genomic data set. The comprehensive nature of GenoDive coupled with its unique capability to handle both diploid and polyploid data also provides an opportunity to reflect on the unevenness of resources available for the analysis of polyploid versus diploid data. Since new updates include the addition of plug‐ins for genotype‐environment association analyses, we limit the observations presented here to the common tools used for landscape genomics analyses.  相似文献   

12.
Molecular docking of peptides to proteins can be a useful tool in the exploration of the possible peptide binding sites and poses. CABS‐dock is a method for protein–peptide docking that features significant conformational flexibility of both the peptide and the protein molecules during the peptide search for a binding site. The CABS‐dock has been made available as a web server and a standalone package. The web server is an easy to use tool with a simple web interface. The standalone package is a command‐line program dedicated to professional users. It offers a number of advanced features, analysis tools and support for large‐sized systems. In this article, we outline the current status of the CABS‐dock method, its recent developments, applications, and challenges ahead.  相似文献   

13.
MOTIVATION: Effective use of proteomics data, specifically mass spectrometry data, relies on the ability to read and write the many mass spectrometer file formats. Even with mass spectrometer vendor-specific libraries and vendor-neutral file formats, such as mzXML and mzData it can be difficult to extract raw data files in a form suitable for batch processing and basic research. Introduced here are the ProteomeCommons.org Input and Output Framework, abbreviated to IO Framework, which is designed to abstractly represent mass spectrometry data. This project is a public, open-source, free-to-use framework that supports most of the mass spectrometry data formats, including current formats, legacy formats and proprietary formats that require a vendor-specific library in order to operate. The IO Framework includes an on-line tool for non-programmers and a set of libraries that developers may use to convert between various proteomics file formats. AVAILABILITY: The current source-code and documentation for the ProteomeCommons.org IO Framework is freely available at http://www.proteomecommons.org/current/531/  相似文献   

14.
VIRS (A visual tool for identifying restriction sites in multiple DNA sequences) is an interactive web‐based program designed for restriction endonuclease cut sites prediction and visualization. It can afford to analyze multiple DNA sequences simultaneously and produce visual restriction maps with several useful options intended for users' customization. These options also perform in‐depth analysis of the restriction maps, such as providing virtual electrophoretic result for digested fragments. Different from other analytical tools, VIRS not only displays visual outputs but also provides the detailed properties of restriction endonucleases that are commercially available. All the information of these enzymes is stored in our internal database, which is updated monthly from the manufacturers' web pages. It is freely available online at http://bis.zju.edu.cn/virs/index.html . © 2009 American Institute of Chemical Engineers Biotechnol. Prog., 2009  相似文献   

15.
The DynDom database of protein domain motions   总被引:1,自引:0,他引:1  
A relational database has been developed based on the results from the application of the DynDom program to a number of proteins for which multiple X-ray conformers are available. The database is populated via a web-based tool that allows visitors to the website to run the DynDom program server-side by selecting pairs of X-ray conformers by Protein Data Bank code and chain identifier. AVAILABILITY: The website can be found at: http://www.sys.uea.ac.uk/dyndom.  相似文献   

16.
ABSTRACT: BACKGROUND: Ongoing innovation in phylogenetics and evolutionary biology has been accompanied by a proliferation of software tools, data formats, analytical techniques and web servers. This brings with it the challenge of integrating phylogenetic and other related biological data found in a wide variety of formats, and underlines the need for reusable software that can read, manipulate and transform this information into the various forms required to build computational pipelines. RESULTS: We built a Python software library for working with phylogenetic data that is tightly integrated with Biopython, a broad-ranging toolkit for computational biology. Our library, Bio.Phylo, is highly interoperable with existing libraries, tools and standards, and is capable of parsing common file formats for phylogenetic trees, performing basic transformations and manipulations, attaching rich annotations, and visualizing trees. We unified the modules for working with the standard file formats Newick, NEXUS and phyloXML behind a consistent and simple API, providing a common set of functionality independent of the data source. CONCLUSIONS: Bio.Phylo meets a growing need in bioinformatics for working with heterogeneous types of phylogenetic data. By supporting interoperability with multiple file formats and leveraging existing Biopython features, this library simplifies the construction of phylogenetic workflows. We also provide examples of the benefits of building a community around a shared open-source project. Bio.Phylo is included with Biopython, available through the Biopython website, http://biopython.org.  相似文献   

17.
High-performance computing faces considerable change as the Internet and the Grid mature. Applications that once were tightly-coupled and monolithic are now decentralized, with collaborating components spread across diverse computational elements. Such distributed systems most commonly communicate through the exchange of structured data. Definition and translation of metadata is incorporated in all systems that exchange structured data. We observe that the manipulation of this metadata can be decomposed into three separate steps: discovery, binding of program objects to the metadata, and marshaling of data to and from wire formats. We have designed a method of representing message formats in XML, using datatypes available in the XML Schema specification. We have implemented a tool, XMIT, that uses such metadata and exploits this decomposition in order to provide flexible run-time metadata definition facilities for an efficient binary communication mechanism. We also demonstrate that the use of XMIT makes possible such flexibility at little performance cost.  相似文献   

18.
MOTIVATION: InFiRe, Insertion Finder via Restriction digest, is a novel software tool that allows for the computational identification of transposon insertion sites in known bacterial genome sequences after transposon mutagenesis experiments. The approach is based on the fact that restriction endonuclease digestions of bacterial DNA yield a unique pattern of DNA fragments with defined sizes. Transposon insertion changes the size of the hosting DNA fragment by a known number of base pairs. The exact size of this fragment can be determined by Southern blot hybridization. Subsequently, the position of insertion can be identified with computational analysis. The outlined method provides a solid basis for the establishment of a new high-throughput technology. AVAILABILITY AND IMPLEMENTATION: The software is freely available on our web server at www.infire.tu-bs.de. The algorithm was implemented in the statistical programming language R. For the most flexible use, InFiRe is provided in two different versions. A web interface offers the convenient use in a web browser. In addition, the software and source code is freely available for download as R-packages on our website. CONTACT: m.steinert@tu-bs.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.  相似文献   

19.
Most bioinformatics tools require specialized input formats for sequence comparison and analysis. This is particularly true for molecular phylogeny programs, which accept only certain formats. In addition, it is often necessary to eliminate highly similar sequences among the input, especially when the dataset is large. Moreover, most programs have restrictions upon the sequence name. Here we introduce SeqMaT, a Sequence Manipulation Tool. It has the following functions: data format conversion,sequence name coding and decoding,redundant and highly similar sequence removal, anddata mining utilities. SeqMaT was developed using Java with two versions, web-based and standalone. A standalone program is convenient to manipulate a large number of sequences, while the web version will guarantee wide availability of the tool for researchers and practitioners throughout the Internet. AVAILABILITY: The database is available for free at http://glee.ist.unomaha.edu/seqmat.  相似文献   

20.
TMpro is a transmembrane (TM) helix prediction algorithm that uses language processing methodology for TM segment identification. It is primarily based on the analysis of statistical distributions of properties of amino acids in transmembrane segments. This article describes the availability of TMpro on the internet via a web interface. The key features of the interface are: (i) output is generated in multiple formats including a user-interactive graphical chart which allows comparison of TMpro predicted segment locations with other labeled segments input by the user, such as predictions from other methods. (ii) Up to 5000 sequences can be submitted at a time for prediction. (iii) TMpro is available as a web server and is published as a web service so that the method can be accessed by users as well as other services depending on the need for data integration. Availability: http://linzer.blm.cs.cmu.edu/tmpro/ (web server and help), http://blm.sis.pitt.edu:8080/axis/services/TMProFetcherService (web service).  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号