首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
A method of interfacing sequence similarity search softwarewith the fast sequence retrieval system ACNUC is described.The method is written in FORTRAN 77 and is straightforward toimplement because no textprocessing code is required —a minimum of 12 extra lines of FORTRAN provided the interfacefor most applications. The method is also efficient, since sequencesare located by simple indexing techniques, with no linear searchesof large database files necessary. Received on November 20, 1986; accepted on January 8, 1987  相似文献   

2.
The program described allows for the creation and manipulation of files of DNA sequence data up to very great lengths. The program uses its own paging system to load segments of the sequence into a small internal buffer so that the program does not have excessive memory requirements. The program offers a menu of functions to the user, and has been written to be forgiving of user errors. A code for the generalised specification of bases as a series of groups (i.e. A or T, Purine, etc.) has been devised and can be used in search specifications or in sequence files. Versions of the program have been developed to run with special efficiency under DIGITAL's RT11 operating system or to run under systems with a suitable implementation of FORTRAN VI.  相似文献   

3.
Many clone-based physical maps have been built with the FingerPrinted Contig (FPC) software, which is written in C and runs locally for fast and flexible analysis. If the maps were viewable only from FPC, they would not be as useful to the whole community since FPC must be installed on the user machine and the database downloaded. Hence, we have created a set of Web tools so users can easily view the FPC data and perform salient queries with standard browsers. This set includes the following four programs: WebFPC, a view of the contigs; WebChrom, the location of the contigs and genetic markers along the chromosome; WebBSS, locating user-supplied sequence on the map; and WebFCmp, comparing fingerprints. For additional FPC support, we have developed an FPC module for BioPerl and an FPC browser using the Generic Model Organism Project (GMOD) genome browser (GBrowse), where the FPC BioPerl module generates the data files for input into GBrowse. This provides an alternative to the WebChrom/WebFPC view. These tools are available to download along with documentation. The tools have been implemented for both the rice (Oryza sativa) and maize (Zea mays) FPC maps, which both contain the locations of clones, markers, genetic markers, and sequenced clone (along with links to sites that contain additional information).  相似文献   

4.
A database was used for data management and interprogram communication in an image processing and three-dimensional reconstruction program suite for biological bundles. The programs were modified from the MRC crystallographic package. The database server works with local and remote programs and data sets, allows simultaneous requests from multiple clients, and maintains multiple databases and data tables within them. It has built-in security for the data access. Several graphical user interfaces are available to view and/or edit data tables. In addition, FORTRAN interface and function libraries are written to communicate with image processing software. The data management overhead is inexpensive, requiring only narrow bandwidth from the network. It easily handles several data tables with over 1000 entries.  相似文献   

5.
Marla S  Singh VK 《In silico biology》2007,7(4-5):543-545
Recent sequencing of genomes of several microorganisms provides an opportunity to have access to huge volumes of data stored in various databases. This has resulted in the development of various computational and visualization tools to aid in retrieval and analysis of data. Development of user friendly genome data mapping and visualization tools facilitates researchers to closely examine various features of genes and make inferences from the displayed data efficiently. PGV - Prokaryotic Genome Viewer is a Java based web application tool capable of generating high quality interactive circular chromosome maps. With simple mouse roll over tasks on the interested region on the displayed map, the user is provided with features such as feature labeling, multi-fold zooming, image rotation and hyperlinking to different information resources. The tool is capable of instantaneously generating maps using user-supplied sequence data.  相似文献   

6.
Vibrio parahaemolyticus contains 11 rRNA operons each including one of six 16S-23S rRNA gene intergenic spacer classes differing in size and nucleotide sequence. Some of the spacer classes may differ between isolates. We observed that the differences in the spacers between isolates are generally in two spacer classes present in single copies in the genome, one class containing tRNA(Ala) and tRNA(Glu) and the other tRNA(Glu) exclusively. Moreover, these differences are due to indels located nearby their tRNA(Glu). Comparison of the nucleotide sequence between spacer classes suggests that intragenomic nonreciprocal recombination causes the size variations observed in the spacer regions of V. parahaemolyticus strains.  相似文献   

7.
It has been generally considered that FORTRAN is inferior to MUMPS and other computer languages in its facility to manipulate files and that it is not satisfactorily competent to handle databases. However, it is our finding that FORTRAN is the most widely used language in patient data management systems discussed in recently published papers. Therefore, it now seems appropriate to review and evaluate the validity of this general belief. The objective of this study is to establish a file management system in FORTRAN 77 for use in the construction of a clinical database. Comparative study is conducted on several problems associated with file manipulation, that is, time requirement for file access and access methods for data retrieval.  相似文献   

8.
msatcommander is a platform-independent program designed to search for microsatellite arrays, design primers, and tag primers using an automated routine. msatcommander accepts as input DNA sequence data in single-sequence or concatenated, fasta-formatted files. Search data and locus-specific primers are written to comma-separated value files for subsequent use in spreadsheet or database programs. Binary versions of the graphical interface for msatcommander are available for Apple OS X and Windows XP. Users of other operating systems may run the graphical interface version using the available source code, provided their environment supports at least Python 2.4, Biopython 1.43, and wxPython 2.8. msatcommander is available from http://code.google.com/p/msatcommander/.  相似文献   

9.
10.
We describe a package of DNA data handling and analysis programs designed for microcomputers. The package is convenient for immediate use by persons with little or no computer experience, and has been optimized by trial in our group for a year. By typing a single command, the user enters a system which asks questions or gives instructions in English. The system will enter, alter, and manage sequence files or a restriction enzyme library. It generates the reverse complement, translates, calculates codon usage, finds restriction sites, finds homologies with various degrees of mismatch, and graphs amino acid composition or base frequencies. A number of options for data handling and printing can be used to produce figures for publication. The package will be available in ANSI Standard FORTRAN for use with virtually any FORTRAN compiler.  相似文献   

11.
Macronuclei of Tetrahymena pyriformis contain approximately 200 copies of the genes for 25S and 17S ribosomal RNA (rRNA) per haploid genome. Micronuclei, however, contain only a few copies of the rRNA genes per haploid complement. Since macronuclei develop from, products of meiosis, fertilization and division of micronuclei, we suggested that the multiple copies of the rRNA genes in macronuclei are generated by amplification of the small number of genes in micronuclei (Yao et al., 1974). This process provides a simple mechanism for maintaining the homogeneity of the repeated rRNA genes. To test if amplification is a general mechanism operating on all repeated genes in Tetrahymena, we have examined the numbers of 5S RNA and tRNA genes in macro- and micronuclei. 5S RNA was purified by polyacrylamide gel electrophoresis and hybridized to saturation against macro- and micronuclear DNA. Approximately 0.013–0.014% of macronuclear DNA and about 0.009% of micronuclear DNA is complementary to 5S RNA. After correcting for the differences in the DNA sequence complexities between the two nuclei, we calculate that there are 300–350 5S genes per haploid macro- or micronuclear genome. From these data we conclude that there is little or no detectable amplification of the 5S genes in macronuclei relative to micronuclei. Similar studies using tRNA indicate that these genes are also highly repeated in both nuclei; about 800 genes are present per haploid genome. Thus, amplification from a small number of genes can be excluded as the mechanism for generating the repeated copies of the 5S and tRNA genes in Tetrahymena and it is likely that another, as yet unidentified, mechanism operates to maintain the homogeneity of these genes.  相似文献   

12.
Retrieving and organizing data from complete genomes is a time‐consuming task, even more so if the interest lies only in part of the genome (for nongenomic analysis). Furthermore, when comparing several genomes or genes, data retrieval has to be repeated multiple times. We present baca , a software for retrieving, organizing and visualizing multiple mitochondrial genomes. baca takes a GenBank query, retrieves all related genomes and generates multiple fasta files organized both by genomes and genes. A web‐based user interface and an interactive graphical map of all genomes with all genes are also provided. The program is available from http://cibio.up.pt/software/baca .  相似文献   

13.
以NCBI维护的一级数据库为数据源建立植物激素相关核酸和蛋白质二级数据库。将该二级数据库设计为基因、蛋白质和文献三部分, 编写软件从上述数据源中采集数据, 并以XML作为中间格式保存, 通过解析提交到二级数据库中并集成部分生物信息学工具软件, 初步实现了数据检索、统计分析、基于Web的本地化BLAST同源序列检索、序列的自动拼接以及蛋白质结构和功能位点的分析等功能。该二级数据库的构建为植物激素作用分子机理研究提供了高针对性的植物激素数据源和生物信息学辅助工具。  相似文献   

14.
We have improved an existing clone database management systemwritten in FORTRAN 77 and adapted it to our software environment.Improvements are that the database can be interrogated for anytype of information, not just keywords. Also, recombinant DNAconstructions can be represented in a simplified ‘shorthand’,whereafter a program assembles the full nucleotide sequencefrom the contributing fragments, which may be obtained fromnucleotide sequence databases. Another improvement is the replacementof the database manager by programs, running in batch to maintainthe databank and verify its consistency automatically. Finally,graphic extensions are written in Graphical Kernel System, todraw linear and circular restriction maps of recombinants. Besidesrestriction sites, recombinant features can be presented fromthe feature lines of recombinant database entries, or from thefeature tables of nucleotide databases. The clone database managementsystem is fully integrated into the sequence analysis softwarepackage from the Pasteur Institute, Paris, and is made accessiblethrough the same menu. As a result, recombinant DNA sequencescan directly be analysed by the sequence analysis programs. Received on March 17, 1986; accepted on June 16, 1986  相似文献   

15.
The Organelle Genome Database Project (GOBASE).   总被引:2,自引:1,他引:1       下载免费PDF全文
The taxonomically broad organelle genome database (GOBASE) organizes and integrates diverse data related to organelles (mitochondria and chloroplasts). The current version of GOBASE focuses on the mitochondrial subset of data and contains molecular sequences, RNA secondary structures and genetic maps, as well as taxonomic information for all eukaryotic species represented. The database has been designed so that complex biological queries, especially ones posed in a comparative genomics context, are supported. GOBASE has been implemented as a relational database with a web-based user interface (http://megasun.bch.umontreal.ca/gobase/gobas e.html ). Custom software tools have been written in house to assist in the population of the database, data validation, nomenclature standardization and front-end design. The database is fully operational and publicly accessible via the World Wide Web, allowing interactive browsing, sophisticated searching and easy downloading of data.  相似文献   

16.
We present a software package, Genquire, that allows visualization, querying, hand editing, and de novo markup of complete or partially annotated genomes. The system is written in Perl/Tk and uses, where possible, existing BioPerl data models and methods for representation and manipulation of the sequence and annotation objects. An adaptor API is provided to allow Genquire to display a wide range of databases and flat files, and a plugins API provides an interface to other sequence analysis software. AVAILABILITY: Genquire v3.03 is open-source software. The code is available for download and/or contribution at http://www.bioinformatics.org/Genquire  相似文献   

17.
CSDBase (http://www.chemie.uni-marburg.de/~csdbase/) is an interactive Internet-embedded research platform providing detailed information on proteins containing the cold shock domain (CSD). It consists of two separated database cores, one dedicated to CSD protein information, and one to provide a powerful resource to relevant literature with emphasis on the bacterial cold shock response. In addition to detailed protein information and useful cross links to other web sites, CSDBase contains computer-generated CSD structure models for most CSD-containing protein sequences available at NCBI non-redundant protein database at the time of CSDBase establishment. These models were calculated on the basis of known crystal and/or NMR structures using SWISS-MODEL and can be downloaded as PDB structure coordinate files for viewing and for manipulation with other software tools. CSDBase will be regularly updated and is organized in a compact form providing user friendly interfaces to both database cores which allow for easy data retrieval.  相似文献   

18.
RepeatAround is a Windows based software tool designed to find "direct repeats", "inverted repeats", "mirror repeats" and "complementary repeats", from 3 to 64bp length, in circular genomes. It processes input files directly extracted from GenBank database, providing visualisation of the repeats location in the genomic structure, so that for instance, in most mtDNAs the user can check if the repeats are located in coding or non-coding region (and in the first case in which gene), and how far apart the repeat pair(s) are. Besides the visual tool, it provides other outputs in a spreadsheet containing information on the number and location of the repeats, facilitating graphic analyses. Several genomes can be inputed simultaneously, for phylogenetic comparison purposes. Other capabilities of the software are the generation of random circular genomes, for statistical evaluation of comparison between observed repeats distributions with their shuffled counterparts, as well as the search for specific motifs, allowing an easy confirmation of repeats flanking a newly detected rearrangement. As an example of the programme's applications we analysed the Direct Repeats distribution in a large human mtDNA database. Results showed that Direct Repeats, even the larger ones, are evenly distributed among the human mtDNA haplogroups, enabling us to state that, based only on the repetitive motifs, no haplogroup is particularly more or less prone to mtDNA macrodeletions.  相似文献   

19.
Clostridium difficile is a major spore-forming environmental pathogen that causes serious health problems in patients undergoing antibiotic therapy. Consequently, reliable and sensitive methods for typing individual strains are required for epidemiological and environmental studies. Ribotyping is generally considered the best method, but it fails to account for sequence diversity which might exist in intergenic 16S-23S rRNA spacer regions (ISRs) within and among strains of this organism. Therefore, this study was undertaken to compare the sequence of each individual ISR in five strains of C. difficile to explore the extent of this diversity and see whether such information might provide the basis for more sensitive and discriminatory strain typing methods. After targeted PCR amplification, cloning, and sequencing, the diversity of the ISRs was used as a measure of rRNA operon copy number. In C. difficile strains 630, ATCC 43593, A, and B, 11, 11, 7, and 8 ISR length variants, respectively, were found (containing different combinations of sequence groups [i to xiii]), suggesting 11, 11, 7, and 8 rrn copies in the respective strains. Many ISRs of the same length differed markedly in their sequences, and some of these were restricted in occurrence to a single strain. Most of these ISRs did not contain any tRNA genes, and only single copies of the tRNA(Ala) gene were found in those that did. The presence of ISR sequence groups (i to xiii) varied between strains, with some found in one, two, three, four, or all five strains. We conclude that the intergenic 16S-23S rRNA spacer regions showed a high degree of diversity, not only among the rrn operons in different strains and different rrn copies in a single strain but also among ISRs of the same length. It appears that C. difficile ISRs vary more at the inter- and intragenic levels than those of other species as determined by empirical comparison of sequences. The precise characterization of these sequences has demonstrated a high level of mosaic sequence block rearrangements that are present or absent in multiple strain-variable rrn copies within and between five different strains of C. difficile.  相似文献   

20.
DNA barcoding is a promising approach to the diagnosis of biological diversity in which DNA sequences serve as the primary key for information retrieval. Most existing software for evolutionary analysis of DNA sequences was designed for phylogenetic analyses and, hence, those algorithms do not offer appropriate solutions for the rapid, but precise analyses needed for DNA barcoding, and are also unable to process the often large comparative datasets. We developed a flexible software tool for DNA taxonomy, named TaxI. This program calculates sequence divergences between a query sequence (taxon to be barcoded) and each sequence of a dataset of reference sequences defined by the user. Because the analysis is based on separate pairwise alignments this software is also able to work with sequences characterized by multiple insertions and deletions that are difficult to align in large sequence sets (i.e. thousands of sequences) by multiple alignment algorithms because of computational restrictions. Here, we demonstrate the utility of this approach with two datasets of fish larvae and juveniles from Lake Constance and juvenile land snails under different models of sequence evolution. Sets of ribosomal 16S rRNA sequences, characterized by multiple indels, performed as good as or better than cox1 sequence sets in assigning sequences to species, demonstrating the suitability of rRNA genes for DNA barcoding.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号