首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到10条相似文献,搜索用时 156 毫秒
1.

Background  

New "next generation" DNA sequencing technologies offer individual researchers the ability to rapidly generate large amounts of genome sequence data at dramatically reduced costs. As a result, a need has arisen for new software tools for storage, management and analysis of genome sequence data. Although bioinformatic tools are available for the analysis and management of genome sequences, limitations still remain. For example, restrictions on the submission of data and use of these tools may be imposed, thereby making them unsuitable for sequencing projects that need to remain in-house or proprietary during their initial stages. Furthermore, the availability and use of next generation sequencing in industrial, governmental and academic environments requires biologist to have access to computational support for the curation and analysis of the data generated; however, this type of support is not always immediately available.  相似文献   

2.
High‐throughput sequencing makes it possible to evaluate thousands of genetic markers across genomes and populations. Reduced‐representation sequencing approaches, like double‐digest restriction site‐associated DNA sequencing (ddRADseq), are frequently applied to screen for genetic variation. In particular in nonmodel organisms where whole‐genome sequencing is not yet feasible, ddRADseq has become popular as it allows genomewide assessment of variation patterns even in the absence of other genomic resources. However, while many tools are available for the analysis of ddRADseq data, few options exist to simulate ddRADseq data in order to evaluate the accuracy of downstream tools. The available tools either focus on the optimization of ddRAD experiment design or do not provide the information necessary for a detailed evaluation of different ddRAD analysis tools. For this task, a ground truth, that is, the underlying information of all effects in the data set, is required. Therefore, we here present ddrage , the ddRA D Data Set Ge nerator, that allows both developers and users to evaluate their ddRAD analysis software. ddrage allows the user to adjust many parameters such as coverage and rates of mutations, sequencing errors or allelic dropouts, in order to generate a realistic simulated ddRADseq data set for given experimental scenarios and organisms. The simulated reads can be easily processed with available analysis software such as stacks or pyrad and evaluated against the underlying parameters used to generate the data to gauge the impact of different parameter values used during downstream data processing.  相似文献   

3.
NEBcutter, version 1.0, is a program available via a web server (http://tools.neb.com/NEBcutter) that will accept an input DNA sequence and produce a comprehensive report of the restriction enzymes that will cleave the sequence. It produces a variety of outputs including restriction enzyme maps, theoretical digests and links into the restriction enzyme database, REBASE (http://www.neb.com/rebase). Importantly, its table of recognition sites is updated daily from REBASE and it marks all sites that are potentially affected by DNA methylation (Dam, Dcm, etc.). Many options exist to choose the enzymes used for digestion, including all known specificities, subsets of those that are commercially available or sets of enzymes that produce compatible termini.  相似文献   

4.
A computer program package for the storage, change, and comparison of restriction maps is described. The programs are intended to detect overlaps between relatively short (about 10-40 kb; abbreviations ref.2) maps and to merge the overlapping fragments into large restriction maps. They run on a 16-bit-microcomputer with limited memory and addressing capability. Due to the restricted reliability of restriction maps compared with DNA sequence data a particular storage method was used. The source code of the programs is freely available (+).  相似文献   

5.
We use the extensive published information describing the genome of Escherichia coli and new restriction map alignment software to align DNA sequence, genetic, and physical maps. Restriction map alignment software is used which considers restriction maps as strings analogous to DNA or protein sequences except that two values, enzyme name and DNA base address, are associated with each position on the string. The resulting alignments reveal a nearly linear relationship between the physical and genetic maps of the E. coli chromosome. Physical map comparisons with the 1976, 1980, and 1983 genetic maps demonstrate a better fit with the more recent maps. The results of these alignments are genomic kilobase coordinates, orientation and rank of the alignment that best fits the genetic data. A statistical measure based on extreme value distribution is applied to the alignments. Additional computer analyses allow us to estimate the accuracy of the published E. coli genomic restriction map, simulate rearrangements of the bacterial chromosome, and search for repetitive DNA. The procedures we used are general enough to be applicable to other genome mapping projects.  相似文献   

6.
MAXAMIZE. A DNA sequencing strategy advisor.   总被引:2,自引:1,他引:1       下载免费PDF全文
The MAXAMIZE advisory system determines from user-provided restriction maps an optimal strategy to do nucleotide sequencing by methods involving end-labeled fragments. The maps may be either simple linear restriction maps of fragments or complex circular maps including restriction sites of a vector. The whole system is interactive and is written in the Genetic English language provided by the GENESIS System, a molecular genetics knowledge representation and manipulation package. In addition, MAXAMIZE provides bookkeeping facilities for sequencing and offers advise on how to verify the newly obtained sequence data.  相似文献   

7.
Physical mapping has been rediscovered as an important component of large-scale sequencing projects. Restriction maps provide landmark sequences at defined intervals, and high-resolution restriction maps can be assembled from ensembles of single molecules by optical means. Such optical maps can be constructed from both large-insert clones and genomic DNA, and are used as a scaffold for accurately aligning sequence contigs generated by shotgun sequencing.  相似文献   

8.
M J Kelly 《Génome》1989,31(2):1027-1033
Mapping and sequencing the human genome will generate large amounts of data, which must be sorted, analyzed, and stored for rapid retrieval to complete this enormous task. Computers and their software programs provide the most important tool to the molecular biologist today. A discussion of current capabilities and future needs in computer hardware and software for the human genome project is the topic of this paper. The use of computer programs to generate restriction maps, manage clone libraries, manage sequence projects, and generate consensus sequences is presented. The use of computers to communicate useful information rapidly to scientific colleagues is also mentioned. The role of both GenBank and BIONET is central to the dissemination and analysis of sequence information. The capabilities of electronic communication worldwide for assisting this project is available on the BIONET National Computer Resource, using existing networks.  相似文献   

9.
Yeast mitochondrial DNA-pBR322 recombinant DNA molecules known to contain tRNA genes from a tRNA rich region of the yeast genome were used as a source of DNA for restriction mapping and tRNA gene sequence analysis. We report here restriction maps of two segments of yeast mitochondrial DNA and the sequence of mitochondrial genes coding for tRNAglyGGR and tRNAvalGUR. Both genes are flanked by A + T rich DNA and neither has an intervening sequence nor codes for a 3' CCA end. The tRNA structures deduced from the genes have the usual cloverleaf structures and invariant nucleotides. This combination of DNA sequencing and restriction mapping has enabled us to determine that the tRNAvalGUR and a previously sequenced tRNA, the tRNApheUUY are transcribed from the same strand of DNA.  相似文献   

10.
The EMBL Nucleotide Sequence Database.   总被引:6,自引:1,他引:5  
The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl.html) constitutes Europe's primary nucleotide sequence resource. Main sources for DNA and RNA sequences are direct submissions from individual researchers, genome sequencing projects and patent applications. While automatic procedures allow incorporation of sequence data from large-scale genome sequencing centres and from the European Patent Office (EPO), the preferred submission tool for individual submitters is Webin (WWW). Through all stages, dataflow is monitored by EBI biologists communicating with the sequencing groups. In collaboration with DDBJ and GenBank the database is produced, maintained and distributed at the European Bioinformatics Institute (EBI). Database releases are produced quarterly and are distributed on CD-ROM. Network services allow access to the most up-to-date data collection via Internet and World Wide Web interface. EBI's Sequence Retrieval System (SRS) is a Network Browser for Databanks in Molecular Biology, integrating and linking the main nucleotide and protein databases, plus many specialised databases. For sequence similarity searching a variety of tools (e.g. Blitz, Fasta, Blast etc) are available for external users to compare their own sequences against the most currently available data in the EMBL Nucleotide Sequence Database and SWISS-PROT.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号