首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Mediante is a MIAME-compliant microarray data manager that links together annotations and experimental data. Developed as a J2EE three-tier application, Mediante integrates a management system for production of long oligonucleotide microarrays, an experimental data repository suitable for home made or commercial microarrays, and a user interface dedicated to the management of microarrays projects. Several tools allow quality control of hybridizations and submission of validated data to public repositories. AVAILABILITY: http://www.microarray.fr. SUPPLEMENTARY INFORMATION: http://www.microarray.fr/SP/lebrigand2007/  相似文献   

2.
3.
We constructed error-correcting DNA barcodes that allow one run of a massively parallel pyrosequencer to process up to 1,544 samples simultaneously. Using these barcodes we processed bacterial 16S rRNA gene sequences representing microbial communities in 286 environmental samples, corrected 92% of sample assignment errors, and thus characterized nearly as many 16S rRNA genes as have been sequenced to date by Sanger sequencing.  相似文献   

4.
We describe a query-based web-accessible system (www.neurogadgets.com/bws.php) for facilitating comparative microbial genomics. A variety of query pages are available, each with numerous options, that allow a biologist to pose relevant questions of genomic data. We illustrate with a characterization of species-specific protein-coding genes (so-called "ORFans"), finding that they are on average smaller, faster evolving, and less G+C-rich, and that they encode proteins more basic in their predicted isoelectric point, compared with non-species-specific genes. Using a dual-threshold approach, we conclude that these are characteristics of true species-specific genes, rather than artifacts of mis-annotation.  相似文献   

5.

Background

As Next-Generation Sequencing data becomes available, existing hardware environments do not provide sufficient storage space and computational power to store and process the data due to their enormous size. This is and will be a frequent problem that is encountered everyday by researchers who are working on genetic data. There are some options available for compressing and storing such data, such as general-purpose compression software, PBAT/PLINK binary format, etc. However, these currently available methods either do not offer sufficient compression rates, or require a great amount of CPU time for decompression and loading every time the data is accessed.

Results

Here, we propose a novel and simple algorithm for storing such sequencing data. We show that, the compression factor of the algorithm ranges from 16 to several hundreds, which potentially allows SNP data of hundreds of Gigabytes to be stored in hundreds of Megabytes. We provide a C++ implementation of the algorithm, which supports direct loading and parallel loading of the compressed format without requiring extra time for decompression. By applying the algorithm to simulated and real datasets, we show that the algorithm gives greater compression rate than the commonly used compression methods, and the data-loading process takes less time. Also, The C++ library provides direct-data-retrieving functions, which allows the compressed information to be easily accessed by other C++ programs.

Conclusions

The SpeedGene algorithm enables the storage and the analysis of next generation sequencing data in current hardware environment, making system upgrades unnecessary.  相似文献   

6.
This paper introduces a new modular control design method for a cell controller with integrated error handling. To make the complexity of the cell controller manageable, its control logic is separated into two parts: resource allocation control and operation control. To create the operation control not only quickly but correctly, modular operation blocks integrated with error handling are developed. An algorithm automatically generates the operation control. The operation control created by the proposed method is proved to have desired control behaviors. The method is applied to an example system.  相似文献   

7.

Background  

For effective exposition of biological information, especially with regard to analysis of large-scale data types, researchers need immediate access to multiple categorical knowledge bases and need summary information presented to them on collections of genes, as opposed to the typical one gene at a time.  相似文献   

8.

Background

Genomic selection (GS) promises to improve accuracy in estimating breeding values and genetic gain for quantitative traits compared to traditional breeding methods. Its reliance on high-throughput genome-wide markers and statistical complexity, however, is a serious challenge in data management, analysis, and sharing. A bioinformatics infrastructure for data storage and access, and user-friendly web-based tool for analysis and sharing output is needed to make GS more practical for breeders.

Results

We have developed a web-based tool, called solGS, for predicting genomic estimated breeding values (GEBVs) of individuals, using a Ridge-Regression Best Linear Unbiased Predictor (RR-BLUP) model. It has an intuitive web-interface for selecting a training population for modeling and estimating genomic estimated breeding values of selection candidates. It estimates phenotypic correlation and heritability of traits and selection indices of individuals. Raw data is stored in a generic database schema, Chado Natural Diversity, co-developed by multiple database groups. Analysis output is graphically visualized and can be interactively explored online or downloaded in text format. An instance of its implementation can be accessed at the NEXTGEN Cassava breeding database, http://cassavabase.org/solgs.

Conclusions

solGS enables breeders to store raw data and estimate GEBVs of individuals online, in an intuitive and interactive workflow. It can be adapted to any breeding program.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-014-0398-7) contains supplementary material, which is available to authorized users.  相似文献   

9.
A flexible new computer program for handling DNA sequence data.   总被引:7,自引:2,他引:7       下载免费PDF全文
A compact new computer program for handling nucleic acid sequence data is presented. It consists of a number of different subsets, which may be used according to a given code system. The program is designed for the determination of restriction enzyme and other recognition sites in correlation with translation patterns, and allows tabulation of codon frequencies and protein molecular weights within specified gene boundaries. The program is especially designed for detection of overlapping genes. The language, is FORTRAN and thus the program may be used on small computers; it may also be used without any prior computer experience. Copies are available on request.  相似文献   

10.
11.
12.
The authors describe an easy-to-use barcode-based animal tracking system that has improved record keeping and data retrieval and proved instrumental in the containment of an outbreak.  相似文献   

13.
14.
Shi L  Zhang Q  Rui W  Lu M  Jing X  Shang T  Tang J 《Regulatory peptides》2004,120(1-3):1-3
Bioactive peptide database (BioPD) is a web-based knowledge base that contains more than 1100 protein sequences from human, mouse and rat, which are putative or are known to be bioactive peptides. In addition to peptide sequences and the annotation, the database also contains gene sequences with annotation, protein interaction and disease data related to the peptides. Each entry has as many references as possible to support the information represented. BioPD consists of six parts: PROTEIN, GENE, DISEASE, LINKS, INTERACTION, and REFERENCE. The database is searchable through keyword, gene and protein name, receptor name, etc. The links to PDB, InterPro, Pfam, OMIM, etc. are provided in each entry. Thus BioPD is formed as an information center for the bioactive peptide and serves as a gateway for exploration of bioactive peptides. The database can be accessed at http://biopd.bjmu.edu.cn.  相似文献   

15.
Shu W  Liu M  Chen H  Bo X  Wang S 《Journal of biotechnology》2010,150(4):466-473
RNA molecules play vital informational, structural, and functional roles in molecular biology, making them ideal targets for synthetic biology. However, several challenges remain for engineering novel allosteric RNA molecules, and the development of efficient computational design techniques is vitally needed. Here we describe the development of Allosteric RNA Designer (ARDesigner), a user-friendly and freely available web-based system for allosteric RNA design that incorporates mutational robustness in the design process. The system output includes detailed design information in a graphical HTML format. We used ARDesigner to engineer a temperature-sensitive AR, and found that the resulting design satisfied the prescribed properties/input. ARDesigner provides a simple means for researchers to design allosteric RNAs with specific properties. With its versatile framework and possibilities for further enhancement, ARDesigner may serve as a useful tool for synthetic biologists and therapeutic design. ARDesigner and its executable version are freely available at http://biotech.bmi.ac.cn/ARDesigner.  相似文献   

16.

Background  

Phylogenetic analysis of large, multiple-gene datasets, assembled from public sequence databases, is rapidly becoming a popular way to approach difficult phylogenetic problems. Supermatrices (concatenated multiple sequence alignments of multiple genes) can yield more phylogenetic signal than individual genes. However, manually assembling such datasets for a large taxonomic group is time-consuming and error-prone. Additionally, sequence curation, alignment and assessment of the results of phylogenetic analysis are made particularly difficult by the potential for a given gene in a given species to be unrepresented, or to be represented by multiple or partial sequences. We have developed a software package, TaxMan, that largely automates the processes of sequence acquisition, consensus building, alignment and taxon selection to facilitate this type of phylogenetic study.  相似文献   

17.
DNA barcoding is the use of segments of gene sequences to assign individual organisms to species. Thus it can be used to define species and to identify specimens. Barcoding has been applied as an aid to systematics with little controversy in both monogonont and bdelloid rotifers, and also in environmental sequencing projects designed to determine the diversity of microscopic organisms. In contrast, a great deal of controversy has arisen over the creation of the Consortium for the Barcode of Life, a major initiative to barcode all the species in several major groups of animals, with the long-range goal of barcoding all species of organisms. This is a very brief review of DNA barcoding, especially as applied to rotifers, and a summary of the results of a workshop held at the 11th International Workshop on Rotifera. Guest editors: S. S. S. Sarma, R. D. Gulati, R. L. Wallace, S. Nandini, H. J. Dumont and R. Rico-Martínez Advances in Rotifer Research  相似文献   

18.
Morphology‐based keys support accurate identification of many taxa. However, identification can be difficult for taxa that are either not well studied, very small, members of cryptic species complexes, or represented by immature stages. For such cases, DNA barcodes may provide diagnostic characters. Ecologists and evolutionary biologists deposit museum vouchers to document the species studied in their research. If DNA barcodes are to be used for identification, then both the DNA and the specimen from which it was extracted should be vouchered. We describe a protocol for the nondestructive extraction of DNA from terrestrial arthropods, using as examples members of the orders Acarina, Araneae, Coleoptera, Diptera, and Hymenoptera chosen to represent the ranges in size, overall sclerotization, and delicacy of key morphological characters in the group. We successfully extracted sequenceable DNA from all species after 1–4 h of immersion in extraction buffer. The extracted carcasses, processed and imaged using protocols standard for the taxon, were distinguishable from closely related species, and adequate as morphological vouchers. We provide links from the carcasses and DNA vouchers to image (MorphBank) and sequence (GenBank) databases.  相似文献   

19.
20.
We present an automated, high throughput library construction process for 454 technology. Sample handling errors and cross-contamination are minimized via end-to-end barcoding of plasticware, along with molecular DNA barcoding of constructs. Automation-friendly magnetic bead-based size selection and cleanup steps have been devised, eliminating major bottlenecks and significant sources of error. Using this methodology, one technician can create 96 sequence-ready 454 libraries in 2 days, a dramatic improvement over the standard method.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号