共查询到20条相似文献,搜索用时 15 毫秒
1.
Mediante is a MIAME-compliant microarray data manager that links together annotations and experimental data. Developed as a J2EE three-tier application, Mediante integrates a management system for production of long oligonucleotide microarrays, an experimental data repository suitable for home made or commercial microarrays, and a user interface dedicated to the management of microarrays projects. Several tools allow quality control of hybridizations and submission of validated data to public repositories. AVAILABILITY: http://www.microarray.fr. SUPPLEMENTARY INFORMATION: http://www.microarray.fr/SP/lebrigand2007/ 相似文献
2.
3.
Error-correcting barcoded primers for pyrosequencing hundreds of samples in multiplex 总被引:2,自引:0,他引:2
We constructed error-correcting DNA barcodes that allow one run of a massively parallel pyrosequencer to process up to 1,544 samples simultaneously. Using these barcodes we processed bacterial 16S rRNA gene sequences representing microbial communities in 286 environmental samples, corrected 92% of sample assignment errors, and thus characterized nearly as many 16S rRNA genes as have been sequenced to date by Sanger sequencing. 相似文献
4.
We describe a query-based web-accessible system (www.neurogadgets.com/bws.php) for facilitating comparative microbial genomics. A variety of query pages are available, each with numerous options, that allow a biologist to pose relevant questions of genomic data. We illustrate with a characterization of species-specific protein-coding genes (so-called "ORFans"), finding that they are on average smaller, faster evolving, and less G+C-rich, and that they encode proteins more basic in their predicted isoelectric point, compared with non-species-specific genes. Using a dual-threshold approach, we conclude that these are characteristics of true species-specific genes, rather than artifacts of mis-annotation. 相似文献
5.
Background
As Next-Generation Sequencing data becomes available, existing hardware environments do not provide sufficient storage space and computational power to store and process the data due to their enormous size. This is and will be a frequent problem that is encountered everyday by researchers who are working on genetic data. There are some options available for compressing and storing such data, such as general-purpose compression software, PBAT/PLINK binary format, etc. However, these currently available methods either do not offer sufficient compression rates, or require a great amount of CPU time for decompression and loading every time the data is accessed.Results
Here, we propose a novel and simple algorithm for storing such sequencing data. We show that, the compression factor of the algorithm ranges from 16 to several hundreds, which potentially allows SNP data of hundreds of Gigabytes to be stored in hundreds of Megabytes. We provide a C++ implementation of the algorithm, which supports direct loading and parallel loading of the compressed format without requiring extra time for decompression. By applying the algorithm to simulated and real datasets, we show that the algorithm gives greater compression rate than the commonly used compression methods, and the data-loading process takes less time. Also, The C++ library provides direct-data-retrieving functions, which allows the compressed information to be easily accessed by other C++ programs.Conclusions
The SpeedGene algorithm enables the storage and the analysis of next generation sequencing data in current hardware environment, making system upgrades unnecessary. 相似文献6.
This paper introduces a new modular control design method for a cell controller with integrated error handling. To make the complexity of the cell controller manageable, its control logic is separated into two parts: resource allocation control and operation control. To create the operation control not only quickly but correctly, modular operation blocks integrated with error handling are developed. An algorithm automatically generates the operation control. The operation control created by the proposed method is proved to have desired control behaviors. The method is applied to an example system. 相似文献
7.
Background
For effective exposition of biological information, especially with regard to analysis of large-scale data types, researchers need immediate access to multiple categorical knowledge bases and need summary information presented to them on collections of genes, as opposed to the typical one gene at a time. 相似文献8.
Isaak Y Tecle Jeremy D Edwards Naama Menda Chiedozie Egesi Ismail Y Rabbi Peter Kulakow Robert Kawuki Jean-Luc Jannink Lukas A Mueller 《BMC bioinformatics》2014,15(1)
Background
Genomic selection (GS) promises to improve accuracy in estimating breeding values and genetic gain for quantitative traits compared to traditional breeding methods. Its reliance on high-throughput genome-wide markers and statistical complexity, however, is a serious challenge in data management, analysis, and sharing. A bioinformatics infrastructure for data storage and access, and user-friendly web-based tool for analysis and sharing output is needed to make GS more practical for breeders.Results
We have developed a web-based tool, called solGS, for predicting genomic estimated breeding values (GEBVs) of individuals, using a Ridge-Regression Best Linear Unbiased Predictor (RR-BLUP) model. It has an intuitive web-interface for selecting a training population for modeling and estimating genomic estimated breeding values of selection candidates. It estimates phenotypic correlation and heritability of traits and selection indices of individuals. Raw data is stored in a generic database schema, Chado Natural Diversity, co-developed by multiple database groups. Analysis output is graphically visualized and can be interactively explored online or downloaded in text format. An instance of its implementation can be accessed at the NEXTGEN Cassava breeding database, http://cassavabase.org/solgs.Conclusions
solGS enables breeders to store raw data and estimate GEBVs of individuals online, in an intuitive and interactive workflow. It can be adapted to any breeding program.Electronic supplementary material
The online version of this article (doi:10.1186/s12859-014-0398-7) contains supplementary material, which is available to authorized users. 相似文献9.
A compact new computer program for handling nucleic acid sequence data is presented. It consists of a number of different subsets, which may be used according to a given code system. The program is designed for the determination of restriction enzyme and other recognition sites in correlation with translation patterns, and allows tabulation of codon frequencies and protein molecular weights within specified gene boundaries. The program is especially designed for detection of overlapping genes. The language, is FORTRAN and thus the program may be used on small computers; it may also be used without any prior computer experience. Copies are available on request. 相似文献
10.
11.
12.
The authors describe an easy-to-use barcode-based animal tracking system that has improved record keeping and data retrieval and proved instrumental in the containment of an outbreak. 相似文献
13.
14.
Bioactive peptide database (BioPD) is a web-based knowledge base that contains more than 1100 protein sequences from human, mouse and rat, which are putative or are known to be bioactive peptides. In addition to peptide sequences and the annotation, the database also contains gene sequences with annotation, protein interaction and disease data related to the peptides. Each entry has as many references as possible to support the information represented. BioPD consists of six parts: PROTEIN, GENE, DISEASE, LINKS, INTERACTION, and REFERENCE. The database is searchable through keyword, gene and protein name, receptor name, etc. The links to PDB, InterPro, Pfam, OMIM, etc. are provided in each entry. Thus BioPD is formed as an information center for the bioactive peptide and serves as a gateway for exploration of bioactive peptides. The database can be accessed at http://biopd.bjmu.edu.cn. 相似文献
15.
RNA molecules play vital informational, structural, and functional roles in molecular biology, making them ideal targets for synthetic biology. However, several challenges remain for engineering novel allosteric RNA molecules, and the development of efficient computational design techniques is vitally needed. Here we describe the development of Allosteric RNA Designer (ARDesigner), a user-friendly and freely available web-based system for allosteric RNA design that incorporates mutational robustness in the design process. The system output includes detailed design information in a graphical HTML format. We used ARDesigner to engineer a temperature-sensitive AR, and found that the resulting design satisfied the prescribed properties/input. ARDesigner provides a simple means for researchers to design allosteric RNAs with specific properties. With its versatile framework and possibilities for further enhancement, ARDesigner may serve as a useful tool for synthetic biologists and therapeutic design. ARDesigner and its executable version are freely available at http://biotech.bmi.ac.cn/ARDesigner. 相似文献
16.
Background
Phylogenetic analysis of large, multiple-gene datasets, assembled from public sequence databases, is rapidly becoming a popular way to approach difficult phylogenetic problems. Supermatrices (concatenated multiple sequence alignments of multiple genes) can yield more phylogenetic signal than individual genes. However, manually assembling such datasets for a large taxonomic group is time-consuming and error-prone. Additionally, sequence curation, alignment and assessment of the results of phylogenetic analysis are made particularly difficult by the potential for a given gene in a given species to be unrepresented, or to be represented by multiple or partial sequences. We have developed a software package, TaxMan, that largely automates the processes of sequence acquisition, consensus building, alignment and taxon selection to facilitate this type of phylogenetic study. 相似文献17.
Workshop on barcoded DNA: application to rotifer phylogeny,evolution, and systematics 总被引:1,自引:0,他引:1
C. William BirkyJr. 《Hydrobiologia》2007,593(1):175-183
DNA barcoding is the use of segments of gene sequences to assign individual organisms to species. Thus it can be used to define
species and to identify specimens. Barcoding has been applied as an aid to systematics with little controversy in both monogonont
and bdelloid rotifers, and also in environmental sequencing projects designed to determine the diversity of microscopic organisms.
In contrast, a great deal of controversy has arisen over the creation of the Consortium for the Barcode of Life, a major initiative
to barcode all the species in several major groups of animals, with the long-range goal of barcoding all species of organisms.
This is a very brief review of DNA barcoding, especially as applied to rotifers, and a summary of the results of a workshop
held at the 11th International Workshop on Rotifera.
Guest editors: S. S. S. Sarma, R. D. Gulati, R. L. Wallace, S. Nandini, H. J. Dumont and R. Rico-Martínez
Advances in Rotifer Research 相似文献
18.
DANIEL L. ROWLEY JONATHAN A. CODDINGTON MICHAEL W. GATES ALLEN L. NORRBOM RONALD A. OCHOA NATALIA J. VANDENBERG MATTHEW H. GREENSTONE 《Molecular ecology resources》2007,7(6):915-924
Morphology‐based keys support accurate identification of many taxa. However, identification can be difficult for taxa that are either not well studied, very small, members of cryptic species complexes, or represented by immature stages. For such cases, DNA barcodes may provide diagnostic characters. Ecologists and evolutionary biologists deposit museum vouchers to document the species studied in their research. If DNA barcodes are to be used for identification, then both the DNA and the specimen from which it was extracted should be vouchered. We describe a protocol for the nondestructive extraction of DNA from terrestrial arthropods, using as examples members of the orders Acarina, Araneae, Coleoptera, Diptera, and Hymenoptera chosen to represent the ranges in size, overall sclerotization, and delicacy of key morphological characters in the group. We successfully extracted sequenceable DNA from all species after 1–4 h of immersion in extraction buffer. The extracted carcasses, processed and imaged using protocols standard for the taxon, were distinguishable from closely related species, and adequate as morphological vouchers. We provide links from the carcasses and DNA vouchers to image (MorphBank) and sequence (GenBank) databases. 相似文献
19.
20.
Niall J Lennon Robert E Lintner Scott Anderson Pablo Alvarez Andrew Barry William Brockman Riza Daza Rachel L Erlich Georgia Giannoukos Lisa Green Andrew Hollinger Cindi A Hoover David B Jaffe Frank Juhn Danielle McCarthy Danielle Perrin Karen Ponchner Taryn L Powers Kamran Rizzolo Dana Robbins Elizabeth Ryan Carsten Russ Todd Sparrow John Stalker Scott Steelman Michael Weiand Andrew Zimmer Matthew R Henn Chad Nusbaum Robert Nicol 《Genome biology》2010,11(2):1-9
We present an automated, high throughput library construction process for 454 technology. Sample handling errors and cross-contamination are minimized via end-to-end barcoding of plasticware, along with molecular DNA barcoding of constructs. Automation-friendly magnetic bead-based size selection and cleanup steps have been devised, eliminating major bottlenecks and significant sources of error. Using this methodology, one technician can create 96 sequence-ready 454 libraries in 2 days, a dramatic improvement over the standard method. 相似文献