首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.

Background

Meaningful exchange of microarray data is currently difficult because it is rare that published data provide sufficient information depth or are even in the same format from one publication to another. Only when data can be easily exchanged will the entire biological community be able to derive the full benefit from such microarray studies.

Results

To this end we have developed three key ingredients towards standardizing the storage and exchange of microarray data. First, we have created a minimal information for the annotation of a microarray experiment (MIAME)-compliant conceptualization of microarray experiments modeled using the unified modeling language (UML) named MAGE-OM (microarray gene expression object model). Second, we have translated MAGE-OM into an XML-based data format, MAGE-ML, to facilitate the exchange of data. Third, some of us are now using MAGE (or its progenitors) in data production settings. Finally, we have developed a freely available software tool kit (MAGE-STK) that eases the integration of MAGE-ML into end users' systems.

Conclusions

MAGE will help microarray data producers and users to exchange information by providing a common platform for data exchange, and MAGE-STK will make the adoption of MAGE easier.  相似文献   

2.
While minimum information about a microarray experiment (MIAME) standards have helped to increase the value of the microarray data deposited into public databases like ArrayExpress and Gene Expression Omnibus (GEO), limited means have been available to assess the quality of this data or to identify the procedures used to normalize and transform raw data. The EMERALD FP6 Coordination Action was designed to deliver approaches to assess and enhance the overall quality of microarray data and to disseminate these approaches to the microarray community through an extensive series of workshops, tutorials, and symposia. Tools were developed for assessing data quality and used to demonstrate how the removal of poor-quality data could improve the power of statistical analyses and facilitate analysis of multiple joint microarray data sets. These quality metrics tools have been disseminated through publications and through the software package arrayQualityMetrics. Within the framework provided by the Ontology of Biomedical Investigations, ontology was developed to describe data transformations, and software ontology was developed for gene expression analysis software. In addition, the consortium has advocated for the development and use of external reference standards in microarray hybridizations and created the Molecular Methods (MolMeth) database, which provides a central source for methods and protocols focusing on microarray-based technologies.  相似文献   

3.
DNA base pair resolution by single molecule force spectroscopy   总被引:4,自引:1,他引:3       下载免费PDF全文
The forces that hold complementary strands of DNA together in a double helix, and the role of base mismatches in these, are examined by single molecule force spectroscopy using an atomic force microscope (AFM). These forces are important when considering the binding of proteins to DNA, since these proteins often mechanically stretch the DNA during their action. In AFM measurement of forces, there is an inherent instrumental limitation that makes it difficult to compare results from different experimental runs. This is circumvented by using an oligonucleotide microarray, which allowed a direct comparison of the forces between perfectly matched short oligonucleotides and those containing a single or double mismatch. Through this greatly increased sensitivity, the force contribution of a single AT base pair was derived. The results indicate that the contribution to forces from the stacking interactions is more important than that from hydrogen bonding.  相似文献   

4.
《Epigenetics》2013,8(2):89-96
The mouse is an established model organism for the study of genomic imprinting. Mice with genetic material originating from only one parent (e.g., mice with uniparental chromosomal duplications) or gene mutations leading to epigenetic deficiencies have proven to be particularly useful tools. In the process of our studies we have accumulated a large set of expression microarray measurements in samples derived from these types of mice. Here, we present the collation of these and third-party microarray data that are relevant to genomic imprinting into a Web Atlas of Murine genomic Imprinting and Differential EXpression (WAMIDEX: https://atlas.genetics.kcl.ac.uk). WAMIDEX integrates the most comprehensive literature-derived catalog of murine imprinted genes to date with a genome browser that makes the microarray data immediately accessible in annotation-rich genomic context. In addition, WAMIDEX exemplifies the use of the self-organizing map method for the discovery of novel imprinted genes from microarray data. The parent-of-origin-specific expression of imprinted genes is frequently limited to specific tissues or developmental stages, a fact that the atlas reflects in its design and data content.  相似文献   

5.
The massive surge in the production of microarray data poses a great challenge for proper analysis and interpretation. In recent years numerous computational tools have been developed to extract meaningful interpretation of microarray gene expression data. However, a convenient tool for two-groups comparison of microarray data is still lacking and users have to rely on commercial statistical packages that might be costly and require special skills, in addition to extra time and effort for transferring data from one platform to other. Various statistical methods, including the t-test, analysis of variance, Pearson test and Mann-Whitney U test, have been reported for comparing microarray data, whereas the utilization of the Wilcoxon signed-rank test, which is an appropriate test for two-groups comparison of gene expression data, has largely been neglected in microarray studies. The aim of this investigation was to build an integrated tool, ArraySolver, for colour-coded graphical display and comparison of gene expression data using the Wilcoxon signed-rank test. The results of software validation showed similar outputs with ArraySolver and SPSS for large datasets. Whereas the former program appeared to be more accurate for 25 or fewer pairs (n 相似文献   

6.
7.
Dufva M  Petronis S  Jensen LB  Krag C  Christensen CB 《BioTechniques》2004,37(2):286-92, 294, 296
An agarose film has been proposed as an efficient substrate for producing microarrays. The original film preparation procedure was simplified significantly by grafting the agarose layer directly onto unmodified microscope glass slides instead of aminated glass slides, and the blocking procedure was replaced with a wash in 0.1x standard saline citrate (SSC) and 0.5% sodium dodecyl sulfate (SDS) without decreasing the performance of the produced microarrays. Characterization of the grafted agarose film using atomic force microscopy (AFM) and scanning electron microscopy (SEM) showed that the agarose film had a 10-fold increase in surface roughness compared to glass and that the interior of the agarose film was porous, with pore sizes between 100-500 nm. A comparison of hybridization on aldehyde-activated agarose-coated microarray slides and commercial amino-reactive microarray slides showed that aldehyde-activated agarose-coated slides had the highest signal-to-noise ratio of 850, suggesting that the aldehyde-activated agarose microarray slides are suitable in applications where analytes have a wide concentration range. By immobilizing the DNA probes using ultraviolet (UV) light, the signal-to-noise ratio was further increased to 3000 on the agarose microarray slides. The specificity of the UV cross-linked DNA probes was demonstrated using 21 and 25 bp long capture probes, enabling discrimination of target molecules differing in only one base.  相似文献   

8.
9.
In this paper, fluorescent microarray images and various analysis techniques are described to improve the microarray data acquisition processes. Signal intensities produced by rarely expressed genes are initially correctly detected, but they are often lost in corrections for background, log or ratio. Our analyses indicate that a simple correlation between the mean and median signal intensities may be the best way to eliminate inaccurate microarray signals. Unlike traditional quality control methods, the low intensity signals are retained and inaccurate signals are eliminated in this mean and median correlation. With larger amounts of microarray data being generated, it becomes increasingly more difficult to analyze data on a visual basis. Our method allows for the automatic quantitative determination of accurate and reliable signals, which can then be used for normalization. We found that a mean to median correlation of 85% or higher not only retains more data than current methods, but the retained data is more accurate than traditional thresholds or common spot flagging algorithms. We have also found that by using pin microtapping and microvibrations, we can control spot quality independent from initial PCR volume.  相似文献   

10.
Cluster analysis has proven to be a useful tool for investigating the association structure among genes in a microarray data set. There is a rich literature on cluster analysis and various techniques have been developed. Such analyses heavily depend on an appropriate (dis)similarity measure. In this paper, we introduce a general clustering approach based on the confidence interval inferential methodology, which is applied to gene expression data of microarray experiments. Emphasis is placed on data with low replication (three or five replicates). The proposed method makes more efficient use of the measured data and avoids the subjective choice of a dissimilarity measure. This new methodology, when applied to real data, provides an easy-to-use bioinformatics solution for the cluster analysis of microarray experiments with replicates (see the Appendix). Even though the method is presented under the framework of microarray experiments, it is a general algorithm that can be used to identify clusters in any situation. The method's performance is evaluated using simulated and publicly available data set. Our results also clearly show that our method is not an extension of the conventional clustering method based on correlation or euclidean distance.  相似文献   

11.
12.
13.
Park T  Yi SG  Lee S  Lee JK 《BioTechniques》2005,38(3):463-471
Different sources of systematic and random error variations are often observed in cDNA microarray experiments. A simple scatter plot is commonly used to examine outlying slides that have unusual expression patterns or larger variability than other slides. These outlying slides tend to have large impacts on the subsequent analyses, such as identification of differentially expressed genes and clustering analysis. However, it is difficult to select outlying slides rigorously and consistently based on subjective human pattern recognition on their scatter plots. A graphical method and a rigorous diagnostic measure are proposed to detect outlying slides. The proposed graphical method is easy to implement and shown to be quite effective in detecting outlying slides in real microarray data sets. This diagnostic measure is also informative to compare variability among slides. Two cDNA microarray data sets are carefully examined to illustrate the proposed approach. A 3840-gene microarray experiment for neuronal differentiation of cortical stem cells and a 2076-gene microarray experiment for anticancer compound time-course expression of the NCI-60 cancer cell lines.  相似文献   

14.
Normalization of expression levels applied to microarray data can help in reducing measurement error. Different methods, including cyclic loess, quantile normalization and median or mean normalization, have been utilized to normalize microarray data. Although there is considerable literature regarding normalization techniques for mRNA microarray data, there are no publications comparing normalization techniques for microRNA (miRNA) microarray data, which are subject to similar sources of measurement error. In this paper, we compare the performance of cyclic loess, quantile normalization, median normalization and no normalization for a single-color microRNA microarray dataset. We show that the quantile normalization method works best in reducing differences in miRNA expression values for replicate tissue samples. By showing that the total mean squared error are lowest across almost all 36 investigated tissue samples, we are assured that the bias correction provided by quantile normalization is not outweighed by additional error variance that can arise from a more complex normalization method. Furthermore, we show that quantile normalization does not achieve these results by compression of scale.  相似文献   

15.

Background  

To cancel experimental variations, microarray data must be normalized prior to analysis. Where an appropriate model for statistical data distribution is available, a parametric method can normalize a group of data sets that have common distributions. Although such models have been proposed for microarray data, they have not always fit the distribution of real data and thus have been inappropriate for normalization. Consequently, microarray data in most cases have been normalized with non-parametric methods that adjust data in a pair-wise manner. However, data analysis and the integration of resultant knowledge among experiments have been difficult, since such normalization concepts lack a universal standard.  相似文献   

16.
During recent years, microarrays have been firmly established as valuable tools for the discovery of novel biological phenomena. Especially in combination with whole genome sequences, microarray data can help unravel the dynamics of the expressed genome. For filamentous fungi, microarray studies have already been performed with more than 20 different species; these investigations have explored a variety of different aspects of fungal biology. In this review, I will give an overview of some of the key questions that have been addressed using microarray hybridizations with filamentous fungi, with particular focus on the analysis of co-regulated pathways and physically clustered genes, as well as on the use of microarray data to determine a molecular phenotype. Additionally, a number of useful, freely available software tools for the analysis of fungal microarray data will be discussed.  相似文献   

17.
RNA-Seq and microarray platforms have emerged as important tools for detecting changes in gene expression and RNA processing in biological samples. We present ExpressionPlot, a software package consisting of a default back end, which prepares raw sequencing or Affymetrix microarray data, and a web-based front end, which offers a biologically centered interface to browse, visualize, and compare different data sets. Download and installation instructions, a user's manual, discussion group, and a prototype are available at .  相似文献   

18.
lumi: a pipeline for processing Illumina microarray   总被引:2,自引:0,他引:2  
Illumina microarray is becoming a popular microarray platform. The BeadArray technology from Illumina makes its preprocessing and quality control different from other microarray technologies. Unfortunately, most other analyses have not taken advantage of the unique properties of the BeadArray system, and have just incorporated preprocessing methods originally designed for Affymetrix microarrays. lumi is a Bioconductor package especially designed to process the Illumina microarray data. It includes data input, quality control, variance stabilization, normalization and gene annotation portions. In specific, the lumi package includes a variance-stabilizing transformation (VST) algorithm that takes advantage of the technical replicates available on every Illumina microarray. Different normalization method options and multiple quality control plots are provided in the package. To better annotate the Illumina data, a vendor independent nucleotide universal identifier (nuID) was devised to identify the probes of Illumina microarray. The nuID annotation packages and output of lumi processed results can be easily integrated with other Bioconductor packages to construct a statistical data analysis pipeline for Illumina data. Availability: The lumi Bioconductor package, www.bioconductor.org  相似文献   

19.
We have constructed cDNA microarrays for soybean (Glycine maxL. Merrill), containing approximately 4,100 Unigene ESTs derived from axenic roots, to evaluate their application and utility for functional genomics of organ differentiation in legumes. We assessed microarray technology by conducting studies to evaluate the accuracy of microarray data and have found them to be both reliable and reproducible in repeat hybridisations. Several ESTs showed high levels (50 fold) of differential expression in either root or shoot tissue of soybean. A small number of physiologically interesting, and differentially expressed sequences found by microarray analysis were verified by both quantitative real-time RT-PCR and Northern blot analysis. There was a linear correlation (r2 = 0.99, over 5 orders of magnitude) between microarray and quantitative real-time RT-PCR data. Microarray analysis of soybean has enormous potential not only for the discovery of new genes involved in tissue differentiation and function, but also to study the expression of previously characterised genes, gene networks and gene interactions in wild-type, mutant or transgenic plants.  相似文献   

20.
MOTIVATION: Recent technological advances such as cDNA microarray technology have made it possible to simultaneously interrogate thousands of genes in a biological specimen. A cDNA microarray experiment produces a gene expression 'profile'. Often interest lies in discovering novel subgroupings, or 'clusters', of specimens based on their profiles, for example identification of new tumor taxonomies. Cluster analysis techniques such as hierarchical clustering and self-organizing maps have frequently been used for investigating structure in microarray data. However, clustering algorithms always detect clusters, even on random data, and it is easy to misinterpret the results without some objective measure of the reproducibility of the clusters. RESULTS: We present statistical methods for testing for overall clustering of gene expression profiles, and we define easily interpretable measures of cluster-specific reproducibility that facilitate understanding of the clustering structure. We apply these methods to elucidate structure in cDNA microarray gene expression profiles obtained on melanoma tumors and on prostate specimens.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号