首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
MOTIVATION: Data from one-channel cDNA microarray studies may exhibit poor reproducibility due to spatial heterogeneity, non-linear array-to-array variation and problems in correcting for background. Uncorrected, these phenomena can give rise to misleading conclusions. RESULTS: Spatial heterogeneity may be corrected using two-dimensional loess smoothing (Colantuoni et al., 2002). Non-linear between-array variation may be corrected using an iterative application of one-dimensional loess smoothing. A method for background correction using a smoothing function rather than simple subtraction is described. These techniques promote within-array spatial uniformity and between-array reproducibility. Their application is illustrated using data from a study of the effects of an insulin sensitizer, rosiglitazone, on gene expression in white adipose tissue in diabetic db/db mice. They may also be useful with data from two-channel cDNA microarrays and from oligonucleotide arrays. AVAILABILITY: R functions for the methods described are available on request from the author.  相似文献   

2.
Despite the tremendous growth of microarray usage in scientific studies, there is a lack of standards for background correction methodologies, especially in single-color microarray platforms. Traditional background subtraction methods often generate negative signals and thus cause large amounts of data loss. Hence, some researchers prefer to avoid background corrections, which typically result in the underestimation of differential expression. Here, by utilizing nonspecific negative control features integrated into Illumina whole genome expression arrays, we have developed a method of model-based background correction for BeadArrays (MBCB). We compared the MBCB with a method adapted from the Affymetrix robust multi-array analysis algorithm and with no background subtraction, using a mouse acute myeloid leukemia (AML) dataset. We demonstrated that differential expression ratios obtained by using the MBCB had the best correlation with quantitative RT–PCR. MBCB also achieved better sensitivity in detecting differentially expressed genes with biological significance. For example, we demonstrated that the differential regulation of Tnfr2, Ikk and NF-kappaB, the death receptor pathway, in the AML samples, could only be detected by using data after MBCB implementation. We conclude that MBCB is a robust background correction method that will lead to more precise determination of gene expression and better biological interpretation of Illumina BeadArray data.  相似文献   

3.
MOTIVATION: Background correction is an important preprocess in cDNA microarray data analysis. A variety of methods have been used for this purpose. However, many kinds of backgrounds, especially inhomogeneous ones, cannot be estimated correctly using any of the existing methods. In this paper, we propose the use of the TV+L1 model, which minimizes the total variation (TV) of the image subject to an L1-fidelity term, to correct background bias. We demonstrate its advantages over the existing methods by both analytically discussing its properties and numerically comparing it with morphological opening. RESULTS: Experimental results on both synthetic data and real microarray images demonstrate that the TV+L1 model gives the restored intensity that is closer to the true data than morphological opening. As a result, this method can serve an important role in the preprocessing of cDNA microarray data.  相似文献   

4.
Two-color microarrays are a powerful tool for genomic analysis, but have noise components that make inferences regarding gene expression inefficient and potentially misleading. Background fluorescence, whether attributable to nonspecific binding or other sources, is an important component of noise. The decision to subtract fluorescence surrounding spots of hybridization from spot fluorescence has been controversial, with no clear criteria for determining circumstances that may favor, or disfavor, background subtraction. While it is generally accepted that subtracting background reduces bias but increases variance in the estimates of the ratios of interest, no formal analysis of the bias-variance trade off of background subtraction has been undertaken. In this paper, we use simulation to systematically examine the bias-variance trade off under a variety of possible experimental conditions. Our simulation is based on data obtained from 2 self versus self microarray experiments and is free of distributional assumptions. Our results identify factors that are important for determining whether to background subtract, including the correlation of foreground to background intensity ratios. Using these results, we develop recommendations for diagnostic visualizations that can help decisions about background subtraction.  相似文献   

5.
There are many options in handling microarray data that can affect study conclusions, sometimes drastically. Working with a two-color platform, this study uses ten spike-in microarray experiments to evaluate the relative effectiveness of some of these options for the experimental goal of detecting differential expression. We consider two data transformations, background subtraction and intensity normalization, as well as six different statistics for detecting differentially expressed genes. Findings support the use of an intensity-based normalization procedure and also indicate that local background subtraction can be detrimental for effectively detecting differential expression. We also verify that robust statistics outperform t-statistics in identifying differentially expressed genes when there are few replicates. Finally, we find that choice of image analysis software can also substantially influence experimental conclusions.  相似文献   

6.
Here we present a methodology for the normalization of element signal intensities to a mean intensity calculated locally across the surface of a DNA microarray. These methods allow the detection and/or correction of spatially systematic artifacts in microarray data. These include artifacts that can be introduced during the robotic printing, hybridization, washing, or imaging of microarrays. Using array element signal intensities alone, this local mean normalization process can correct for such artifacts because they vary across the surface of the array. The local mean normalization can be usedfor quality control and data correction purposes in the analysis of microarray data. These algorithms assume that array elements are not spatially ordered with regard to sequence or biological function and require that this spatial mapping is identical between the two sets of intensities to be compared. The tool described in this report was developed in the R statistical language and is freely available on the Internet as part of a larger gene expression analysis package. This Web implementation is interactive and user-friendly and allows the easy use of the local mean normalization tool described here, without programming expertise or downloading of additional software.  相似文献   

7.
Normalization removes or minimizes the biases of systematic variation that exists in experimental data sets. This study presents a systematic variation normalization (SVN) procedure for removing systematic variation in two channel microarray gene expression data. Based on an analysis of how systematic variation contributes to variability in microarray data sets, our normalization procedure includes background subtraction determined from the distribution of pixel intensity values from each data acquisition channel and log conversion, linear or non-linear regression, restoration or transformation, and multiarray normalization. In the case when a non-linear regression is required, an empirical polynomial approximation approach is used. Either the high terminated points or their averaged values in the distributions of the pixel intensity values observed in control channels may be used for rescaling multiarray datasets. These pre-processing steps remove systematic variation in the data attributable to variability in microarray slides, assay-batches, the array process, or experimenters. Biologically meaningful comparisons of gene expression patterns between control and test channels or among multiple arrays are therefore unbiased using normalized but not unnormalized datasets.  相似文献   

8.
The aims were to evaluate the common reference design approach in microarray experiments and to evaluate the technical performance and the normalisation of cDNA microarrays with a limited number of spots. Total RNA from 3 normal and 3 tumour sample biopsies were used for synthesis of amino-allyl labelled cRNA. Equal amounts of cRNA from all samples were mixed as reference. After conjugation of cRNA with fluorophores (Cy3/Cy5), each individual tumour cRNA was hybridised to a cDNA microarray together with reference cRNA, scanned and analysed. We show that our procedures for producing cDNA microarrays are reproducible. The concordance between duplicated spots and replicate hybridisation was found to be high. We have demonstrated that our cDNA microarrays are of a high technical quality. The majority of the cDNA microarrays had low local spot background levels. Despite the high background levels for some local spots, variation could be minimized by locally weighted scatter plot smooth normalisation (LOWESS), which we showed was also suitable for normalisation of cDNA microarrays with a limited number of probes.  相似文献   

9.
Little consideration has been given to the effect of different segmentation methods on the variability of data derived from microarray images. Previous work has suggested that the significant source of variability from microarray image analysis is from estimation of local background. In this study, we used Analysis of Variance (ANOVA) models to investigate the effect of methods of segmentation on the precision of measurements obtained from replicate microarray experiments. We used four different methods of spot segmentation (adaptive, fixed circle, histogram and GenePix) to analyse a total number of 156 172 spots from 12 microarray experiments. Using a two-way ANOVA model and the coefficient of repeatability, we show that the method of segmentation significantly affects the precision of the microarray data. The histogram method gave the lowest variability across replicate spots compared to other methods, and had the lowest pixel-to-pixel variability within spots. This effect on precision was independent of background subtraction. We show that these findings have direct, practical implications as the variability in precision between the four methods resulted in different numbers of genes being identified as differentially expressed. Segmentation method is an important source of variability in microarray data that directly affects precision and the identification of differentially expressed genes.  相似文献   

10.
Song PX  Gao X  Liu R  Le W 《Biometrics》2006,62(2):545-554
Identifying local extrema of expression profiles is one primary objective in some cDNA microarray experiments. To study the replication dynamics of the yeast genome, for example, local peaks of hybridization intensity profiles correspond to putative replication origins. We propose a nonparametric kernel smoothing (NKS) technique to detect local hybridization intensity extrema across chromosomes. The novelty of our approach is that we base our inference procedures on equilibrium points, namely those locations at which the first derivative of the intensity curve is zero. The proposed smoothing technique provides both point and interval estimation for the location of local extrema. Also, this technique can be used to test for the hypothesis of either one or multiple suspected locations being the true equilibrium points. We illustrate the proposed method on a microarray data set from an experiment designed to study the replication origins in the yeast genome, in that the locations of autonomous replication sequence (ARS) elements are identified through the equilibrium points of the smoothed intensity profile curve. Our method found a few ARS elements that were not detected by the current smoothing methods such as the Fourier convolution smoothing.  相似文献   

11.
Microarrays are part of a new class of biotechnologies that allow the monitoring of expression levels for thousands of genes simultaneously. Image analysis is an important aspect of microarray experiments, one that can have a potentially large impact on subsequent analyses, such as clustering or the identification of differentially expressed genes. This paper reviews a number of existing image analysis methods used on cDNA microarray data. In particular, it describes and discusses the different segmentation and background adjustment methods. It was found that in some cases background adjustment can substantially reduce the precision--that is, increase the variability of low-intensity spot values. In contrast, the choice of segmentation procedure seems to have a smaller impact.  相似文献   

12.
The accuracy of gene expression measurements generated using cDNA microarrays is dependent on the quality of the image generated following hybridization of fluorescently labelled cDNA. It is not known how this image is influenced by sample preparation factors which such as RNA quality, cDNA synthesis and labelling efficiency. In this study we used a simple metric based on the ratio of the total feature (F) and background (B) fluorescence, which correlates with the visual assessment of 60 microarray images, to determine the influence of sample preparation on image quality. Results indicate that RNA purity (A260/A280) and integrity (18S:28S ratio) do not strongly influence microarray image quality. cDNA having an nucleotide to dye ratio greater than 100 produced poor microarray images, however, cDNA labelled more efficiently was not a guarantee of a better image. The data also indicate that the array image quality is not improved by loading more cDNA into the hybridization mixture however poor image quality did result from a disproportionate amounts of Cy5 and Cy3 labelled cDNA. This study provides insight into the source of variation in microarray image analysis introduced during sample preparation and will assist in the standardisation of cDNA glass slide microarray protocols.  相似文献   

13.
The 3H-thymidine labeling index (TLI) and the percentage of cells in the S-phase have been determined by autoradiography and by flow cytometry, (FCM), respectively, in six malignant tumors of human origin transplanted on athymic nude mice. The Dean and Jett model and the graphical model were used to determine the percent of S-phase cells by FCM. Cell cycle analysis was performed using 1) no correction for background; 2) an algebraic function for background correction; and 3) an exponential function for background subtraction. Each of these three data sets was evaluated using both the Dean and Jett model and a graphical model for the evaluation of DNA histograms. The S-phase fractions (SPF) were compared to the corresponding labeling index results. SPF without background correction were 1.54 times higher than the TLI. SPF, after correction using the algebraic model, were 1.29-fold higher than the TLI, whereas SPF obtained after background subtraction according to the exponential model were only 1.05-fold higher than the TLI. Student's t-test revealed significant differences between the mean TLI values (16.25 +/- 9.06) and the mean SPF obtained by FCM without background correction (mean 25.0 +/- 9.36, P less than 0.01), but not between the mean TLI values and the mean SPF percentages after algebraic (mean 21.0 +/- 10.29) and exponential background correction (mean 17.11 +/- 11.59), P greater than 0.05 each. There was no difference between the results obtained using the Dean and Jett model and those obtained using the graphic evaluation.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

14.
15.
Combinatorial image analysis of DNA microarray features   总被引:3,自引:0,他引:3  
MOTIVATION: DNA and protein microarrays have become an established leading-edge technology for large-scale analysis of gene and protein content and activity. Contact-printed microarrays has emerged as a relatively simple and cost effective method of choice but its reliability is especially susceptible to quality of pixel information obtained from digital scans of spotted features in the microarray image. RESULTS: We address the statistical computation requirements for optimizing data acquisition and processing of digital scans. We consider the use of median filters to reduce noise levels in images and top-hat filters to correct for trends in background values. We also consider, as alternative estimators of spot intensity, discs of fixed radius, proportions of histograms and k-means clustering, either with or without a square-root intensity transformation and background subtraction. We identify, using combinatoric procedures, optimal filter and estimator parameters, in achieving consistency among the replicates of a gene on each microarray. Our results, using test data from microarrays of HCMV, indicate that a highly effective approach for improving reliability and quality of microarray data is to apply a 21 by 21 top-hat filter, then estimate spot intensity as the mean of the largest 20% of pixel values in the target region, after a square-root transformation, and corrected for background, by subtracting the mean of the smallest 70% of pixel values. AVAILABILITY: Fortran90 subroutines implementing these methods are available from the authors, or at http://www.bioss.ac.uk/~chris.  相似文献   

16.
In this study we present two novel normalization schemes for cDNA microarrays. They are based on iterative local regression and optimization of model parameters by generalized cross-validation. Permutation tests assessing the efficiency of normalization demonstrated that the proposed schemes have an improved ability to remove systematic errors and to reduce variability in microarray data. The analysis also reveals that without parameter optimization local regression is frequently insufficient to remove systematic errors in microarray data.  相似文献   

17.
Microarray-based analysis of single nucleotide polymorphisms (SNPs) has many applications in large-scale genetic studies. To minimize the influence of experimental variation, microarray data usually need to be processed in different aspects including background subtraction, normalization and low-signal filtering before genotype determination. Although many algorithms are sophisticated for these purposes, biases are still present. In the present paper, new algorithms for SNP microarray data analysis and the software, AccuTyping, developed based on these algorithms are described. The algorithms take advantage of a large number of SNPs included in each assay, and the fact that the top and bottom 20% of SNPs can be safely treated as homozygous after sorting based on their ratios between the signal intensities. These SNPs are then used as controls for color channel normalization and background subtraction. Genotype calls are made based on the logarithms of signal intensity ratios using two cutoff values, which were determined after training the program with a dataset of approximately 160,000 genotypes and validated by non-microarray methods. AccuTyping was used to determine >300,000 genotypes of DNA and sperm samples. The accuracy was shown to be >99%. AccuTyping can be downloaded from http://www2.umdnj.edu/lilabweb/publications/AccuTyping.html.  相似文献   

18.

Background  

When DNA microarray data are used for gene clustering, genotype/phenotype correlation studies, or tissue classification the signal intensities are usually transformed and normalized in several steps in order to improve comparability and signal/noise ratio. These steps may include subtraction of an estimated background signal, subtracting the reference signal, smoothing (to account for nonlinear measurement effects), and more. Different authors use different approaches, and it is generally not clear to users which method they should prefer.  相似文献   

19.
建立一种简便、快速、特异的制备基因芯片探针的方法.以K562细胞和正常人淋巴细胞作为消减对象,利用自行建立的消减方法进行消减杂交,结合限制性显示技术,分组扩增差异cDNA,回收K562细胞特异基因片段,制作基因芯片探针.结果显示,分离到400个K562特异的基因,片段大小均一,适于制作cDNA芯片.消减杂交技术结合限制性显示技术制备基因芯片探针,具有快速、简便、特异的特点,降低了芯片制作成本,可加速芯片的推广应用.  相似文献   

20.
MOTIVATION: DNA microarrays are now capable of providing genome-wide patterns of gene expression across many different conditions. The first level of analysis of these patterns requires determining whether observed differences in expression are significant or not. Current methods are unsatisfactory due to the lack of a systematic framework that can accommodate noise, variability, and low replication often typical of microarray data. RESULTS: We develop a Bayesian probabilistic framework for microarray data analysis. At the simplest level, we model log-expression values by independent normal distributions, parameterized by corresponding means and variances with hierarchical prior distributions. We derive point estimates for both parameters and hyperparameters, and regularized expressions for the variance of each gene by combining the empirical variance with a local background variance associated with neighboring genes. An additional hyperparameter, inversely related to the number of empirical observations, determines the strength of the background variance. Simulations show that these point estimates, combined with a t -test, provide a systematic inference approach that compares favorably with simple t -test or fold methods, and partly compensate for the lack of replication.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号