首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Comparison of mRNA gene expression by RT-PCR and DNA microarray   总被引:10,自引:0,他引:10  
Etienne W  Meyer MH  Peppers J  Meyer RA 《BioTechniques》2004,36(4):618-20, 622, 624-6
  相似文献   

2.
MOTIVATION: Many biomedical experiments are carried out by pooling individual biological samples. However, pooling samples can potentially hide biological variance and give false confidence concerning the data significance. In the context of microarray experiments for detecting differentially expressed genes, recent publications have addressed the problem of the efficiency of sample pooling, and some approximate formulas were provided for the power and sample size calculations. It is desirable to have exact formulas for these calculations and have the approximate results checked against the exact ones. We show that the difference between the approximate and the exact results can be large. RESULTS: In this study, we have characterized quantitatively the effect of pooling samples on the efficiency of microarray experiments for the detection of differential gene expression between two classes. We present exact formulas for calculating the power of microarray experimental designs involving sample pooling and technical replications. The formulas can be used to determine the total number of arrays and biological subjects required in an experiment to achieve the desired power at a given significance level. The conditions under which pooled design becomes preferable to non-pooled design can then be derived given the unit cost associated with a microarray and that with a biological subject. This paper thus serves to provide guidance on sample pooling and cost-effectiveness. The formulation in this paper is outlined in the context of performing microarray comparative studies, but its applicability is not limited to microarray experiments. It is also applicable to a wide range of biomedical comparative studies where sample pooling may be involved.  相似文献   

3.
We discuss how the samples should be arranged in two-dye microarray studies when the objective is to investigate associations between gene expression and quantitative traits measured on each sample. Because there is typically large between array variation, information about the association will come from the differences in traits and expression measurements between the two values hybridised to the two dyes on the same array. It is shown that within-slide correlation of trait values should be minimised. The arrangement of samples for which this occurs will depend on the trait values in question, and is a computationally demanding problem. An alternative is to minimise the rank correlation. We discuss this and related issues for different combinations of numbers of samples and arrays. Data analysis, including estimation of the variance components, is also described.  相似文献   

4.
5.
6.
MOTIVATION: One particular application of microarray data, is to uncover the molecular variation among cancers. One feature of microarray studies is the fact that the number n of samples collected is relatively small compared to the number p of genes per sample which are usually in the thousands. In statistical terms this very large number of predictors compared to a small number of samples or observations makes the classification problem difficult. An efficient way to solve this problem is by using dimension reduction statistical techniques in conjunction with nonparametric discriminant procedures. RESULTS: We view the classification problem as a regression problem with few observations and many predictor variables. We use an adaptive dimension reduction method for generalized semi-parametric regression models that allows us to solve the 'curse of dimensionality problem' arising in the context of expression data. The predictive performance of the resulting classification rule is illustrated on two well know data sets in the microarray literature: the leukemia data that is known to contain classes that are easy 'separable' and the colon data set.  相似文献   

7.
We developed a practical strategy for serum protein profiling using antibody microarrays and applied the method to the identification of potential biomarkers in prostate cancer serum. Protein abundances from 33 prostate cancer and 20 control serum samples were compared to abundances from a common reference pool using a two-color fluorescence assay. Robotically spotted microarrays containing 184 unique antibodies were prepared on two different substrates: polyacrylamide based hydrogels on glass and poly-1-lysine coated glass with a photoreactive cross-linking layer. The hydrogel substrate yielded an average six-fold higher signal-to-noise ratio than the other substrate, and detection of protein binding was possible from a greater number of antibodies using the hydrogels. A statistical filter based on the correlation of data from "reverse-labeled" experiment sets accurately predicted the agreement between the microarray measurements and enzyme-linked immunosorbent assay measurements, showing that this parameter can serve to screen for antibodies that are functional on microarrays. Having defined a set of reliable microarray measurements, we identified five proteins (von Willebrand Factor, immunoglobulinM, Alpha1-antichymotrypsin, Villin and immunoglobulinG) that had significantly different levels between the prostate cancer samples and the controls. These developments enable the immediate use of high-density antibody and protein microarrays in biomarker discovery studies.  相似文献   

8.
Classification methods used in microarray studies for gene expression are diverse in the way they deal with the underlying complexity of the data, as well as in the technique used to build the classification model. The MAQC II study on cancer classification problems has found that performance was affected by factors such as the classification algorithm, cross validation method, number of genes, and gene selection method. In this paper, we study the hypothesis that the disease under study significantly determines which method is optimal, and that additionally sample size, class imbalance, type of medical question (diagnostic, prognostic or treatment response), and microarray platform are potentially influential. A systematic literature review was used to extract the information from 48 published articles on non-cancer microarray classification studies. The impact of the various factors on the reported classification accuracy was analyzed through random-intercept logistic regression. The type of medical question and method of cross validation dominated the explained variation in accuracy among studies, followed by disease category and microarray platform. In total, 42% of the between study variation was explained by all the study specific and problem specific factors that we studied together.  相似文献   

9.
The widespread use of DNA microarrays has led to the discovery of many genes whose expression profile may have significant clinical relevance. The translation of this data to the bedside requires that gene expression be validated as protein expression, and that annotated clinical samples be available for correlative and quantitative studies to assess clinical context and usefulness of putative biomarkers. We review two microarray platforms developed to facilitate the clinical validation of candidate biomarkers: tissue microarrays and reverse-phase protein microarrays. Tissue microarrays are arrays of core biopsies obtained from paraffin-embedded tissues, which can be assayed for histologically-specific protein expression by immunohistochemistry. Reverse-phase protein microarrays consist of arrays of cell lysates or, more recently, plasma or serum samples, which can be assayed for protein quantity and for the presence of post-translational modifications such as phosphorylation. Although these platforms are limited by the availability of validated antibodies, both enable the preservation of precious clinical samples as well as experimental standardization in a high-throughput manner proper to microarray technologies. While tissue microarrays are rapidly becoming a mainstay of translational research, reverse-phase protein microarrays require further technical refinements and validation prior to their widespread adoption by research laboratories.  相似文献   

10.
基于基因表达谱识别乳腺癌转移相关差异表达基因及其功能时,由于基因表达在个体间的变异相对较高而样本量相对较少,由不同研究识别的差异表达基因的可重复性较低。本文基于两套乳腺癌转移基因表达谱,评价两组差异表达基因及其所富集的功能的可重复性。结果显示:在两套表达谱中识别的差异表达基因的表达改变方向高度一致并具有显著的表达相关性;基于两组差异表达基因识别的转移相关功能在两套表达谱中高度可重复,主要涉及细胞分裂、细胞周期、DNA复制、染色体分离、磷酸肌醇介导信号转导和DNA损伤刺激应答等。  相似文献   

11.
Identifying the molecular basis of QTLs: eQTLs add a new dimension   总被引:1,自引:0,他引:1  
Natural genetic variation within plant species is at the core of plant science ranging from agriculture to evolution. Whereas much progress has been made in mapping quantitative trait loci (QTLs) controlling this natural variation, the elucidation of the underlying molecular mechanisms has remained a bottleneck. Recent systems biology tools have significantly shortened the time required to proceed from a mapped locus to testing of candidate genes. These tools enable research on natural variation to move from simple reductionistic studies focused on individual genes to integrative studies connecting molecular variation at multiple loci with physiological consequences. This review focuses on recent examples that demonstrate how expression QTL data can be used for gene discovery and exploited to untangle complex regulatory networks.  相似文献   

12.

Background

The methods used for sample selection and processing can have a strong influence on the expression values obtained through microarray profiling. Laser capture microdissection (LCM) provides higher specificity in the selection of target cells compared to traditional bulk tissue selection methods, but at an increased processing cost. The benefit gained from the higher tissue specificity realized through LCM sampling is evaluated in this study through a comparison of microarray expression profiles obtained from same-samples using bulk and LCM processing.

Methods

Expression data from ten lung adenocarcinoma samples and six adjacent normal samples were acquired using LCM and bulk sampling methods. Expression values were evaluated for correlation between sample processing methods, as well as for bias introduced by the additional linear amplification required for LCM sample profiling.

Results

The direct comparison of expression values obtained from the bulk and LCM sampled datasets reveals a large number of probesets with significantly varied expression. Many of these variations were shown to be related to bias arising from the process of linear amplification, which is required for LCM sample preparation. A comparison of differentially expressed genes (cancer vs. normal) selected in the bulk and LCM datasets also showed substantial differences. There were more than twice as many down-regulated probesets identified in the LCM data than identified in the bulk data. Controlling for the previously identified amplification bias did not have a substantial impact on the differences identified in the differentially expressed probesets found in the bulk and LCM samples.

Conclusion

LCM-coupled microarray expression profiling was shown to uniquely identify a large number of differentially expressed probesets not otherwise found using bulk tissue sampling. The information gain realized from the LCM sampling was limited to differential analysis, as the absolute expression values obtained for some probesets using this study's protocol were biased during the second round of amplification. Consequently, LCM may enable investigators to obtain additional information in microarray studies not easily found using bulk tissue samples, but it is of critical importance that potential amplification biases are controlled for.  相似文献   

13.
Comparison of microarray designs for class comparison and class discovery   总被引:4,自引:0,他引:4  
MOTIVATION: Two-color microarray experiments in which an aliquot derived from a common RNA sample is placed on each array are called reference designs. Traditionally, microarray experiments have used reference designs, but designs without a reference have recently been proposed as alternatives. RESULTS: We develop a statistical model that distinguishes the different levels of variation typically present in cancer data, including biological variation among RNA samples, experimental error and variation attributable to phenotype. Within the context of this model, we examine the reference design and two designs which do not use a reference, the balanced block design and the loop design, focusing particularly on efficiency of estimates and the performance of cluster analysis. We calculate the relative efficiency of designs when there are a fixed number of arrays available, and when there are a fixed number of samples available. Monte Carlo simulation is used to compare the designs when the objective is class discovery based on cluster analysis of the samples. The number of discrepancies between the estimated clusters and the true clusters were significantly smaller for the reference design than for the loop design. The efficiency of the reference design relative to the loop and block designs depends on the relation between inter- and intra-sample variance. These results suggest that if cluster analysis is a major goal of the experiment, then a reference design is preferable. If identification of differentially expressed genes is the main concern, then design selection may involve a consideration of several factors.  相似文献   

14.
Single-nucleotide polymorphisms (SNPs) are considered useful polymorphic markers for genetic studies of polygenic traits. A new practical approach to high-throughput genotyping of SNPs in a large number of individuals is needed in association study and other studies on relationships between genes and diseases. We have developed an accurate and high-throughput method for determining the allele frequencies by pooling the DNA samples and applying a DNA microarray hybridization analysis. In this method, the combination of the microarray, DNA pooling, probe pair hybridization, and fluorescent ratio analysis solves the dual problems of parallel multiple sample analysis, and parallel multiplex SNP genotyping for association study. Multiple DNA samples are immobilized on a slide and a single hybridization is performed with a pool of allele-specific oligonucleotide probes. The results of this study show that hybridization of microarray from pooled DNA samples can accurately obtain estimates of absolute allele frequencies in a sample pool. This method can also be used to identify differences in allele frequencies in distinct populations. It is amenable to automation and is suitable for immediate utilization for high-throughput genotyping of SNP.  相似文献   

15.
For the identification and quantification of methanogenic archaea (methanogens) in environmental samples, various oligonucleotide probes/primers targeting phylogenetic markers of methanogens, such as 16S rRNA, 16S rRNA gene and the gene for the α‐subunit of methyl coenzyme M reductase (mcrA), have been extensively developed and characterized experimentally. These oligonucleotides were designed to resolve different groups of methanogens at different taxonomic levels, and have been widely used as hybridization probes or polymerase chain reaction primers for membrane hybridization, fluorescence in situ hybridization, rRNA cleavage method, gene cloning, DNA microarray and quantitative polymerase chain reaction for studies in environmental and determinative microbiology. In this review, we present a comprehensive list of such oligonucleotide probes/primers, which enable us to determine methanogen populations in an environment quantitatively and hierarchically, with examples of the practical applications of the probes and primers.  相似文献   

16.
17.
Technical variation, or variation from non-biological sources, is present in most laboratory assays. Correcting for this variation enables analysts to extract a biological signal that informs questions of interest. However, each assay has different sources and levels of technical variation, and the choice of correction methods can impact downstream analyses. Compared to similar assays such as DNA microarrays, relatively few methods have been developed and evaluated for protein microarrays, a versatile tool for measuring levels of various proteins in serum samples. Here, we propose a pre-processing pipeline to correct for some common sources of technical variation in protein microarrays. The pipeline builds upon an existing normalization method by using controls to reduce technical variation. We evaluate our method using data from two protein microarray studies and by simulation. We demonstrate that pre-processing choices impact the fluorescent-intensity based ranks of proteins, which in turn, impact downstream analysis.  相似文献   

18.
Analysis of variance components in gene expression data   总被引:5,自引:0,他引:5  
MOTIVATION: A microarray experiment is a multi-step process, and each step is a potential source of variation. There are two major sources of variation: biological variation and technical variation. This study presents a variance-components approach to investigating animal-to-animal, between-array, within-array and day-to-day variations for two data sets. The first data set involved estimation of technical variances for pooled control and pooled treated RNA samples. The variance components included between-array, and two nested within-array variances: between-section (the upper- and lower-sections of the array are replicates) and within-section (two adjacent spots of the same gene are printed within each section). The second experiment was conducted on four different weeks. Each week there were reference and test samples with a dye-flip replicate in two hybridization days. The variance components included week-to-week, animal-to-animal and between-array and within-array variances. RESULTS: We applied the linear mixed-effects model to quantify different sources of variation. In the first data set, we found that the between-array variance is greater than the between-section variance, which, in turn, is greater than the within-section variance. In the second data set, for the reference samples, the week-to-week variance is larger than the between-array variance, which, in turn, is slightly larger than the within-array variance. For the test samples, the week-to-week variance has the largest variation. The animal-to-animal variance is slightly larger than the between-array and within-array variances. However, in a gene-by-gene analysis, the animal-to-animal variance is smaller than the between-array variance in four out of five housekeeping genes. In summary, the largest variation observed is the week-to-week effect. Another important source of variability is the animal-to-animal variation. Finally, we describe the use of variance-component estimates to determine optimal numbers of animals, arrays per animal and sections per array in planning microarray experiments.  相似文献   

19.
Microarrays are used to study gene expression in a variety of biological systems. A number of different platforms have been developed, but few studies exist that have directly compared the performance of one platform with another. The goal of this study was to determine array variation by analyzing the same RNA samples with three different array platforms. Using gene expression responses to benzo[a]pyrene exposure in normal human mammary epithelial cells (NHMECs), we compared the results of gene expression profiling using three microarray platforms: photolithographic oligonucleotide arrays (Affymetrix), spotted oligonucleotide arrays (Amersham), and spotted cDNA arrays (NCI). While most previous reports comparing microarrays have analyzed pre-existing data from different platforms, this comparison study used the same sample assayed on all three platforms, allowing for analysis of variation from each array platform. In general, poor correlation was found with corresponding measurements from each platform. Each platform yielded different gene expression profiles, suggesting that while microarray analysis is a useful discovery tool, further validation is needed to extrapolate results for broad use of the data. Also, microarray variability needs to be taken into consideration, not only in the data analysis but also in specific probe selection for each array type.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号