期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

SVAw - a web-based application tool for automated surrogate variable analysis of gene expression studies

Mehdi?Pirooznia Fayaz?Seifuddin Fernando?S?Goes Jeffrey?T?Leek Peter?P?Zandi Email author 《Source code for biology and medicine》2013,8(1):8

Background

Surrogate variable analysis (SVA) is a powerful method to identify, estimate, and utilize the components of gene expression heterogeneity due to unknown and/or unmeasured technical, genetic, environmental, or demographic factors. These sources of heterogeneity are common in gene expression studies, and failing to incorporate them into the analysis can obscure results. Using SVA increases the biological accuracy and reproducibility of gene expression studies by identifying these sources of heterogeneity and correctly accounting for them in the analysis.

Results

Here we have developed a web application called SVAw (Surrogate variable analysis Web app) that provides a user friendly interface for SVA analyses of genome-wide expression studies. The software has been developed based on open source bioconductor SVA package. In our software, we have extended the SVA program functionality in three aspects: (i) the SVAw performs a fully automated and user friendly analysis workflow; (ii) It calculates probe/gene Statistics for both pre and post SVA analysis and provides a table of results for the regression of gene expression on the primary variable of interest before and after correcting for surrogate variables; and (iii) it generates a comprehensive report file, including graphical comparison of the outcome for the user.

Conclusions

SVAw is a web server freely accessible solution for the surrogate variant analysis of high-throughput datasets and facilitates removing all unwanted and unknown sources of variation. It is freely available for use at http://psychiatry.igm.jhmi.edu/sva. The executable packages for both web and standalone application and the instruction for installation can be downloaded from our web site.

相似文献

2.

Capturing changes in gene expression dynamics by gene set differential coordination analysis

Yu T Bai Y 《Genomics》2011,98(6):469-477

相似文献

3.

Independent surrogate variable analysis to deconvolve confounding factors in large-scale microarray profiling studies

Teschendorff AE Zhuang J Widschwendter M 《Bioinformatics (Oxford, England)》2011,27(11):1496-1505

相似文献

4.

Effectively identifying regulatory hotspots while capturing expression heterogeneity in gene expression studies

Jong Wha J Joo Jae Hoon Sul Buhm Han Chun Ye Eleazar Eskin 《Genome biology》2014,15(4):r61

Expression quantitative trait loci (eQTL) mapping is a tool that can systematically identify genetic variation affecting gene expression. eQTL mapping studies have shown that certain genomic locations, referred to as regulatory hotspots, may affect the expression levels of many genes. Recently, studies have shown that various confounding factors may induce spurious regulatory hotspots. Here, we introduce a novel statistical method that effectively eliminates spurious hotspots while retaining genuine hotspots. Applied to simulated and real datasets, we validate that our method achieves greater sensitivity while retaining low false discovery rates compared to previous methods. 相似文献

5.

Surrogate variable analysis using partial least squares (SVA-PLS) in gene expression studies

Chakraborty S Datta S Datta S 《Bioinformatics (Oxford, England)》2012,28(6):799-806

MOTIVATION: In a typical gene expression profiling study, our prime objective is to identify the genes that are differentially expressed between the samples from two different tissue types. Commonly, standard analysis of variance (ANOVA)/regression is implemented to identify the relative effects of these genes over the two types of samples from their respective arrays of expression levels. But, this technique becomes fundamentally flawed when there are unaccounted sources of variability in these arrays (latent variables attributable to different biological, environmental or other factors relevant in the context). These factors distort the true picture of differential gene expression between the two tissue types and introduce spurious signals of expression heterogeneity. As a result, many genes which are actually differentially expressed are not detected, whereas many others are falsely identified as positives. Moreover, these distortions can be different for different genes. Thus, it is also not possible to get rid of these variations by simple array normalizations. This both-way error can lead to a serious loss in sensitivity and specificity, thereby causing a severe inefficiency in the underlying multiple testing problem. In this work, we attempt to identify the hidden effects of the underlying latent factors in a gene expression profiling study by partial least squares (PLS) and apply ANCOVA technique with the PLS-identified signatures of these hidden effects as covariates, in order to identify the genes that are truly differentially expressed between the two concerned tissue types. RESULTS: We compare the performance of our method SVA-PLS with standard ANOVA and a relatively recent technique of surrogate variable analysis (SVA), on a wide variety of simulation settings (incorporating different effects of the hidden variable, under situations with varying signal intensities and gene groupings). In all settings, our method yields the highest sensitivity while maintaining relatively reasonable values for the specificity, false discovery rate and false non-discovery rate. Application of our method to gene expression profiling for acute megakaryoblastic leukemia shows that our method detects an additional six genes, that are missed by both the standard ANOVA method as well as SVA, but may be relevant to this disease, as can be seen from mining the existing literature. 相似文献

6.

Characterizing heterogeneity in leukemic cells using single-cell gene expression analysis

Assieh Saadatpour Guoji Guo Stuart H Orkin Guo-Cheng Yuan 《Genome biology》2014,15(12)

相似文献

7.

Microarray gene expression analysis of murine tumor heterogeneity defined by dynamic contrast-enhanced MRI

Costouros NG Lorang D Zhang Y Miller MS Diehn FE Hewitt SM Knopp MV Li KC Choyke PL Alexander HR Libutti SK 《Molecular imaging》2002,1(3):301-308

相似文献

8.

Meta‐analysis based variable selection for gene expression data

下载免费PDF全文

Quefeng Li Sijian Wang Chiang‐Ching Huang Menggang Yu Jun Shao 《Biometrics》2014,70(4):872-880

相似文献

9.

Bacterial reference genes for gene expression studies by RT-qPCR: survey and analysis

Danilo J. P. Rocha Carolina S. Santos Luis G. C. Pacheco 《Antonie van Leeuwenhoek》2015,108(3):685-693

相似文献

10.

A factor model to analyze heterogeneity in gene expression

Yuna Blum Guillaume Le Mignon Sandrine Lagarrigue David Causeur 《BMC bioinformatics》2010,11(1):368

相似文献

11.

Subtraction-coupled custom microarray analysis for gene discovery and gene expression studies in the CNS

Dougherty JD Geschwind DH 《Chemical senses》2002,27(3):293-298

The revolution in our knowledge about the genomes of organisms gives rise to the question, what do we do with this information? The development of techniques allowing high throughput analysis of RNA and protein expression, such as cDNA microarrays, provide for genome-wide analysis of gene expression. These analyses will help bridge the gap between systems and molecular neuroscience. This review discusses the advantages of using a subtractive hybridization technique, such as a representational difference analysis, to generate a custom cDNA microarray enriched for genes relevant to investigating complex, heterogeneous tissues such as those involved in the chemical senses. Real and hypothetical examples of these experiments are discussed. Benefits of this approach over traditional microarray techniques include having a more relevant clone set, the potential for gene discovery and the creation of a new tool to investigate similar systems. Potential pitfalls may include PCR artifacts and the need for sequencing. However, these disadvantages can be overcome so that the coupling of subtraction techniques to microarray screening can be a fruitful approach to a variety of experimental systems. 相似文献

12.

Thyroglobulin regulates follicular function and heterogeneity by suppressing thyroid-specific gene expression. 总被引：1，自引：0，他引：1

K Suzuki A Mori S Lavaroni L Ulianich E Miyagi J Saito M Nakazato M Pietrarelli N Shafran A Grassadonia W B Kim E Consiglio S Formisano L D Kohn 《Biochimie》1999,81(4):329-340

相似文献

13.

Optimal design and analysis of genetic studies on gene expression

下载免费PDF全文

Fu J Jansen RC 《Genetics》2006,172(3):1993-1999

Whole-genome profiling of gene expression in a segregating population has the potential to identify the regulatory consequences of natural allelic variation. Costs of such studies are high and require that resources--microarrays and population--are used as efficiently as possible. We show that current studies can be improved significantly by a new design for two-color microarrays. Our "distant pair design" profiles twice as many individuals as there are arrays, cohybridizes individuals with dissimilar genomes, gives more weight to known regulatory loci if wished, and therewith maximizes the power for decomposing expression variation into regulatory factors. It can also exploit a large population (larger than twice the number of available microarrays) as a useful resource to select the most dissimilar pairs of individuals from. Our approach identifies more regulatory factors than alternative strategies do in computer simulations for realistic genome sizes, and similar promising results are obtained in an application on Arabidopsis thaliana. Our results will aid the design and analysis of future studies on gene expression and will help to shed more light on gene regulatory networks. 相似文献

14.

Delay-induced transient increase and heterogeneity in gene expression in negatively auto-regulated gene circuits

Maithreye R Sarkar RR Parnaik VK Sinha S 《PloS one》2008,3(8):e2972

相似文献

15.

Correlation-maximizing surrogate gene space for visual mining of gene expression patterns in developing barley endosperm tissue

Marc Strickert Nese Sreenivasulu Björn Usadel Udo Seiffert 《BMC bioinformatics》2007,8(1):165

Background

Micro- and macroarray technologies help acquire thousands of gene expression patterns covering important biological processes during plant ontogeny. Particularly, faithful visualization methods are beneficial for revealing interesting gene expression patterns and functional relationships of coexpressed genes. Such screening helps to gain deeper insights into regulatory behavior and cellular responses, as will be discussed for expression data of developing barley endosperm tissue. For that purpose, high-throughput multidimensional scaling (HiT-MDS), a recent method for similarity-preserving data embedding, is substantially refined and used for (a) assessing the quality and reliability of centroid gene expression patterns, and for (b) derivation of functional relationships of coexpressed genes of endosperm tissue during barley grain development (0–26 days after flowering). 相似文献

16.

Testing for heterogeneity in the utility of a surrogate marker

Layla Parast Tianxi Cai Lu Tian 《Biometrics》2023,79(2):799-810

In studies that require long-term and/or costly follow-up of participants to evaluate a treatment, there is often interest in identifying and using a surrogate marker to evaluate the treatment effect. While several statistical methods have been proposed to evaluate potential surrogate markers, available methods generally do not account for or address the potential for a surrogate to vary in utility or strength by patient characteristics. Previous work examining surrogate markers has indicated that there may be such heterogeneity, that is, that a surrogate marker may be useful (with respect to capturing the treatment effect on the primary outcome) for some subgroups, but not for others. This heterogeneity is important to understand, particularly if the surrogate is to be used in a future trial to replace the primary outcome. In this paper, we propose an approach and estimation procedures to measure the surrogate strength as a function of a baseline covariate W and thus examine potential heterogeneity in the utility of the surrogate marker with respect to W. Within a potential outcome framework, we quantify the surrogate strength/utility using the proportion of treatment effect on the primary outcome that is explained by the treatment effect on the surrogate. We propose testing procedures to test for evidence of heterogeneity, examine finite sample performance of these methods via simulation, and illustrate the methods using AIDS clinical trial data. 相似文献

17.

Measuring gene expression by quantitative proteome analysis 总被引：11，自引：0，他引：11

Gygi SP Rist B Aebersold R 《Current opinion in biotechnology》2000,11(4):396-401

Proteome analysis is most commonly accomplished by the combination of two-dimensional gel electrophoresis for protein separation, visualization, and quantification and mass spectrometry for protein identification. Over the past year, exceptional progress has been made towards developing a new technology base for the precise quantification and identification of proteins in complex mixtures, that is, quantitative proteomics. 相似文献

18.

Global gene expression analysis by combinatorial optimization

Ameur A Aurell E Carlsson M Westholm JO 《In silico biology》2004,4(2):225-241

Generally, there is a trade-off between methods of gene expression analysis that are precise but labor-intensive, e.g. RT-PCR, and methods that scale up to global coverage but are not quite as quantitative, e.g. microarrays. In the present paper, we show how how a known method of gene expression profiling (K. Kato, Nucleic Acids Res. 23, 3685-3690 (1995)), which relies on a fairly small number of steps, can be turned into a global gene expression measurement by advanced data post-processing, with potentially little loss of accuracy. Post-processing here entails solving an ancillary combinatorial optimization problem. Validation is performed on in silico experiments generated from the FANTOM data base of full-length mouse cDNA. We present two variants of the method. One uses state-of-the-art commercial software for solving problems of this kind, the other a code developed by us specifically for this purpose, released in the public domain under GPL license. 相似文献

19.

Comprehensive gene expression analysis by transcript profiling 总被引：20，自引：0，他引：20

Donson J Fang Y Espiritu-Santo G Xing W Salazar A Miyamoto S Armendarez V Volkmuth W 《Plant molecular biology》2002,48(1-2):75-97

相似文献

20.

Modeling overdispersion heterogeneity in differential expression analysis using mixtures

下载免费PDF全文

Elisabetta Bonafede Franck Picard Stéphane Robin Cinzia Viroli 《Biometrics》2016,72(3):804-814

相似文献