期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Density based pruning for identification of differentially expressed genes from microarray data

Hu J Xu J 《BMC genomics》2010,11(Z2):S3

Motivation

Identification of differentially expressed genes from microarray datasets is one of the most important analyses for microarray data mining. Popular algorithms such as statistical t-test rank genes based on a single statistics. The false positive rate of these methods can be improved by considering other features of differentially expressed genes.

Results

We proposed a pattern recognition strategy for identifying differentially expressed genes. Genes are mapped to a two dimension feature space composed of average difference of gene expression and average expression levels. A density based pruning algorithm (DB Pruning) is developed to screen out potential differentially expressed genes usually located in the sparse boundary region. Biases of popular algorithms for identifying differentially expressed genes are visually characterized. Experiments on 17 datasets from Gene Omnibus Database (GEO) with experimentally verified differentially expressed genes showed that DB pruning can significantly improve the prediction accuracy of popular identification algorithms such as t-test, rank product, and fold change.

Conclusions

Density based pruning of non-differentially expressed genes is an effective method for enhancing statistical testing based algorithms for identifying differentially expressed genes. It improves t-test, rank product, and fold change by 11% to 50% in the numbers of identified true differentially expressed genes. The source code of DB pruning is freely available on our website http://mleg.cse.sc.edu/degprune

相似文献

2.

Gene Expression Analysis Suggests Bone Development-Related Genes GDF5 and DIO2 Are Involved in the Development of Kashin-Beck Disease in Children Rather than Adults

Yan Wen Feng Zhang Chunyan Li Shulan He Wuhong Tan Yanxia Lei Qiang Zhang Hanjie Yu Jingjing Zheng Xiong Guo 《PloS one》2014,9(7)

相似文献

3.

Transcriptome profiling during a natural host-parasite interaction

Seanna J. McTaggart Timothée Cézard Jennie S. Garbutt Phil J. Wilson Tom J. Little 《BMC genomics》2015,16(1)

相似文献

4.

Differential representation of sunflower ESTs in enriched organ-specific cDNA libraries in a small scale sequencing project

Fernández P Paniego N Lew S Hopp HE Heinz RA 《BMC genomics》2003,4(1):40

相似文献

5.

Quantitative Proteomic Analysis of Serum from Pregnant Women Carrying a Fetus with Conotruncal Heart Defect Using Isobaric Tags for Relative and Absolute Quantitation (iTRAQ) Labeling

Ying Zhang Yuan Kang Qiongjie Zhou Jizi Zhou Huijun Wang Hong Jin Xiaohui Liu Duan Ma Xiaotian Li 《PloS one》2014,9(11)

Objective

To identify differentially expressed proteins from serum of pregnant women carrying a conotruncal heart defects (CTD) fetus, using proteomic analysis.

Methods

The study was conducted using a nested case-control design. The 5473 maternal serum samples were collected at 14–18 weeks of gestation. The serum from 9 pregnant women carrying a CTD fetus, 10 with another CHD (ACHD) fetus, and 11 with a normal fetus were selected from the above samples, and analyzed by using isobaric tags for relative and absolute quantitation (iTRAQ) coupled with two-dimensional liquid chromatography-tandem mass spectrometry(2D LC-MS/MS). The differentially expressed proteins identified by iTRAQ were further validated with Western blot.

Results

A total of 105 unique proteins present in the three groups were identified, and relative expression data were obtained for 92 of them with high confidence by employing the iTRAQ-based experiments. The downregulation of gelsolin in maternal serum of fetus with CTD was further verified by Western blot.

Conclusions

The identification of differentially expressed protein gelsolin in the serum of the pregnant women carrying a CTD fetus by using proteomic technology may be able to serve as a foundation to further explore the biomarker for detection of CTD fetus from the maternal serum. 相似文献

6.

Dissecting systems-wide data using mixture models: application to identify affected cellular processes

J?Peter?Svensson Renée?X?de Menezes Ingela?Turesson Micheline?Giphart-Gassler Harry?Vrieling Email author 《BMC bioinformatics》2005,6(1):177

Background

Functional analysis of data from genome-scale experiments, such as microarrays, requires an extensive selection of differentially expressed genes. Under many conditions, the proportion of differentially expressed genes is considerable, making the selection criteria a balance between the inclusion of false positives and the exclusion of false negatives. 相似文献

7.

Comparative Expression Profiles of Midgut Genes in Dengue Virus Refractory and Susceptible Aedes aegypti across Critical Period for Virus Infection

Chitra Chauhan Susanta K. Behura Becky deBruyn Diane D. Lovin Brent W. Harker Consuelo Gomez-Machorro Akio Mori Jeanne Romero-Severson David W. Severson 《PloS one》2012,7(10)

相似文献

8.

Biomarker discovery in heterogeneous tissue samples -taking the in-silico deconfounding approach

Dirk Repsilber Sabine Kern Anna Telaar Gerhard Walzl Gillian F Black Joachim Selbig Shreemanta K Parida Stefan HE Kaufmann Marc Jacobsen 《BMC bioinformatics》2010,11(1):27

Background

For heterogeneous tissues, such as blood, measurements of gene expression are confounded by relative proportions of cell types involved. Conclusions have to rely on estimation of gene expression signals for homogeneous cell populations, e.g. by applying micro-dissection, fluorescence activated cell sorting, or in-silico deconfounding. We studied feasibility and validity of a non-negative matrix decomposition algorithm using experimental gene expression data for blood and sorted cells from the same donor samples. Our objective was to optimize the algorithm regarding detection of differentially expressed genes and to enable its use for classification in the difficult scenario of reversely regulated genes. This would be of importance for the identification of candidate biomarkers in heterogeneous tissues. 相似文献

9.

Investigating the effect of paralogs on microarray gene-set analysis

Andre J Faure Cathal Seoighe Nicola J Mulder 《BMC bioinformatics》2011,12(1):29

Background

In order to interpret the results obtained from a microarray experiment, researchers often shift focus from analysis of individual differentially expressed genes to analyses of sets of genes. These gene-set analysis (GSA) methods use previously accumulated biological knowledge to group genes into sets and then aim to rank these gene sets in a way that reflects their relative importance in the experimental situation in question. We suspect that the presence of paralogs affects the ability of GSA methods to accurately identify the most important sets of genes for subsequent research. 相似文献

10.

Genome-wide gene expression analysis suggests an important role of suppressed immunity in pathogenesis of Kashin-Beck disease

Wang S Guo X Wu XM Lammi MJ 《PloS one》2012,7(1):e28439

相似文献

11.

Intertwining threshold settings, biological data and database knowledge to optimize the selection of differentially expressed genes from microarray

Chuchana P Holzmuller P Vezilier F Berthier D Chantal I Severac D Lemesre JL Cuny G Nirdé P Bucheton B 《PloS one》2010,5(10):e13518

相似文献

12.

Impact of Library Preparation on Downstream Analysis and Interpretation of RNA-Seq Data: Comparison between Illumina PolyA and NuGEN Ovation Protocol

Zhifu Sun Yan W. Asmann Asha Nair Yuji Zhang Liguo Wang Krishna R. Kalari Aditya V. Bhagwate Tiffany R. Baker Jennifer M. Carr Jean-Pierre A. Kocher Edith A. Perez E. Aubrey Thompson 《PloS one》2013,8(8)

Objectives

The sequencing by the PolyA selection is the most common approach for library preparation. With limited amount or degraded RNA, alternative protocols such as the NuGEN have been developed. However, it is not yet clear how the different library preparations affect the downstream analyses of the broad applications of RNA sequencing.

Methods and Materials

Eight human mammary epithelial cell (HMEC) lines with high quality RNA were sequenced by Illumina’s mRNA-Seq PolyA selection and NuGEN ENCORE library preparation. The following analyses and comparisons were conducted: 1) the numbers of genes captured by each protocol; 2) the impact of protocols on differentially expressed gene detection between biological replicates; 3) expressed single nucleotide variant (SNV) detection; 4) non-coding RNAs, particularly lincRNA detection; and 5) intragenic gene expression.

Results

Sequences from the NuGEN protocol had lower (75%) alignment rate than the PolyA (over 90%). The NuGEN protocol detected fewer genes (12–20% less) with a significant portion of reads mapped to non-coding regions. A large number of genes were differentially detected between the two protocols. About 17–20% of the differentially expressed genes between biological replicates were commonly detected between the two protocols. Significantly higher numbers of SNVs (5–6 times) were detected in the NuGEN samples, which were largely from intragenic and intergenic regions. The NuGEN captured fewer exons (25% less) and had higher base level coverage variance. While 6.3% of reads were mapped to intragenic regions in the PolyA samples, the percentages were much higher (20–25%) for the NuGEN samples. The NuGEN protocol did not detect more known non-coding RNAs such as lincRNAs, but targeted small and “novel” lincRNAs.

Conclusion

Different library preparations can have significant impacts on downstream analysis and interpretation of RNA-seq data. The NuGEN provides an alternative for limited or degraded RNA but it has limitations for some RNA-seq applications. 相似文献

13.

Testing for mean and correlation changes in microarray experiments: an application for pathway analysis

Mayer Alvo Zhongzhu Liu Andrew Williams Carole Yauk 《BMC bioinformatics》2010,11(1):60

相似文献

14.

Principal components analysis based methodology to identify differentially expressed genes in time-course microarray data

Sudhakar Jonnalagadda Rajagopalan Srinivasan 《BMC bioinformatics》2008,9(1):267

Background

Time-course microarray experiments are being increasingly used to characterize dynamic biological processes. In these experiments, the goal is to identify genes differentially expressed in time-course data, measured between different biological conditions. These differentially expressed genes can reveal the changes in biological process due to the change in condition which is essential to understand differences in dynamics. 相似文献

15.

Metatranscriptomic profiles of Eastern subterranean termites,Reticulitermes flavipes (Kollar) fed on second generation feedstocks

Swapna Priya Rajarapu Jacob T Shreve Ketaki P Bhide Jyothi Thimmapuram Michael E Scharf 《BMC genomics》2015,16(1)

相似文献

16.

Correlation and prediction of gene expression level from amino acid and dipeptide composition of its protein

Gajendra?PS?Raghava Email author Joon?H?Han 《BMC bioinformatics》2005,6(1):59

Background

A large number of papers have been published on analysis of microarray data with particular emphasis on normalization of data, detection of differentially expressed genes, clustering of genes and regulatory network. On other hand there are only few studies on relation between expression level and composition of nucleotide/protein sequence, using expression data. There is a need to understand why particular genes/proteins express more in particular conditions. In this study, we analyze 3468 genes of Saccharomyces cerevisiae obtained from Holstege et al., (1998) to understand the relationship between expression level and amino acid composition. 相似文献

17.

VennMaster: Area-proportional Euler diagrams for functional GO analysis of microarrays

Hans A Kestler André Müller Johann M Kraus Malte Buchholz Thomas M Gress Hongfang Liu David W Kane Barry R Zeeberg John N Weinstein 《BMC bioinformatics》2008,9(1):67

Background

Microarray experiments generate vast amounts of data. The functional context of differentially expressed genes can be assessed by querying the Gene Ontology (GO) database via GoMiner. Directed acyclic graph representations, which are used to depict GO categories enriched with differentially expressed genes, are difficult to interpret and, depending on the particular analysis, may not be well suited for formulating new hypotheses. Additional graphical methods are therefore needed to augment the GO graphical representation. 相似文献

18.

Sex genes for genomic analysis in human brain: internal controls for comparison of probe level data extraction.

Hanga?C?Galfalvy Loubna?Erraji-Benchekroun Peggy?Smyrniotopoulos Paul?Pavlidis Steven?P?Ellis J?John?Mann Etienne?Sibille Email author Victoria?Arango 《BMC bioinformatics》2003,4(1):37

Background

Genomic studies of complex tissues pose unique analytical challenges for assessment of data quality, performance of statistical methods used for data extraction, and detection of differentially expressed genes. Ideally, to assess the accuracy of gene expression analysis methods, one needs a set of genes which are known to be differentially expressed in the samples and which can be used as a "gold standard". We introduce the idea of using sex-chromosome genes as an alternative to spiked-in control genes or simulations for assessment of microarray data and analysis methods.

Results

Expression of sex-chromosome genes were used as true internal biological controls to compare alternate probe-level data extraction algorithms (Microarray Suite 5.0 [MAS5.0], Model Based Expression Index [MBEI] and Robust Multi-array Average [RMA]), to assess microarray data quality and to establish some statistical guidelines for analyzing large-scale gene expression. These approaches were implemented on a large new dataset of human brain samples. RMA-generated gene expression values were markedly less variable and more reliable than MAS5.0 and MBEI-derived values. A statistical technique controlling the false discovery rate was applied to adjust for multiple testing, as an alternative to the Bonferroni method, and showed no evidence of false negative results. Fourteen probesets, representing nine Y- and two X-chromosome linked genes, displayed significant sex differences in brain prefrontal cortex gene expression.

Conclusion

In this study, we have demonstrated the use of sex genes as true biological internal controls for genomic analysis of complex tissues, and suggested analytical guidelines for testing alternate oligonucleotide microarray data extraction protocols and for adjusting multiple statistical analysis of differentially expressed genes. Our results also provided evidence for sex differences in gene expression in the brain prefrontal cortex, supporting the notion of a putative direct role of sex-chromosome genes in differentiation and maintenance of sexual dimorphism of the central nervous system. Importantly, these analytical approaches are applicable to all microarray studies that include male and female human or animal subjects.

相似文献

19.

Statistical Test of Expression Pattern (STEPath): a new strategy to integrate gene expression data with genomic information in individual and meta-analysis studies

Paolo Martini Davide Risso Gabriele Sales Chiara Romualdi Gerolamo Lanfranchi Stefano Cagnin 《BMC bioinformatics》2011,12(1):92

相似文献

20.

A multiple near isogenic line (multi-NIL) RNA-seq approach to identify candidate genes underpinning QTL

Ahsan Habib Jonathan J. Powell Jiri Stiller Miao Liu Sergey Shabala Meixue Zhou Donald M. Gardiner Chunji Liu 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2018,131(3):613-624

相似文献