
1.

Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression (Total citations: 1; self-citations: 0; citations by others: 1)

Rodrigo Malig Cristian Varela Eduardo Agosin Francisco Melo 《BMC bioinformatics》2006,7(1):487

Background

In this study, we present a robust and reliable computational method for tag-to-gene assignment in serial analysis of gene expression (SAGE). The method relies on current genome information and annotation, incorporation of several new features, and key improvements over alternative methods, all of which are important to determine gene expression levels more accurately. The method provides a complete annotation of potential virtual SAGE tags within a genome, along with an estimation of their confidence for experimental observation that ranks tags that present multiple matches in the genome.
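As an illustration of what tag-to-gene assignment involves, the following is a minimal sketch of virtual tag extraction. It is not the authors' method: the anchoring site `CATG`, the 10-base tag length, and the function names `virtual_tag`, `build_tag_index`, and `ambiguous_tags` are assumptions made for this example, and the confidence ranking described in the abstract is not implemented.

```python
from collections import defaultdict

ANCHOR = "CATG"  # assumed anchoring-enzyme site; not stated in the abstract
TAG_LEN = 10     # assumed short-SAGE tag length


def virtual_tag(transcript, anchor=ANCHOR, tag_len=TAG_LEN):
    """Return the tag that follows the 3'-most anchor site, or None."""
    seq = transcript.upper()
    pos = seq.rfind(anchor)  # index of the 3'-most anchor site, -1 if absent
    if pos == -1:
        return None
    start = pos + len(anchor)
    if start + tag_len > len(seq):
        return None  # not enough sequence downstream of the site
    return seq[start:start + tag_len]


def build_tag_index(transcripts):
    """Map each virtual tag to the set of genes that can produce it."""
    index = defaultdict(set)
    for gene, seq in transcripts.items():
        tag = virtual_tag(seq)
        if tag is not None:
            index[tag].add(gene)
    return index


def ambiguous_tags(index):
    """Return only the tags that match more than one gene."""
    return {tag: sorted(genes) for tag, genes in index.items() if len(genes) > 1}


# Hypothetical toy transcripts, invented for this example.
transcripts = {
    "geneA": "GGGCATGAAAAACCCCCTTT",
    "geneB": "TTCATGAAAAACCCCCGG",
    "geneC": "CATGTTTTTCATGACGTACGTACAA",
    "geneD": "AAACATGAC",
}
index = build_tag_index(transcripts)
multi = ambiguous_tags(index)
```

Here `geneA` and `geneB` share one virtual tag, which is the kind of multiple match that a ranking by confidence would have to resolve.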

2.

"In-gel" purified ditags direct synthesis of highly efficient SAGE Libraries

Mathupala SP Sloan AE 《BMC genomics》2002,3(1):20-5


3.

A seriation approach for visualization-driven discovery of co-expression patterns in Serial Analysis of Gene Expression (SAGE) data

Morozova O Morozov V Hoffman BG Helgason CD Marra MA 《PloS one》2008,3(9):e3205

Background

Serial Analysis of Gene Expression (SAGE) is a DNA sequencing-based method for large-scale gene expression profiling that provides an alternative to microarray analysis. Most analyses of SAGE data aimed at identifying co-expressed genes have been accomplished using various versions of clustering approaches that often result in a number of false positives.

Principal Findings

Here we explore the use of seriation, a statistical approach for ordering sets of objects based on their similarity, for large-scale expression pattern discovery in SAGE data. For this specific task we implement a seriation heuristic we term ‘progressive construction of contigs’ that constructs local chains of related elements by sequentially rearranging margins of the correlation matrix. We apply the heuristic to the analysis of simulated and experimental SAGE data and compare our results to those obtained with a clustering algorithm developed specifically for SAGE data. We show using simulations that the performance of seriation compares favorably to that of the clustering algorithm on noisy SAGE data.

Conclusions

We explore the use of a seriation approach for visualization-based pattern discovery in SAGE data. Using both simulations and experimental data, we demonstrate that seriation is able to identify groups of co-expressed genes more accurately than a clustering algorithm developed specifically for SAGE data. Our results suggest that seriation is a useful method for the analysis of gene expression data whose applicability should be further pursued.
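To make the idea of seriation concrete, here is a minimal sketch of a generic greedy ordering on a correlation matrix. It is not the 'progressive construction of contigs' heuristic from the paper, whose details the abstract does not give. The function names and the toy expression profiles are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def greedy_seriation(profiles):
    """Order items so that similar profiles end up next to each other.

    Start from the most correlated pair, then repeatedly attach the
    unplaced item that correlates best with either end of the chain.
    """
    names = list(profiles)
    if len(names) < 2:
        return names
    sim = {(a, b): pearson(profiles[a], profiles[b])
           for a in names for b in names if a != b}
    first, second = max(sim, key=sim.get)
    order = [first, second]
    remaining = set(names) - {first, second}
    while remaining:
        best_score, best_side, best_name = None, None, None
        for side, end in (("front", order[0]), ("back", order[-1])):
            for cand in sorted(remaining):  # sorted for reproducibility
                score = sim[(end, cand)]
                if best_score is None or score > best_score:
                    best_score, best_side, best_name = score, side, cand
        if best_side == "front":
            order.insert(0, best_name)
        else:
            order.append(best_name)
        remaining.remove(best_name)
    return order


# Hypothetical tag counts across five libraries.
profiles = {
    "tagA": [1, 2, 3, 4, 5],
    "tagB": [2, 4, 6, 8, 11],
    "tagC": [5, 4, 3, 2, 1],
    "tagD": [9, 8, 5, 4, 1],
}
order = greedy_seriation(profiles)
```

In the resulting order the two rising profiles sit side by side, as do the two falling ones, so co-expressed groups can be read off as contiguous runs.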

4.

A feature selection approach for identification of signature genes from SAGE data

Junior Barrera Roberto M Cesar Jr Carlos Humes Jr David C Martins Jr Diogo FC Patrão Paulo JS Silva Helena Brentani 《BMC bioinformatics》2007,8(1):169

Background

One goal of gene expression profiling is to identify signature genes that robustly distinguish different types or grades of tumors. Several tumor classifiers based on expression profiling have been proposed using microarray techniques. Because the probabilistic models of microarray and SAGE technologies differ in important ways, it is important to develop suitable techniques to select specific genes from SAGE measurements.

5.

Discarding duplicate ditags in LongSAGE analysis may introduce significant error

Jeppe Emmersen Anna M Heidenblut Annabeth Laursen Høgh Stephan A Hahn Karen G Welinder Kåre L Nielsen 《BMC bioinformatics》2007,8(1):92

Background

During gene expression analysis by Serial Analysis of Gene Expression (SAGE), duplicate ditags are routinely removed from the data analysis because they are suspected to stem from artifacts during SAGE library construction. As a consequence, naturally occurring duplicate ditags are also removed from the analysis, leading to a measurement error.
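The effect described above can be sketched with a small example that counts tags with and without discarding duplicate ditags. This is only an illustration: the ditags are modelled as simple `(left, right)` pairs, the function name `tag_counts` is hypothetical, and the toy data are invented.

```python
from collections import Counter


def tag_counts(ditags, discard_duplicates):
    """Count tags from ditags, optionally keeping one copy of each ditag."""
    if discard_duplicates:
        ditags = set(ditags)  # identical (left, right) pairs collapse to one
    counts = Counter()
    for left, right in ditags:
        counts[left] += 1
        counts[right] += 1
    return counts


# Hypothetical ditags; the first two are exact duplicates.
ditags = [
    ("AAAAAAAAAA", "CCCCCCCCCC"),
    ("AAAAAAAAAA", "CCCCCCCCCC"),
    ("AAAAAAAAAA", "GGGGGGGGGG"),
    ("TTTTTTTTTT", "CCCCCCCCCC"),
]
kept = tag_counts(ditags, discard_duplicates=False)
dropped = tag_counts(ditags, discard_duplicates=True)
```

If the duplicated ditag arose naturally from two abundant transcripts, discarding it lowers the counts of both tags, which is the error the abstract refers to.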

6.

Identitag, a relational database for SAGE tag identification and interspecies comparison of SAGE libraries

Céline Keime Francesca Damiola Dominique Mouchiroud Laurent Duret Olivier Gandrillon 《BMC bioinformatics》2004,5(1):143


7.

Improving SAGE di-tag processing


Jacques Colinge Georg Feger 《Genome biology》2001,2(3):preprint00-10


8.

Bias correction and Bayesian analysis of aggregate counts in SAGE libraries

Russell L Zaretzki Michael A Gilchrist William M Briggs Artin Armagan 《BMC bioinformatics》2010,11(1):72


9.

A comparative analysis of the information content in long and short SAGE libraries

Yi-Ju Li Puting Xu Xuejun Qin Donald E Schmechel Christine M Hulette Jonathan L Haines Margaret A Pericak-Vance John R Gilbert 《BMC bioinformatics》2006,7(1):504

Background

Serial Analysis of Gene Expression (SAGE) is a powerful tool to determine gene expression profiles. Two types of SAGE libraries, ShortSAGE and LongSAGE, are classified based on the length of the SAGE tag (10 vs. 17 basepairs). LongSAGE libraries are thought to be more useful than ShortSAGE libraries, but their information content has not been widely compared. To dissect the differences between these two types of libraries, we utilized four libraries (two LongSAGE and two ShortSAGE libraries) generated from the hippocampus of Alzheimer and control samples. In addition, we generated two additional short SAGE libraries, the truncated long SAGE libraries (tSAGE), from LongSAGE libraries by deleting seven 5' basepairs from each LongSAGE tag.
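The truncation step can be sketched as follows, assuming a library is held as a mapping from tag sequence to count. The function name `truncate_library` and its `side` parameter are hypothetical; the default follows the wording of the abstract (seven basepairs removed at the 5' end), and the toy tags are invented.

```python
from collections import Counter


def truncate_library(counts, drop=7, side="5prime"):
    """Shorten every tag by `drop` bases and merge tags that become equal."""
    if side not in ("5prime", "3prime"):
        raise ValueError("side must be '5prime' or '3prime'")
    truncated = Counter()
    for tag, n in counts.items():
        if len(tag) <= drop:
            raise ValueError(f"tag {tag!r} is too short to truncate")
        short = tag[drop:] if side == "5prime" else tag[:-drop]
        truncated[short] += n  # counts of collapsed tags are summed
    return truncated


# Hypothetical 17-base LongSAGE tags with counts.
long_library = {
    "AAAAAAACCCCCCCCCC": 3,
    "GGGGGGGCCCCCCCCCC": 2,
    "AAAAAAATTTTTTTTTT": 1,
}
short_library = truncate_library(long_library)
```

Two distinct long tags collapse onto the same 10-base tag here, which illustrates how information can be lost when moving from long to short tags.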

10.

Overdispersed logistic regression for SAGE: Modelling multiple groups and covariates

Keith A Baggerly Li Deng Jeffrey S Morris C Marcelo Aldaz 《BMC bioinformatics》2004,5(1):144

Background

Two major identifiable sources of variation in data derived from the Serial Analysis of Gene Expression (SAGE) are within-library sampling variability and between-library heterogeneity within a group. Most published methods for identifying differential expression focus on just the sampling variability. In recent work, the problem of assessing differential expression between two groups of SAGE libraries has been addressed by introducing a beta-binomial hierarchical model that explicitly deals with both of the above sources of variation. This model leads to a test statistic analogous to a weighted two-sample t-test. When the number of groups involved is more than two, however, a more general approach is needed.
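The distinction between sampling variability and between-library heterogeneity can be illustrated with a generic Pearson dispersion statistic. This is a standard diagnostic, not the beta-binomial model or the regression approach of the paper; the function name and the counts below are hypothetical.

```python
def pearson_dispersion(counts, sizes):
    """Pearson chi-square divided by its degrees of freedom for one tag.

    Values near 1 are consistent with binomial sampling alone; values
    well above 1 suggest extra between-library variation.
    """
    k = len(counts)
    if k < 2 or k != len(sizes):
        raise ValueError("need matching counts and sizes for >= 2 libraries")
    p = sum(counts) / sum(sizes)  # pooled proportion of this tag
    if p <= 0.0 or p >= 1.0:
        raise ValueError("pooled proportion must lie strictly between 0 and 1")
    chi2 = sum((y - n * p) ** 2 / (n * p * (1 - p))
               for y, n in zip(counts, sizes))
    return chi2 / (k - 1)


# Hypothetical counts of one tag in four libraries of 10,000 tags each.
sizes = [10000, 10000, 10000, 10000]
homogeneous = pearson_dispersion([10, 10, 10, 10], sizes)
heterogeneous = pearson_dispersion([5, 40, 10, 60], sizes)
```

A statistic far above 1, as for the second set of counts, is the situation in which methods that model only sampling variability tend to overstate significance.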

11.

Discovering semantic features in the literature: a foundation for building functional associations

Monica Chagoyen Pedro Carmona-Saez Hagit Shatkay Jose M Carazo Alberto Pascual-Montano 《BMC bioinformatics》2006,7(1):41

Background

Experimental techniques such as DNA microarray, serial analysis of gene expression (SAGE) and mass spectrometry proteomics, among others, are generating large amounts of data related to genes and proteins at different levels. As in any other experimental approach, it is necessary to analyze these data in the context of previously known information about the biological entities under study. The literature is a particularly valuable source of information for experiment validation and interpretation. Therefore, the development of automated text mining tools to assist in such interpretation is one of the main challenges in current bioinformatics research.

12.

Transcriptional profiling of inductive mesenchyme to identify molecules involved in prostate development and disease


Vanpoucke G Orr B Grace OC Chan R Ashley GR Williams K Franco OE Hayward SW Thomson AA 《Genome biology》2007,8(10):R213

Background

The mesenchymal compartment plays a key role in organogenesis, and cells within the mesenchyme/stroma are a source of potent molecules that control epithelia during development and tumorigenesis. We used serial analysis of gene expression (SAGE) to profile a key subset of prostatic mesenchyme that regulates prostate development and is enriched for growth-regulatory molecules.

13.

Modeling SAGE data with a truncated gamma-Poisson model

Helene H Thygesen Aeilko H Zwinderman 《BMC bioinformatics》2006,7(1):157-9

Background

Serial Analysis of Gene Expression (SAGE) produces gene expression measurements on a discrete scale, due to the finite number of molecules in the sample. This means that part of the variance in SAGE data should be understood as the sampling error in a binomial or Poisson distribution, whereas other variance sources, in particular biological variance, should be modeled using a continuous distribution function, i.e. a prior on the intensity of the Poisson distribution. One challenge is that such a model predicts a large number of genes with zero counts, which cannot be observed.
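One common way to handle unobservable zero counts is to condition the marginal distribution on a count of at least one. The sketch below does this for a Poisson distribution with a gamma prior on its intensity, whose marginal is the negative binomial. Whether this matches the truncation used in the paper is not stated in the abstract, so treat it as an assumption; the function names and the parameterisation by `shape` and `mean` are choices made for this example.

```python
from math import exp, lgamma, log


def nb_logpmf(y, shape, mean):
    """Log-probability of count y under a gamma-Poisson (negative binomial).

    `shape` is the gamma shape parameter and `mean` the marginal mean;
    both must be positive.
    """
    p = shape / (shape + mean)
    return (lgamma(y + shape) - lgamma(shape) - lgamma(y + 1)
            + shape * log(p) + y * log(1.0 - p))


def zero_truncated_nb_pmf(y, shape, mean):
    """P(Y = y | Y >= 1): zero counts are treated as unobservable."""
    if y < 1:
        return 0.0
    p_zero = exp(nb_logpmf(0, shape, mean))
    return exp(nb_logpmf(y, shape, mean)) / (1.0 - p_zero)


# Hypothetical parameter values chosen only for illustration.
shape, mean = 0.5, 2.0
total = sum(zero_truncated_nb_pmf(y, shape, mean) for y in range(1, 501))
```

Renormalising by `1 - P(Y = 0)` keeps the probabilities of the observable counts summing to one, so the likelihood only involves tags that were actually seen.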

14.

Bayesian model accounting for within-class biological variability in Serial Analysis of Gene Expression (SAGE)

Ricardo ZN Vêncio Helena Brentani Diogo FC Patrão Carlos AB Pereira 《BMC bioinformatics》2004,5(1):119


15.

Too much data, but little inter-changeability: a lesson learned from mining public data on tissue specificity of gene expression

Shuyu Li Yiqun Helen Li Tao Wei Eric Wen Su Kevin Duffin Birong Liao 《Biology direct》2006,1(1):33-13

Background

The tissue expression pattern of a gene often provides an important clue to its potential role in a biological process. A vast amount of gene expression data has been, and continues to be, accumulated in public repositories through different technology platforms. However, exploitation of these rich data sources remains limited, in part due to issues of technology standardization. Our objective is to test the data comparability between SAGE and microarray technologies by examining the expression pattern of genes under normal physiological states across a variety of tissues.

16.

Identification of novel transcripts with differential dorso-ventral expression in Xenopus gastrula using serial analysis of gene expression

Fernando Faunes Natalia Sánchez Javier Castellanos Ismael A Vergara Francisco Melo Juan Larraín 《Genome biology》2009,10(2):R15-16


17.

Identification of transcripts with enriched expression in the developing and adult pancreas

Hoffman BG Zavaglia B Witzsche J Ruiz de Algara T Beach M Hoodless PA Jones SJ Marra MA Helgason CD 《Genome biology》2008,9(6):R99-19


18.

Unexpected observations after mapping LongSAGE tags to the human genome

Céline Keime Dominique Mouchiroud Laurent Duret Olivier Gandrillon 《BMC bioinformatics》2007,8(1):154


19.

Identifying differential expression in multiple SAGE libraries: an overdispersed log-linear model approach

Jun Lu John K Tomfohr Thomas B Kepler 《BMC bioinformatics》2005,6(1):165

Background

In testing for differential gene expression involving multiple serial analysis of gene expression (SAGE) libraries, it is critical to account for both between- and within-library variation. Several methods have been proposed, including the t test, the t_w test, and an overdispersed logistic regression approach. The merits of these tests, however, have not been fully evaluated. Questions still remain on whether further improvements can be made.

20.

Function of the PHA-4/FOXA transcription factor during C. elegans post-embryonic development

Di Chen Donald L Riddle 《BMC developmental biology》2008,8(1):26
