首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 671 毫秒
1.

Background  

During gene expression analysis by Serial Analysis of Gene Expression (SAGE), duplicate ditags are routinely removed from the data analysis, because they are suspected to stem from artifacts during SAGE library construction. As a consequence, naturally occurring duplicate ditags are also removed from the analysis leading to an error of measurement.  相似文献   

2.
3.
4.

Background

Serial Analysis of Gene Expression (SAGE) is a DNA sequencing-based method for large-scale gene expression profiling that provides an alternative to microarray analysis. Most analyses of SAGE data aimed at identifying co-expressed genes have been accomplished using various versions of clustering approaches that often result in a number of false positives.

Principal Findings

Here we explore the use of seriation, a statistical approach for ordering sets of objects based on their similarity, for large-scale expression pattern discovery in SAGE data. For this specific task we implement a seriation heuristic we term ‘progressive construction of contigs’ that constructs local chains of related elements by sequentially rearranging margins of the correlation matrix. We apply the heuristic to the analysis of simulated and experimental SAGE data and compare our results to those obtained with a clustering algorithm developed specifically for SAGE data. We show using simulations that the performance of seriation compares favorably to that of the clustering algorithm on noisy SAGE data.

Conclusions

We explore the use of a seriation approach for visualization-based pattern discovery in SAGE data. Using both simulations and experimental data, we demonstrate that seriation is able to identify groups of co-expressed genes more accurately than a clustering algorithm developed specifically for SAGE data. Our results suggest that seriation is a useful method for the analysis of gene expression data whose applicability should be further pursued.  相似文献   

5.
Lee S  Chen J  Zhou G  Wang SM 《BioTechniques》2001,31(2):348-50, 352-4
The serial analysis of gene expression (SAGE) technique is an important tool for genome-wide gene expression analysis. However, the requirement of a large amount of mRNA for the analysis and the difficulties in generating high-quality tag and ditag fragments for the construction of a SAGE library often interfere with the successful performance of the SAGE technique. We developed two procedures to solve these issues: (i) introducing low-cycle PCR amplification of the 3' cDNA before the BsmFI digestion of the 3' cDNAs and (ii) gel purifying the BsmFI-released tag fragments before ditag formation. These modifications provide a large quantity of initial 3' cDNAs and high-quality tags and ditags for the construction of SAGE libraries.  相似文献   

6.
7.
8.

Background  

In this study, we present a robust and reliable computational method for tag-to-gene assignment in serial analysis of gene expression (SAGE). The method relies on current genome information and annotation, incorporation of several new features, and key improvements over alternative methods, all of which are important to determine gene expression levels more accurately. The method provides a complete annotation of potential virtual SAGE tags within a genome, along with an estimation of their confidence for experimental observation that ranks tags that present multiple matches in the genome.  相似文献   

9.
10.
11.

Background  

The Audic-Claverie method [1] has been and still continues to be a popular approach for detection of differentially expressed genes in the SAGE framework. The method is based on the assumption that under the null hypothesis tag counts of the same gene in two libraries come from the same but unknown Poisson distribution. The problem is that each SAGE library represents only a single measurement. We ask: Given that the tag count samples from SAGE libraries are extremely limited, how useful actually is the Audic-Claverie methodology? We rigorously analyze the A-C statistic that forms a backbone of the methodology and represents our knowledge of the underlying tag generating process based on one observation.  相似文献   

12.
SAGE is far more sensitive than EST for detecting low-abundance transcripts   总被引:1,自引:0,他引:1  
Sun M  Zhou G  Lee S  Chen J  Shi RZ  Wang SM 《BMC genomics》2004,5(1):1-4
  相似文献   

13.

Background  

Two major identifiable sources of variation in data derived from the Serial Analysis of Gene Expression (SAGE) are within-library sampling variability and between-library heterogeneity within a group. Most published methods for identifying differential expression focus on just the sampling variability. In recent work, the problem of assessing differential expression between two groups of SAGE libraries has been addressed by introducing a beta-binomial hierarchical model that explicitly deals with both of the above sources of variation. This model leads to a test statistic analogous to a weighted two-sample t-test. When the number of groups involved is more than two, however, a more general approach is needed.  相似文献   

14.

Background  

In testing for differential gene expression involving multiple serial analysis of gene expression (SAGE) libraries, it is critical to account for both between and within library variation. Several methods have been proposed, including the t test, t w test, and an overdispersed logistic regression approach. The merits of these tests, however, have not been fully evaluated. Questions still remain on whether further improvements can be made.  相似文献   

15.
In contrast to DNA chips, serial analysis of gene expression (SAGE) is not dependent on genes having been previously identified for their monitoring. Although useful, the method can be technically challenging, and particularly the last steps including concatenation and cloning may result in less than optimal results. We propose that many of the encountered problems can be attributed to the purification of the 26-bp ditags by polyacrylamide gel electrophoresis. Low yields, gel contaminants, potential exposure to degrading enzymes during handling and lengthy separation all disfavor the method. We introduce purification of 26-bp ditags by reverse-phase high-performance liquid chromatography (HPLC) using polystyrene/divinylbenzene columns and tetraethylammonium acetate buffer with acetonitrile as mobile phase. The method is fast and gives excellent results. Ditags purified by HPLC readily ligate to high-molecular-weight concatemers leading to their efficient cloning. The method should substantially facilitate the construction of SAGE libraries.  相似文献   

16.
17.
18.
19.
20.

Background

De novo genome assembly of next-generation sequencing data is one of the most important current problems in bioinformatics, essential in many biological applications. In spite of significant amount of work in this area, better solutions are still very much needed.

Results

We present a new program, SAGE, for de novo genome assembly. As opposed to most assemblers, which are de Bruijn graph based, SAGE uses the string-overlap graph. SAGE builds upon great existing work on string-overlap graph and maximum likelihood assembly, bringing an important number of new ideas, such as the efficient computation of the transitive reduction of the string overlap graph, the use of (generalized) edge multiplicity statistics for more accurate estimation of read copy counts, and the improved use of mate pairs and min-cost flow for supporting edge merging. The assemblies produced by SAGE for several short and medium-size genomes compared favourably with those of existing leading assemblers.

Conclusions

SAGE benefits from innovations in almost every aspect of the assembly process: error correction of input reads, string-overlap graph construction, read copy counts estimation, overlap graph analysis and reduction, contig extraction, and scaffolding. We hope that these new ideas will help advance the current state-of-the-art in an essential area of research in genomics.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2105-15-302) contains supplementary material, which is available to authorized users.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号