1.

"In-gel" purified ditags direct synthesis of highly efficient SAGE Libraries

Mathupala SP Sloan AE 《BMC genomics》2002,3(1):20-5

2.

Improving SAGE di-tag processing

Jacques Colinge Georg Feger 《Genome biology》2001,2(3):preprint00-10

3.

SAGE is far more sensitive than EST for detecting low-abundance transcripts (total citations: 1; self-citations: 0; citations by others: 1)

Sun M Zhou G Lee S Chen J Shi RZ Wang SM 《BMC genomics》2004,5(1):1-4

4.

SAGE: String-overlap Assembly of GEnomes

Lucian Ilie Bahlul Haider Michael Molnar Roberto Solis-Oba 《BMC bioinformatics》2014,15(1)

Background

De novo genome assembly of next-generation sequencing data is one of the most important current problems in bioinformatics, essential in many biological applications. In spite of a significant amount of work in this area, better solutions are still very much needed.

Results

We present a new program, SAGE, for de novo genome assembly. As opposed to most assemblers, which are de Bruijn graph based, SAGE uses the string-overlap graph. SAGE builds upon great existing work on string-overlap graph and maximum likelihood assembly, bringing an important number of new ideas, such as the efficient computation of the transitive reduction of the string overlap graph, the use of (generalized) edge multiplicity statistics for more accurate estimation of read copy counts, and the improved use of mate pairs and min-cost flow for supporting edge merging. The assemblies produced by SAGE for several short and medium-size genomes compared favourably with those of existing leading assemblers.

Conclusions

SAGE benefits from innovations in almost every aspect of the assembly process: error correction of input reads, string-overlap graph construction, read copy counts estimation, overlap graph analysis and reduction, contig extraction, and scaffolding. We hope that these new ideas will help advance the current state-of-the-art in an essential area of research in genomics.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2105-15-302) contains supplementary material, which is available to authorized users.
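The transitive reduction mentioned in the abstract above can be illustrated with a toy sketch. This is not the efficient algorithm used by SAGE, which the abstract does not detail; it is a naive version that only looks for a single intermediate read, and the function name, read names, and edge representation are all assumptions made for illustration.

```python
def transitive_reduction(edges):
    """Drop every edge u->w for which some read v gives u->v and v->w.

    `edges` is a set of (source_read, target_read) pairs; this format is
    an assumption for the sketch, not the assembler's data structure.
    """
    # Build a successor map: read -> set of reads it overlaps into.
    succ = {}
    for u, w in edges:
        succ.setdefault(u, set()).add(w)

    reduced = set()
    for u, w in edges:
        # u->w is redundant if an intermediate read v bridges u and w.
        is_transitive = any(
            w in succ.get(v, ())
            for v in succ[u]
            if v != w and v != u
        )
        if not is_transitive:
            reduced.add((u, w))
    return reduced


# Hypothetical reads: r1 overlaps r2, r2 overlaps r3, and r1 also overlaps r3.
overlaps = {("r1", "r2"), ("r2", "r3"), ("r1", "r3")}
reduced_overlaps = transitive_reduction(overlaps)
```

In this example the edge from r1 to r3 is removed because r2 already connects the two reads, leaving a simple chain.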

5.

Overdispersed logistic regression for SAGE: Modelling multiple groups and covariates

Keith A Baggerly Li Deng Jeffrey S Morris C Marcelo Aldaz 《BMC bioinformatics》2004,5(1):144

Background

Two major identifiable sources of variation in data derived from the Serial Analysis of Gene Expression (SAGE) are within-library sampling variability and between-library heterogeneity within a group. Most published methods for identifying differential expression focus on just the sampling variability. In recent work, the problem of assessing differential expression between two groups of SAGE libraries has been addressed by introducing a beta-binomial hierarchical model that explicitly deals with both of the above sources of variation. This model leads to a test statistic analogous to a weighted two-sample t-test. When the number of groups involved is more than two, however, a more general approach is needed.

6.

A comparative analysis of the information content in long and short SAGE libraries

Yi-Ju Li Puting Xu Xuejun Qin Donald E Schmechel Christine M Hulette Jonathan L Haines Margaret A Pericak-Vance John R Gilbert 《BMC bioinformatics》2006,7(1):504

Background

Serial Analysis of Gene Expression (SAGE) is a powerful tool to determine gene expression profiles. Two types of SAGE libraries, ShortSAGE and LongSAGE, are classified based on the length of the SAGE tag (10 vs. 17 basepairs). LongSAGE libraries are thought to be more useful than ShortSAGE libraries, but their information content has not been widely compared. To dissect the differences between these two types of libraries, we utilized four libraries (two LongSAGE and two ShortSAGE libraries) generated from the hippocampus of Alzheimer and control samples. In addition, we generated two additional short SAGE libraries, the truncated long SAGE libraries (tSAGE), from LongSAGE libraries by deleting seven 5' basepairs from each LongSAGE tag.
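The tSAGE construction described in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' pipeline: it takes the abstract's wording literally, assumes tags are stored as 17-character strings written 5' to 3', and uses invented tag sequences and counts. The function name and the `n_removed` parameter are hypothetical.

```python
from collections import Counter


def truncate_library(long_counts, n_removed=7):
    """Collapse a LongSAGE tag-count table into a truncated (tSAGE) table.

    Assumes each key is a tag string written 5' to 3'; the first
    `n_removed` characters are dropped and counts of tags that become
    identical are summed.
    """
    short_counts = Counter()
    for tag, count in long_counts.items():
        short_counts[tag[n_removed:]] += count
    return short_counts


# Invented 17-bp tags, for illustration only.
long_library = {
    "AAAAAAACCCCCGGGGG": 5,
    "TTTTTTTCCCCCGGGGG": 3,
    "AAAAAAATTTTTGGGGG": 2,
}
tsage = truncate_library(long_library)
```

Here the first two long tags collapse onto the same 10-bp tag, which shows why a truncated library can carry less information than the long library it came from.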

7.

A feature selection approach for identification of signature genes from SAGE data

Junior Barrera Roberto M Cesar Jr Carlos Humes Jr David C Martins Jr Diogo FC Patrão Paulo JS Silva Helena Brentani 《BMC bioinformatics》2007,8(1):169

Background

One goal of gene expression profiling is to identify signature genes that robustly distinguish different types or grades of tumors. Several tumor classifiers based on expression profiling have been proposed using microarray techniques. Due to important differences in the probabilistic models of microarray and SAGE technologies, it is important to develop suitable techniques to select specific genes from SAGE measurements.

8.

Incidence of "quasi-ditags" in catalogs generated by Serial Analysis of Gene Expression (SAGE)

Sergey V Anisimov Alexei A Sharov 《BMC bioinformatics》2004,5(1):152

9.

Modeling SAGE tag formation and its effects on data interpretation within a Bayesian framework

Michael A Gilchrist Hong Qin Russell Zaretzki 《BMC bioinformatics》2007,8(1):403

10.

A seriation approach for visualization-driven discovery of co-expression patterns in Serial Analysis of Gene Expression (SAGE) data

Morozova O Morozov V Hoffman BG Helgason CD Marra MA 《PloS one》2008,3(9):e3205

Background

Serial Analysis of Gene Expression (SAGE) is a DNA sequencing-based method for large-scale gene expression profiling that provides an alternative to microarray analysis. Most analyses of SAGE data aimed at identifying co-expressed genes have been accomplished using various versions of clustering approaches that often result in a number of false positives.

Principal Findings

Here we explore the use of seriation, a statistical approach for ordering sets of objects based on their similarity, for large-scale expression pattern discovery in SAGE data. For this specific task we implement a seriation heuristic we term ‘progressive construction of contigs’ that constructs local chains of related elements by sequentially rearranging margins of the correlation matrix. We apply the heuristic to the analysis of simulated and experimental SAGE data and compare our results to those obtained with a clustering algorithm developed specifically for SAGE data. We show using simulations that the performance of seriation compares favorably to that of the clustering algorithm on noisy SAGE data.

Conclusions

We explore the use of a seriation approach for visualization-based pattern discovery in SAGE data. Using both simulations and experimental data, we demonstrate that seriation is able to identify groups of co-expressed genes more accurately than a clustering algorithm developed specifically for SAGE data. Our results suggest that seriation is a useful method for the analysis of gene expression data whose applicability should be further pursued.

11.

A SAGE based approach to human glomerular endothelium: defining the transcriptome, finding a novel molecule and highlighting endothelial diversity

Guerkan Sengoelge Wolfgang Winnicki Anne Kupczok Arndt von Haeseler Michael Schuster Walter Pfaller Paul Jennings Ansgar Weltermann Sophia Blake Gere Sunder-Plassmann 《BMC genomics》2014,15(1)

12.

Bias correction and Bayesian analysis of aggregate counts in SAGE libraries

Russell L Zaretzki Michael A Gilchrist William M Briggs Artin Armagan 《BMC bioinformatics》2010,11(1):72

13.

Identifying differential expression in multiple SAGE libraries: an overdispersed log-linear model approach

Jun Lu John K Tomfohr Thomas B Kepler 《BMC bioinformatics》2005,6(1):165

Background

In testing for differential gene expression involving multiple serial analysis of gene expression (SAGE) libraries, it is critical to account for both between and within library variation. Several methods have been proposed, including the t test, t_w test, and an overdispersed logistic regression approach. The merits of these tests, however, have not been fully evaluated. Questions still remain on whether further improvements can be made.

14.

Bayesian model accounting for within-class biological variability in Serial Analysis of Gene Expression (SAGE)

Ricardo ZN Vêncio Helena Brentani Diogo FC Patrão Carlos AB Pereira 《BMC bioinformatics》2004,5(1):119

15.

Improved protocol for SAGE tag-to-gene allocation

Kannbley U Kapinya K Dirnagl U Trendelenburg G 《BioTechniques》2003,34(6):1212-4, 1216-9

16.

SAGE detects microRNA precursors

Xijin Ge Qingfa Wu San Ming Wang 《BMC genomics》2006,7(1):285

Background

MicroRNAs (miRNAs) have been shown to play important roles in regulating gene expression. Since miRNAs are often evolutionarily conserved and their precursors can be folded into stem-loop hairpins, many miRNAs have been predicted. Yet experimental confirmation is difficult since miRNA expression is often specific to particular tissues and developmental stages.

Results

Analysis of 29 human and 230 mouse longSAGE libraries revealed the expression of 22 known and 10 predicted mammalian miRNAs. Most were detected in embryonic tissues. Four SAGE tags detected in human embryonic stem cells specifically match a cluster of four human miRNAs (mir-302a, b, c&d) known to be expressed in embryonic stem cells. LongSAGE data also suggest the existence of a mouse homolog of human and rat mir-493.

Conclusion

The observation that some orphan longSAGE tags uniquely match miRNA precursors provides information about the expression of some known and predicted miRNAs.

17.

Strong-association-rule mining for large-scale gene-expression data analysis: a case study on human SAGE data (total citations: 1; self-citations: 0; citations by others: 1)

Becquet C Blachon S Jeudy B Boulicaut JF Gandrillon O 《Genome biology》2002,3(12):research0067.1-research0067.16

Background

The association-rules discovery (ARD) technique has yet to be applied to gene-expression data analysis. Even in the absence of previous biological knowledge, it should identify sets of genes whose expression is correlated. The first association-rule miners appeared six years ago and proved efficient at dealing with sparse and weakly correlated data. A huge international research effort has led to new algorithms for tackling difficult contexts and these are particularly suited to analysis of large gene-expression matrices. To validate the ARD technique we have applied it to freely available human serial analysis of gene expression (SAGE) data.

18.

Pan-genome isolation of low abundance transcripts using SAGE tag

Kim YC Jung YC Xuan Z Dong H Zhang MQ Wang SM 《FEBS letters》2006,580(28-29):6721-6729

19.

Discarding duplicate ditags in LongSAGE analysis may introduce significant error

Jeppe Emmersen Anna M Heidenblut Annabeth Laursen Høgh Stephan A Hahn Karen G Welinder Kåre L Nielsen 《BMC bioinformatics》2007,8(1):92

Background

During gene expression analysis by Serial Analysis of Gene Expression (SAGE), duplicate ditags are routinely removed from the data analysis, because they are suspected to stem from artifacts during SAGE library construction. As a consequence, naturally occurring duplicate ditags are also removed from the analysis leading to an error of measurement.
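The measurement effect described in the abstract above can be shown with a toy count. This sketch is an illustration under stated assumptions, not the authors' analysis: ditags are represented as hypothetical (left tag, right tag) pairs, tag names are placeholders, and details such as tag orientation are ignored.

```python
from collections import Counter


def tag_counts(ditags, discard_duplicates):
    """Count individual tags from a list of (left_tag, right_tag) ditags.

    If `discard_duplicates` is True, identical ditags are collapsed to a
    single observation before counting, mimicking routine duplicate removal.
    """
    if discard_duplicates:
        ditags = set(ditags)
    counts = Counter()
    for left, right in ditags:
        counts[left] += 1
        counts[right] += 1
    return counts


# Placeholder ditags: the pair ("A", "B") is observed twice.
observed = [("A", "B"), ("A", "B"), ("A", "C")]
kept = tag_counts(observed, discard_duplicates=False)
deduplicated = tag_counts(observed, discard_duplicates=True)
```

If the repeated ditag arose naturally rather than from an artifact, the deduplicated table undercounts tags A and B, which is the kind of error the abstract refers to.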

20.

Differential gene expression in an elite hybrid rice cultivar (Oryza sativa, L) and its parental lines based on SAGE data

Shuhui Song Hongzhu Qu Chen Chen Songnian Hu Jun Yu 《BMC plant biology》2007,7(1):49
