1.

SOAPfuse: an algorithm for identifying fusion transcripts from paired-end RNA-Seq data

Wenlong Jia Kunlong Qiu Minghui He Pengfei Song Quan Zhou Feng Zhou Yuan Yu Dandan Zhu Michael L Nickerson Shengqing Wan Xiangke Liao Xiaoqian Zhu Shaoliang Peng Yingrui Li Jun Wang Guangwu Guo 《Genome biology》2013,14(2):R12

2.

deFuse: an algorithm for gene fusion discovery in tumor RNA-Seq data

McPherson A Hormozdiari F Zayed A Giuliany R Ha G Sun MG Griffith M Heravi Moussavi A Senz J Melnyk N Pacheco M Marra MA Hirst M Nielsen TO Sahinalp SC Huntsman D Shah SP 《PLoS computational biology》2011,7(5):e1001138

3.

Identification of leukemia-specific fusion gene transcripts with a novel oligonucleotide array

Chun SM Kim YL Choi HB Oh YT Kim YJ Lee S Kim TG Yang EG Park YK Kim DW Han BD 《Molecular diagnosis & therapy》2007,11(1):21-28

4.

FoldMiner: structural motif discovery using an improved superposition algorithm (cited by: 5 total, 0 self-citations, 5 by others)

Shapiro J Brutlag D 《Protein science : a publication of the Protein Society》2004,13(1):278-294

We report an unsupervised structural motif discovery algorithm, FoldMiner, which is able to detect global and local motifs in a database of proteins without the need for multiple structure or sequence alignments and without relying on prior classification of proteins into families. Motifs, which are discovered from pairwise superpositions of a query structure to a database of targets, are described probabilistically in terms of the conservation of each secondary structure element's position and are used to improve detection of distant structural relationships. During each iteration of the algorithm, the motif is defined from the current set of homologs and is used both to recruit additional homologous structures and to discard false positives. FoldMiner thus achieves high specificity and sensitivity by distinguishing between homologous and nonhomologous structures by the regions of the query to which they align. We find that when two proteins of the same fold are aligned, highly conserved secondary structure elements in one protein tend to align to highly conserved elements in the second protein, suggesting that FoldMiner consistently identifies the same motif in members of a fold. Structural alignments are performed by an improved superposition algorithm, LOCK 2, which detects distant structural relationships by placing increased emphasis on the alignment of secondary structure elements. LOCK 2 obeys several properties essential in automated analysis of protein structure: It is symmetric, its alignments of secondary structure elements are transitive, its alignments of residues display a high degree of transitivity, and its scoring system is empirically found to behave as a metric.
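
The claim that a scoring system "behaves as a metric" can be checked empirically on a table of pairwise values. The sketch below is not LOCK 2 itself: it assumes the alignment scores have already been converted into a square table of pairwise dissimilarities, and the function name `is_metric` and the tolerance `tol` are hypothetical choices made for illustration.

```python
from itertools import permutations


def is_metric(dist, tol=1e-9):
    """Check metric properties on a square table of pairwise dissimilarities.

    `dist` is assumed (hypothetically) to hold one dissimilarity per pair of
    structures; this says nothing about how LOCK 2 computes its scores.
    """
    n = len(dist)
    for i in range(n):
        # Identity: every structure is at distance zero from itself.
        if abs(dist[i][i]) > tol:
            return False
        for j in range(n):
            # Non-negativity and symmetry.
            if dist[i][j] < -tol:
                return False
            if abs(dist[i][j] - dist[j][i]) > tol:
                return False
    # Triangle inequality over every ordered triple of distinct structures.
    for i, j, k in permutations(range(n), 3):
        if dist[i][k] > dist[i][j] + dist[j][k] + tol:
            return False
    return True
```

A table that passes this check for all observed triples is only empirically metric-like, which matches the hedged wording of the abstract.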

5.

Motif discovery using an immune genetic algorithm

Jia-wei Luo Ting Wang 《Journal of theoretical biology》2010,264(2):319-325

In this paper, a new immune genetic algorithm for motif discovery is proposed. The algorithm adopts concentration regulation mechanism to maintain the population diversity and vaccine mechanism to inhibit degeneracy during evolution. Experimental results have demonstrated the method's capacity to find known motifs in relatively long promoter sequences and multiple motifs within a single run.

6.

The discovery of BMS-275183: an orally efficacious novel taxane

Mastalerz H Cook D Fairchild CR Hansel S Johnson W Kadow JF Long BH Rose WC Tarrant J Wu MJ Xue MQ Zhang G Zoeckler M Vyas DM 《Bioorganic & medicinal chemistry》2003,11(20):4315-4323

The evolution of 2, a C-4-methylcarbonate analogue of paclitaxel with minimal oral bioavailability and oral efficacy, into its C-3'-t-butyl-3'-N-t-butyloxycarbonyl analogue (15i), a novel taxane with oral efficacy in preclinical models that is comparable to iv administered paclitaxel, is described.

7.

In silico discovery of human natural antisense transcripts

Yuan-Yuan Li Lei Qin Zong-Ming Guo Lei Liu Hao Xu Pei Hao Jiong Su Yixiang Shi Wei-Zhong He Yi-Xue Li 《BMC bioinformatics》2006,7(1):18-8

8.

A novel bioinformatics pipeline for identification and characterization of fusion transcripts in breast cancer and normal cell lines

Asmann YW Hossain A Necela BM Middha S Kalari KR Sun Z Chai HS Williamson DW Radisky D Schroth GP Kocher JP Perez EA Thompson EA 《Nucleic acids research》2011,39(15):e100

9.

Targeted next-generation sequencing of a cancer transcriptome enhances detection of sequence variants and novel fusion transcripts

Joshua Z Levin Michael F Berger Xian Adiconis Peter Rogov Alexandre Melnikov Timothy Fennell Chad Nusbaum Levi A Garraway Andreas Gnirke 《Genome biology》2009,10(10):R115-8

10.

FusionHunter: identifying fusion transcripts in cancer using paired-end RNA-seq

Li Y Chien J Smith DI Ma J 《Bioinformatics (Oxford, England)》2011,27(12):1708-1710

11.

LocExpress: a web server for efficiently estimating expression of novel transcripts

Hou Mei Tian Feng Jiang Shuai Kong Lei Yang Dechang Gao Ge 《BMC genomics》2016,17(13):1023-179

12.

Genes@Work: an efficient algorithm for pattern discovery and multivariate feature selection in gene expression data (cited by: 2 total, 0 self-citations, 2 by others)

Lepre J Rice JJ Tu Y Stolovitzky G 《Bioinformatics (Oxford, England)》2004,20(7):1033-1044

MOTIVATION: Despite the growing literature devoted to finding differentially expressed genes in assays probing different tissue types, little attention has been paid to the combinatorial nature of feature selection inherent to large, high-dimensional gene expression datasets. New flexible data analysis approaches capable of searching relevant subgroups of genes and experiments are needed to understand multivariate associations of gene expression patterns with observed phenotypes. RESULTS: We present in detail a deterministic algorithm to discover patterns of multivariate gene associations in gene expression data. The patterns discovered are differential with respect to a control dataset. The algorithm is exhaustive and efficient, reporting all existent patterns that fit a given input parameter set while avoiding enumeration of the entire pattern space. The value of the pattern discovery approach is demonstrated by finding a set of genes that differentiate between two types of lymphoma. Moreover, these genes are found to behave consistently in an independent dataset produced in a different laboratory using different arrays, thus validating the genes selected using our algorithm. We show that the genes deemed significant in terms of their multivariate statistics will be missed using other methods. AVAILABILITY: Our set of pattern discovery algorithms including a user interface is distributed as a package called Genes@Work. This package is freely available to non-commercial users and can be downloaded from our website (http://www.research.ibm.com/FunGen).

13.

FusionAnalyser: a new graphical, event-driven tool for fusion rearrangements discovery

Piazza R Pirola A Spinelli R Valletta S Redaelli S Magistroni V Gambacorti-Passerini C 《Nucleic acids research》2012,40(16):e123

14.

A cluster refinement algorithm for motif discovery

Li G Chan TM Leung KS Lee KH 《IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM》2010,7(4):654-668

15.

Tree-structured algorithm for long weak motif discovery

Sun HQ Low MY Hsu WJ Tan CW Rajapakse JC 《Bioinformatics (Oxford, England)》2011,27(19):2641-2647

16.

Human CMV transcripts: an overview

Ma Y Wang N Li M Gao S Wang L Zheng B Qi Y Ruan Q 《Future microbiology》2012,7(5):577-593

17.

Multiplex fluorescent RT-PCR to quantify leukemic fusion transcripts

Dupont M Goldsborough A Levayer T Savare J Rey JM Rossi JF Demaille J Lavabre-Bertrand T 《BioTechniques》2002,33(1):158-60, 162, 164

18.

fdrMotif: identifying cis-elements by an EM algorithm coupled with false discovery rate control

Li L Bass RL Liang Y 《Bioinformatics (Oxford, England)》2008,24(5):629-636

MOTIVATION: Most de novo motif identification methods optimize the motif model first and then separately test the statistical significance of the motif score. In the first stage, a motif abundance parameter needs to be specified or modeled. In the second stage, a Z-score or P-value is used as the test statistic. Error rates under multiple comparisons are not fully considered. METHODOLOGY: We propose a simple but novel approach, fdrMotif, that selects as many binding sites as possible while controlling a user-specified false discovery rate (FDR). Unlike existing iterative methods, fdrMotif combines model optimization [e.g. position weight matrix (PWM)] and significance testing at each step. By monitoring the proportion of binding sites selected in many sets of background sequences, fdrMotif controls the FDR in the original data. The model is then updated using an expectation (E)- and maximization (M)-like procedure. We propose a new normalization procedure in the E-step for updating the model. This process is repeated until either the model converges or the number of iterations exceeds a maximum. RESULTS: Simulation studies suggest that our normalization procedure assigns larger weights to the binding sites than do two other commonly used normalization procedures. Furthermore, fdrMotif requires only a user-specified FDR and an initial PWM. When tested on 542 high confidence experimental p53 binding loci, fdrMotif identified 569 p53 binding sites in 505 (93.2%) sequences. In comparison, MEME identified more binding sites but in fewer ChIP sequences than fdrMotif. When tested on 500 sets of simulated 'ChIP' sequences with embedded known p53 binding sites, fdrMotif, compared to MEME, has higher sensitivity with similar positive predictive value. Furthermore, fdrMotif is robust to noise: it selected nearly identical binding sites in data adulterated with 50% added background sequences and the unadulterated data. We suggest that fdrMotif represents an improvement over MEME. AVAILABILITY: C code can be found at: http://www.niehs.nih.gov/research/resources/software/fdrMotif/.
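
The idea of selecting as many sites as possible while holding an FDR estimated from background sequences below a target can be illustrated with a generic empirical-FDR threshold search. This is a minimal sketch and not the fdrMotif procedure: it omits the PWM update and the E-step normalization, and the function name `pick_threshold`, its parameters, and the FDR estimate (average background hits divided by selected sites) are assumptions made for illustration.

```python
def pick_threshold(real_scores, background_sets, target_fdr):
    """Return the lowest score threshold whose estimated FDR meets the target.

    Hypothetical inputs: `real_scores` are candidate-site scores from the
    original data, and `background_sets` is a list of score lists, one per
    background sequence set. Returns None if no threshold qualifies.
    """
    best = None
    # Scan thresholds from strictest to most permissive. The FDR estimate is
    # not guaranteed to be monotone, so every threshold is examined.
    for t in sorted(set(real_scores), reverse=True):
        selected = sum(s >= t for s in real_scores)
        # Average number of background "sites" that would also be selected.
        false_avg = sum(
            sum(s >= t for s in bg) for bg in background_sets
        ) / len(background_sets)
        if false_avg / selected <= target_fdr:
            best = t  # a lower passing threshold selects more sites
    return best
```

Lowering the threshold admits more candidate sites, so the lowest passing threshold corresponds to the largest selection that still satisfies the estimated FDR bound.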

19.

Annotation of novel transcripts putatively relevant for bovine fat metabolism

Annett Eberlein Claudia Kalbe Tom Goldammer Ronald M. Brunner Christa Kuehn Rosemarie Weikard 《Molecular biology reports》2011,38(5):2975-2986

20.

A generic motif discovery algorithm for sequential data

Jensen KL Styczynski MP Rigoutsos I Stephanopoulos GN 《Bioinformatics (Oxford, England)》2006,22(1):21-28

MOTIVATION: Motif discovery in sequential data is a problem of great interest and with many applications. However, previous methods have been unable to combine exhaustive search with complex motif representations and are each typically only applicable to a certain class of problems. RESULTS: Here we present a generic motif discovery algorithm (Gemoda) for sequential data. Gemoda can be applied to any dataset with a sequential character, including both categorical and real-valued data. As we show, Gemoda deterministically discovers motifs that are maximal in composition and length. As well, the algorithm allows any choice of similarity metric for finding motifs. Finally, Gemoda's output motifs are representation-agnostic: they can be represented using regular expressions, position weight matrices or any number of other models for any type of sequential data. We demonstrate a number of applications of the algorithm, including the discovery of motifs in amino acid sequences, a new solution to the (l,d)-motif problem in DNA sequences and the discovery of conserved protein substructures. AVAILABILITY: Gemoda is freely available at http://web.mit.edu/bamel/gemoda
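
To make "representation-agnostic" concrete, the sketch below turns one set of equal-length motif instances into two of the representations the abstract names: a regular expression and a table of per-position symbol frequencies (the raw material of a position weight matrix). It is not part of Gemoda and does not show how Gemoda finds the instances; the function names and the example sequences are hypothetical.

```python
import re
from collections import Counter


def to_regex(instances):
    """Build a regular expression matching the symbols seen at each position."""
    parts = []
    for column in zip(*instances):
        symbols = sorted(set(column))
        if len(symbols) == 1:
            parts.append(re.escape(symbols[0]))
        else:
            parts.append("[" + "".join(re.escape(s) for s in symbols) + "]")
    return "".join(parts)


def position_frequencies(instances):
    """Return one {symbol: relative frequency} dict per motif position."""
    length = len(instances[0])
    if any(len(seq) != length for seq in instances):
        raise ValueError("all motif instances must have the same length")
    n = len(instances)
    return [
        {sym: count / n for sym, count in Counter(column).items()}
        for column in zip(*instances)
    ]


# Hypothetical motif instances, used only to show the two representations.
instances = ["ACGT", "ACTT", "AGGT"]
print(to_regex(instances))                 # A[CG][GT]T
print(position_frequencies(instances)[0])  # {'A': 1.0}
```

The regular expression keeps only which symbols occur at each position, while the frequency table also records how often they occur, so the same discovered motif can feed either kind of downstream model.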