排序方式: 共有59条查询结果,搜索用时 15 毫秒
1.
Ken Chen Nicholas E Navin Yong Wang Heather K Schmidt John W Wallis Beifang Niu Xian Fan Hao Zhao Michael D McLellan Katherine A Hoadley Elaine R Mardis Timothy J Ley Charles M Perou Richard K Wilson Li Ding 《Genome biology》2013,14(8):R87
Producing gene fusions through genomic structural rearrangements is a major mechanism for tumor evolution. Therefore, accurately detecting gene fusions and the originating rearrangements is of great importance for personalized cancer diagnosis and targeted therapy. We present a tool, BreakTrans, that systematically maps predicted gene fusions to structural rearrangements. Thus, BreakTrans not only validates both types of predictions, but also provides mechanistic interpretations. BreakTrans effectively validates known fusions and discovers novel events in a breast cancer cell line. Applying BreakTrans to 43 breast cancer samples in The Cancer Genome Atlas identifies 90 genomically validated gene fusions. BreakTrans is available at http://bioinformatics.mdanderson.org/main/BreakTrans 相似文献
2.
Estienne C. Swart John R. Bracht Vincent Magrini Patrick Minx Xiao Chen Yi Zhou Jaspreet S. Khurana Aaron D. Goldman Mariusz Nowacki Klaas Schotanus Seolkyoung Jung Robert S. Fulton Amy Ly Sean McGrath Kevin Haub Jessica L. Wiggins Donna Storton John C. Matese Lance Parsons Wei-Jen Chang Michael S. Bowen Nicholas A. Stover Thomas A. Jones Sean R. Eddy Glenn A. Herrick Thomas G. Doak Richard K. Wilson Elaine R. Mardis Laura F. Landweber 《PLoS biology》2013,11(1)
The macronuclear genome of the ciliate Oxytricha trifallax displays an extreme and unique eukaryotic genome architecture with extensive genomic variation. During sexual genome development, the expressed, somatic macronuclear genome is whittled down to the genic portion of a small fraction (∼5%) of its precursor “silent” germline micronuclear genome by a process of “unscrambling” and fragmentation. The tiny macronuclear “nanochromosomes” typically encode single, protein-coding genes (a small portion, 10%, encode 2–8 genes), have minimal noncoding regions, and are differentially amplified to an average of ∼2,000 copies. We report the high-quality genome assembly of ∼16,000 complete nanochromosomes (∼50 Mb haploid genome size) that vary from 469 bp to 66 kb long (mean ∼3.2 kb) and encode ∼18,500 genes. Alternative DNA fragmentation processes ∼10% of the nanochromosomes into multiple isoforms that usually encode complete genes. Nucleotide diversity in the macronucleus is very high (SNP heterozygosity is ∼4.0%), suggesting that Oxytricha trifallax may have one of the largest known effective population sizes of eukaryotes. Comparison to other ciliates with nonscrambled genomes and long macronuclear chromosomes (on the order of 100 kb) suggests several candidate proteins that could be involved in genome rearrangement, including domesticated MULE and IS1595-like DDE transposases. The assembly of the highly fragmented Oxytricha macronuclear genome is the first completed genome with such an unusual architecture. This genome sequence provides tantalizing glimpses into novel molecular biology and evolution. For example, Oxytricha maintains tens of millions of telomeres per cell and has also evolved an intriguing expansion of telomere end-binding proteins. In conjunction with the micronuclear genome in progress, the O. trifallax macronuclear genome will provide an invaluable resource for investigating programmed genome rearrangements, complementing studies of rearrangements arising during evolution and disease. 相似文献
3.
4.
5.
The design of new HIV protease inhibitors requires an improved understanding of the physical basis of inhibitor/protein binding. Here, the binding affinities of seven aliphatic cyclic ureas to HIV-1 protease are calculated using a predominant states method and an implicit solvent model based upon finite difference solutions of the Poisson-Boltzmann equation. The calculations are able to reproduce the observed U-shaped trend of binding free energy as a function of aliphatic chain length. Interestingly, the decrease in affinity for the longest chains is attributable primarily to the energy cost of partly desolvating charged aspartic and arginine groups at the mouths of the active site. Even aliphatic chains too short to contact these charged groups directly are subject to considerable desolvation penalties. We are not aware of other systems where binding affinity trends have been attributed to long-ranged electrostatic desolvation of ionized groups. A generalized Born/surface area solvation model yields a much smaller change in desolvation energy with chain length and, therefore, does not reproduce the experimental binding affinity trends. This result suggests that the generalized Born model should be used with caution for complex, partly desolvated systems like protein binding sites. We also find that changing the assumed protonation state of the active site aspartyl dyad significantly affects the computed binding affinity trends. The protonation state of the aspartyl dyad in the presence of cyclic ureas is discussed in light of the observation that the monoprotonated state reproduces the experimental results best. 相似文献
6.
Wang D Urisman A Liu YT Springer M Ksiazek TG Erdman DD Mardis ER Hickenbotham M Magrini V Eldred J Latreille JP Wilson RK Ganem D DeRisi JL 《PLoS biology》2003,1(2):e2
Because of the constant threat posed by emerging infectious diseases and the limitations of existing approaches used to identify new pathogens, there is a great demand for new technological methods for viral discovery. We describe herein a DNA microarray-based platform for novel virus identification and characterization. Central to this approach was a DNA microarray designed to detect a wide range of known viruses as well as novel members of existing viral families; this microarray contained the most highly conserved 70mer sequences from every fully sequenced reference viral genome in GenBank. During an outbreak of severe acute respiratory syndrome (SARS) in March 2003, hybridization to this microarray revealed the presence of a previously uncharacterized coronavirus in a viral isolate cultivated from a SARS patient. To further characterize this new virus, approximately 1 kb of the unknown virus genome was cloned by physically recovering viral sequences hybridized to individual array elements. Sequencing of these fragments confirmed that the virus was indeed a new member of the coronavirus family. This combination of array hybridization followed by direct viral sequence recovery should prove to be a general strategy for the rapid identification and characterization of novel viruses and emerging infectious disease. 相似文献
7.
Extending assembly of short DNA sequences to handle error 总被引:2,自引:0,他引:2
Jeck WR Reinhardt JA Baltrus DA Hickenbotham MT Magrini V Mardis ER Dangl JL Jones CD 《Bioinformatics (Oxford, England)》2007,23(21):2942-2944
Inexpensive de novo genome sequencing, particularly in organisms with small genomes, is now possible using several new sequencing technologies. Some of these technologies such as that from Illumina's Solexa Sequencing, produce high genomic coverage by generating a very large number of small reads ( approximately 30 bp). While prior work shows that partial assembly can be performed by k-mer extension in error-free reads, this algorithm is unsuccessful with the sequencing error rates found in practice. We present VCAKE (Verified Consensus Assembly by K-mer Extension), a modification of simple k-mer extension that overcomes error by using high depth coverage. Though it is a simple modification of a previous approach, we show significant improvements in assembly results on simulated and experimental datasets that include error. AVAILABILITY: http://152.2.15.114/~labweb/VCAKE 相似文献
8.
Wang J Mullighan CG Easton J Roberts S Heatley SL Ma J Rusch MC Chen K Harris CC Ding L Holmfeldt L Payne-Turner D Fan X Wei L Zhao D Obenauer JC Naeve C Mardis ER Wilson RK Downing JR Zhang J 《Nature methods》2011,8(8):652-654
We developed 'clipping reveals structure' (CREST), an algorithm that uses next-generation sequencing reads with partial alignments to a reference genome to directly map structural variations at the nucleotide level of resolution. Application of CREST to whole-genome sequencing data from five pediatric T-lineage acute lymphoblastic leukemias (T-ALLs) and a human melanoma cell line, COLO-829, identified 160 somatic structural variations. Experimental validation exceeded 80%, demonstrating that CREST had a high predictive accuracy. 相似文献
9.
Ye L Hillier LW Minx P Thane N Locke DP Martin JC Chen L Mitreva M Miller JR Haub KV Dooling DJ Mardis ER Wilson RK Weinstock GM Warren WC 《Genome biology》2011,12(3):R31-7
The unparalleled efficiency of next-generation sequencing (NGS) has prompted widespread adoption, but significant problems remain in the use of NGS data for whole genome assembly. We explore the advantages and disadvantages of chicken genome assemblies generated using a variety of sequencing and assembly methodologies. NGS assemblies are equivalent in some ways to a Sanger-based assembly yet deficient in others. Nonetheless, these assemblies are sufficient for the identification of the majority of genes and can reveal novel sequences when compared to existing assembly references. 相似文献
10.
Ramaswamy Govindan Li Ding Malachi Griffith Janakiraman Subramanian Nathan D. DeesKrishna L. Kanchi Christopher A. Maher Robert Fulton Lucinda Fulton John Wallis Ken ChenJason Walker Sandra McDonald Ron Bose David OrnitzDonghai Xiong Ming YouDavid J. Dooling Mark Watson Elaine R. Mardis Richard K. Wilson 《Cell》2012,150(6):1121-1134