期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

M-GCAT: interactively and efficiently constructing large-scale multiple genome comparison frameworks in closely related species

Todd J Treangen Xavier Messeguer 《BMC bioinformatics》2006,7(1):433

Background

Due to recent advances in whole genome shotgun sequencing and assembly technologies, the financial cost of decoding an organism's DNA has been drastically reduced, resulting in a recent explosion of genomic sequencing projects. This increase in related genomic data will allow for in depth studies of evolution in closely related species through multiple whole genome comparisons. 相似文献

2.

Characterization of microsatellites and gene contents from genome shotgun sequences of mungbean (Vigna radiata (L.) Wilczek)

Sithichoke Tangphatsornruang Prakit Somta Pichahpuk Uthaipaisanwong Juntima Chanprasert Duangjai Sangsrakru Worapa Seehalak Warunee Sommanas Somvong Tragoonrung Peerasak Srinives 《BMC plant biology》2009,9(1):137

Background

Mungbean is an important economical crop in Asia. However, genomic research has lagged behind other crop species due to the lack of polymorphic DNA markers found in this crop. The objective of this work is to develop and characterize microsatellite or simple sequence repeat (SSR) markers from genome shotgun sequencing of mungbean. 相似文献

3.

<Emphasis Type="Italic">Tracembler</Emphasis> – software for <Emphasis Type="Italic">in-silico</Emphasis> chromosome walking in unassembled genomes

Qunfeng Dong Matthew D Wilkerson Volker Brendel 《BMC bioinformatics》2007,8(1):151

Background

Whole genome shotgun sequencing produces increasingly higher coverage of a genome with random sequence reads. Progressive whole genome assembly and eventual finishing sequencing is a process that typically takes several years for large eukaryotic genomes. In the interim, all sequence reads of public sequencing projects are made available in repositories such as the NCBI Trace Archive. For a particular locus, sequencing coverage may be high enough early on to produce a reliable local genome assembly. We have developed software, Tracembler, that facilitates in silico chromosome walking by recursively assembling reads of a selected species from the NCBI Trace Archive starting with reads that significantly match sequence seeds supplied by the user. 相似文献

4.

An algorithm for automated closure during assembly

Sergey Koren Jason R Miller Brian P Walenz Granger Sutton 《BMC bioinformatics》2010,11(1):457

Background

Finishing is the process of improving the quality and utility of draft genome sequences generated by shotgun sequencing and computational assembly. Finishing can involve targeted sequencing. Finishing reads may be incorporated by manual or automated means. One automated method uses targeted addition by local re-assembly of gap regions. An obvious alternative uses de novo assembly of all the reads. 相似文献

5.

OryzaPG-DB: Rice Proteome Database based on Shotgun Proteogenomics

Mohamed Helmy Masaru Tomita Yasushi Ishihama 《BMC plant biology》2011,11(1):63

Background

Proteogenomics aims to utilize experimental proteome information for refinement of genome annotation. Since mass spectrometry-based shotgun proteomics approaches provide large-scale peptide sequencing data with high throughput, a data repository for shotgun proteogenomics would represent a valuable source of gene expression evidence at the translational level for genome re-annotation. 相似文献

6.

Analysis of High-Throughput Sequencing and Annotation Strategies for Phage Genomes

Matthew R. Henn Matthew B. Sullivan Nicole Stange-Thomann Marcia S. Osburne Aaron M. Berlin Libusha Kelly Chandri Yandava Chinnappa Kodira Qiandong Zeng Michael Weiand Todd Sparrow Sakina Saif Georgia Giannoukos Sarah K. Young Chad Nusbaum Bruce W. Birren Sallie W. Chisholm 《PloS one》2010,5(2)

Background

Bacterial viruses (phages) play a critical role in shaping microbial populations as they influence both host mortality and horizontal gene transfer. As such, they have a significant impact on local and global ecosystem function and human health. Despite their importance, little is known about the genomic diversity harbored in phages, as methods to capture complete phage genomes have been hampered by the lack of knowledge about the target genomes, and difficulties in generating sufficient quantities of genomic DNA for sequencing. Of the approximately 550 phage genomes currently available in the public domain, fewer than 5% are marine phage.

Methodology/Principal Findings

To advance the study of phage biology through comparative genomic approaches we used marine cyanophage as a model system. We compared DNA preparation methodologies (DNA extraction directly from either phage lysates or CsCl purified phage particles), and sequencing strategies that utilize either Sanger sequencing of a linker amplification shotgun library (LASL) or of a whole genome shotgun library (WGSL), or 454 pyrosequencing methods. We demonstrate that genomic DNA sample preparation directly from a phage lysate, combined with 454 pyrosequencing, is best suited for phage genome sequencing at scale, as this method is capable of capturing complete continuous genomes with high accuracy. In addition, we describe an automated annotation informatics pipeline that delivers high-quality annotation and yields few false positives and negatives in ORF calling.

Conclusions/Significance

These DNA preparation, sequencing and annotation strategies enable a high-throughput approach to the burgeoning field of phage genomics. 相似文献

7.

Unlocking the mystery of the hard-to-sequence phage genome: PaP1 methylome and bacterial immunity

Shuguang Lu Shuai Le Yinling Tan Ming Li Chang Liu Kebin Zhang Jianjun Huang Haimei Chen Xiancai Rao Junmin Zhu Lingyun Zou Qingshan Ni Shu Li Jing Wang Xiaolin Jin Qiwen Hu Xinyue Yao Xia Zhao Lin Zhang Guangtao Huang Fuquan Hu 《BMC genomics》2014,15(1)

Background

Whole-genome sequencing is an important method to understand the genetic information, gene function, biological characteristics and survival mechanisms of organisms. Sequencing large genomes is very simple at present. However, we encountered a hard-to-sequence genome of Pseudomonas aeruginosa phage PaP1. Shotgun sequencing method failed to complete the sequence of this genome.

Results

After persevering for 10 years and going over three generations of sequencing techniques, we successfully completed the sequence of the PaP1 genome with a length of 91,715 bp. Single-molecule real-time sequencing results revealed that this genome contains 51 N-6-methyladenines and 152 N-4-methylcytosines. Three significant modified sequence motifs were predicted, but not all of the sites found in the genome were methylated in these motifs. Further investigations revealed a novel immune mechanism of bacteria, in which host bacteria can recognise and repel modified bases containing inserts in a large scale. This mechanism could be accounted for the failure of the shotgun method in PaP1 genome sequencing. This problem was resolved using the nfi^- mutant of Escherichia coli DH5α as a host bacterium to construct a shotgun library.

Conclusions

This work provided insights into the hard-to-sequence phage PaP1 genome and discovered a new mechanism of bacterial immunity. The methylome of phage PaP1 is responsible for the failure of shotgun sequencing and for bacterial immunity mediated by enzyme Endo V activity; this methylome also provides a valuable resource for future studies on PaP1 genome replication and modification, as well as on gene regulation and host interaction.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-803) contains supplementary material, which is available to authorized users. 相似文献

8.

An analysis of the Sargasso Sea resource and the consequences for database composition

Michael L Tress Domenico Cozzetto Anna Tramontano Alfonso Valencia 《BMC bioinformatics》2006,7(1):213-13

Background

The environmental sequencing of the Sargasso Sea has introduced a huge new resource of genomic information. Unlike the protein sequences held in the current searchable databases, the Sargasso Sea sequences originate from a single marine environment and have been sequenced from species that are not easily obtainable by laboratory cultivation. The resource also contains very many fragments of whole protein sequences, a side effect of the shotgun sequencing method. 相似文献

9.

Functional Annotation,Genome Organization and Phylogeny of the Grapevine (<Emphasis Type="Italic">Vitis vinifera</Emphasis>) Terpene Synthase Gene Family Based on Genome Assembly,FLcDNA Cloning,and Enzyme Assays

Diane M Martin Sébastien Aubourg Marina B Schouwey Laurent Daviet Michel Schalk Omid Toub Steven T Lund Jörg Bohlmann 《BMC plant biology》2010,10(1):226

Background

Terpenoids are among the most important constituents of grape flavour and wine bouquet, and serve as useful metabolite markers in viticulture and enology. Based on the initial 8-fold sequencing of a nearly homozygous Pinot noir inbred line, 89 putative terpenoid synthase genes (VvTPS) were predicted by in silico analysis of the grapevine (Vitis vinifera) genome assembly [1]. The finding of this very large VvTPS family, combined with the importance of terpenoid metabolism for the organoleptic properties of grapevine berries and finished wines, prompted a detailed examination of this gene family at the genomic level as well as an investigation into VvTPS biochemical functions. 相似文献

10.

DecGPU: distributed error correction on massively parallel graphics processing units using CUDA and MPI

Yongchao Liu Bertil Schmidt Douglas L Maskell 《BMC bioinformatics》2011,12(1):85

Background

Next-generation sequencing technologies have led to the high-throughput production of sequence data (reads) at low cost. However, these reads are significantly shorter and more error-prone than conventional Sanger shotgun reads. This poses a challenge for the de novo assembly in terms of assembly quality and scalability for large-scale short read datasets. 相似文献

11.

rSW-seq: Algorithm for detection of copy number alterations in deep sequencing data

Tae-Min Kim Lovelace J Luquette Ruibin Xi Peter J Park 《BMC bioinformatics》2010,11(1):432

Background

Recent advances in sequencing technologies have enabled generation of large-scale genome sequencing data. These data can be used to characterize a variety of genomic features, including the DNA copy number profile of a cancer genome. A robust and reliable method for screening chromosomal alterations would allow a detailed characterization of the cancer genome with unprecedented accuracy. 相似文献

12.

A high-throughput <Emphasis Type="Italic">de novo</Emphasis> sequencing approach for shotgun proteomics using high-resolution tandem mass spectrometry

Chongle Pan Byung H Park William H McDonald Patricia A Carey Jillian F Banfield Nathan C VerBerkmoes Robert L Hettich Nagiza F Samatova 《BMC bioinformatics》2010,11(1):118

Background

High-resolution tandem mass spectra can now be readily acquired with hybrid instruments, such as LTQ-Orbitrap and LTQ-FT, in high-throughput shotgun proteomics workflows. The improved spectral quality enables more accurate de novo sequencing for identification of post-translational modifications and amino acid polymorphisms. 相似文献

13.

Reconstructing the plant mitochondrial genome for marker discovery: a case study using Pinus

下载免费PDF全文

Kevin Donnelly Joan Cottrell Richard A. Ennos Giovanni Giuseppe Vendramin Stuart A'Hara Sarah King Annika Perry Witold Wachowiak Stephen Cavers 《Molecular ecology resources》2017,17(5):943-954

Whole‐genome‐shotgun (WGS) sequencing of total genomic DNA was used to recover ~1 Mbp of novel mitochondrial (mtDNA) sequence from Pinus sylvestris (L.) and three members of the closely related Pinus mugo species complex. DNA was extracted from megagametophyte tissue from six mother trees from locations across Europe, and 100‐bp paired‐end sequencing was performed on the Illumina HiSeq platform. Candidate mtDNA sequences were identified by their size and coverage characteristics, and by comparison with published plant mitochondrial genomes. Novel variants were identified, and primers targeting these loci were trialled on a set of 28 individuals from across Europe. In total, 31 SNP loci were successfully resequenced, characterizing 15 unique haplotypes. This approach offers a cost‐effective means of developing marker resources for mitochondrial genomes in other plant species where reference sequences are unavailable. 相似文献

14.

A physical map of the bovine genome 总被引：1，自引：1，他引：0

下载免费PDF全文

Snelling WM Chiu R Schein JE Hobbs M Abbey CA Adelson DL Aerts J Bennett GL Bosdet IE Boussaha M Brauning R Caetano AR Costa MM Crawford AM Dalrymple BP Eggen A Everts-van der Wind A Floriot S Gautier M Gill CA Green RD Holt R Jann O Jones SJ Kappes SM Keele JW de Jong PJ Larkin DM Lewin HA McEwan JC McKay S Marra MA Mathewson CA Matukumalli LK Moore SS Murdoch B Nicholas FW Osoegawa K Roy A Salih H Schibler L Schnabel RD Silveri L Skow LC Smith TP Sonstegard TS Taylor JF Tellam R 《Genome biology》2007,8(8):R165

Background

Cattle are important agriculturally and relevant as a model organism. Previously described genetic and radiation hybrid (RH) maps of the bovine genome have been used to identify genomic regions and genes affecting specific traits. Application of these maps to identify influential genetic polymorphisms will be enhanced by integration with each other and with bacterial artificial chromosome (BAC) libraries. The BAC libraries and clone maps are essential for the hybrid clone-by-clone/whole-genome shotgun sequencing approach taken by the bovine genome sequencing project.

Results

A bovine BAC map was constructed with HindIII restriction digest fragments of 290,797 BAC clones from animals of three different breeds. Comparative mapping of 422,522 BAC end sequences assisted with BAC map ordering and assembly. Genotypes and pedigree from two genetic maps and marker scores from three whole-genome RH panels were consolidated on a 17,254-marker composite map. Sequence similarity allowed integrating the BAC and composite maps with the bovine draft assembly (Btau3.1), establishing a comprehensive resource describing the bovine genome. Agreement between the marker and BAC maps and the draft assembly is high, although discrepancies exist. The composite and BAC maps are more similar than either is to the draft assembly.

Conclusion

Further refinement of the maps and greater integration into the genome assembly process may contribute to a high quality assembly. The maps provide resources to associate phenotypic variation with underlying genomic variation, and are crucial resources for understanding the biology underpinning this important ruminant species so closely associated with humans. 相似文献

15.

A high quality draft consensus sequence of the genome of a heterozygous grapevine variety 总被引：7，自引：0，他引：7

Velasco R Zharkikh A Troggio M Cartwright DA Cestaro A Pruss D Pindo M Fitzgerald LM Vezzulli S Reid J Malacarne G Iliev D Coppola G Wardell B Micheletti D Macalma T Facci M Mitchell JT Perazzolli M Eldredge G Gatto P Oyzerski R Moretto M Gutin N Stefanini M Chen Y Segala C Davenport C Demattè L Mraz A Battilana J Stormo K Costa F Tao Q Si-Ammour A Harkins T Lackey A Perbost C Taillon B Stella A Solovyev V Fawcett JA Sterck L Vandepoele K Grando SM Toppo S Moser C Lanchbury J Bogden R 《PloS one》2007,2(12):e1326

Background

Worldwide, grapes and their derived products have a large market. The cultivated grape species Vitis vinifera has potential to become a model for fruit trees genetics. Like many plant species, it is highly heterozygous, which is an additional challenge to modern whole genome shotgun sequencing. In this paper a high quality draft genome sequence of a cultivated clone of V. vinifera Pinot Noir is presented.

Principal Findings

We estimate the genome size of V. vinifera to be 504.6 Mb. Genomic sequences corresponding to 477.1 Mb were assembled in 2,093 metacontigs and 435.1 Mb were anchored to the 19 linkage groups (LGs). The number of predicted genes is 29,585, of which 96.1% were assigned to LGs. This assembly of the grape genome provides candidate genes implicated in traits relevant to grapevine cultivation, such as those influencing wine quality, via secondary metabolites, and those connected with the extreme susceptibility of grape to pathogens. Single nucleotide polymorphism (SNP) distribution was consistent with a diffuse haplotype structure across the genome. Of around 2,000,000 SNPs, 1,751,176 were mapped to chromosomes and one or more of them were identified in 86.7% of anchored genes. The relative age of grape duplicated genes was estimated and this made possible to reveal a relatively recent Vitis-specific large scale duplication event concerning at least 10 chromosomes (duplication not reported before).

Conclusions

Sanger shotgun sequencing and highly efficient sequencing by synthesis (SBS), together with dedicated assembly programs, resolved a complex heterozygous genome. A consensus sequence of the genome and a set of mapped marker loci were generated. Homologous chromosomes of Pinot Noir differ by 11.2% of their DNA (hemizygous DNA plus chromosomal gaps). SNP markers are offered as a tool with the potential of introducing a new era in the molecular breeding of grape. 相似文献

16.

Genome-wide analysis of the role of GlnR in Streptomyces venezuelae provides new insights into global nitrogen regulation in actinomycetes

Steven T Pullan Govind Chandra Mervyn J Bibb Mike Merrick 《BMC genomics》2011,12(1):1-14

Background

Knowledge of the origins, distribution, and inheritance of variation in the malaria parasite (Plasmodium falciparum) genome is crucial for understanding its evolution; however the 81% (A+T) genome poses challenges to high-throughput sequencing technologies. We explore the viability of the Roche 454 Genome Sequencer FLX (GS FLX) high throughput sequencing technology for both whole genome sequencing and fine-resolution characterization of genetic exchange in malaria parasites.

Results

We present a scheme to survey recombination in the haploid stage genomes of two sibling parasite clones, using whole genome pyrosequencing that includes a sliding window approach to predict recombination breakpoints. Whole genome shotgun (WGS) sequencing generated approximately 2 million reads, with an average read length of approximately 300 bp. De novo assembly using a combination of WGS and 3 kb paired end libraries resulted in contigs ≤ 34 kb. More than 8,000 of the 24,599 SNP markers identified between parents were genotyped in the progeny, resulting in a marker density of approximately 1 marker/3.3 kb and allowing for the detection of previously unrecognized crossovers (COs) and many non crossover (NCO) gene conversions throughout the genome.

Conclusions

By sequencing the 23 Mb genomes of two haploid progeny clones derived from a genetic cross at more than 30× coverage, we captured high resolution information on COs, NCOs and genetic variation within the progeny genomes. This study is the first to resequence progeny clones to examine fine structure of COs and NCOs in malaria parasites. 相似文献

17.

SeeGH – A software tool for visualization of whole genome array comparative genomic hybridization data

Bryan?Chi Email author Ronald?J?deLeeuw Bradley?P?Coe Calum?MacAulay Wan?L?Lam 《BMC bioinformatics》2004,5(1):13

Background

Array comparative genomic hybridization (CGH) is a technique which detects copy number differences in DNA segments. Complete sequencing of the human genome and the development of an array representing a tiling set of tens of thousands of DNA segments spanning the entire human genome has made high resolution copy number analysis throughout the genome possible. Since array CGH provides signal ratio for each DNA segment, visualization would require the reassembly of individual data points into chromosome profiles. 相似文献

18.

CLAME: a new alignment-based binning algorithm allows the genomic description of a novel Xanthomonadaceae from the Colombian Andes

Andres Benavides Juan Pablo Isaza Juan Pablo Niño-García Juan Fernando Alzate Felipe Cabarcas 《BMC genomics》2018,19(8):858

Background

Hot spring bacteria have unique biological adaptations to survive the extreme conditions of these environments; these bacteria produce thermostable enzymes that can be used in biotechnological and industrial applications. However, sequencing these bacteria is complex, since it is not possible to culture them. As an alternative, genome shotgun sequencing of whole microbial communities can be used. The problem is that the classification of sequences within a metagenomic dataset is very challenging particularly when they include unknown microorganisms since they lack genomic reference. We failed to recover a bacterium genome from a hot spring metagenome using the available software tools, so we develop a new tool that allowed us to recover most of this genome.

Results

We present a proteobacteria draft genome reconstructed from a Colombian’s Andes hot spring metagenome. The genome seems to be from a new lineage within the family Rhodanobacteraceae of the class Gammaproteobacteria, closely related to the genus Dokdonella. We were able to generate this genome thanks to CLAME. CLAME, from Spanish “CLAsificador MEtagenomico”, is a tool to group reads in bins. We show that most reads from each bin belong to a single chromosome. CLAME is very effective recovering most of the reads belonging to the predominant species within a metagenome.

Conclusions

We developed a tool that can be used to extract genomes (or parts of them) from a complex metagenome.

相似文献

19.

Construction of a dairy microbial genome catalog opens new perspectives for the metagenomic analysis of dairy fermented products

Mathieu Almeida Agnès Hébert Anne-Laure Abraham Simon Rasmussen Christophe Monnet Nicolas Pons Céline Delbès Valentin Loux Jean-Michel Batto Pierre Leonard Sean Kennedy Stanislas Dusko Ehrlich Mihai Pop Marie-Christine Montel Fran?oise Irlinger Pierre Renault 《BMC genomics》2014,15(1)

Background

Microbial communities of traditional cheeses are complex and insufficiently characterized. The origin, safety and functional role in cheese making of these microbial communities are still not well understood. Metagenomic analysis of these communities by high throughput shotgun sequencing is a promising approach to characterize their genomic and functional profiles. Such analyses, however, critically depend on the availability of appropriate reference genome databases against which the sequencing reads can be aligned.

Results

We built a reference genome catalog suitable for short read metagenomic analysis using a low-cost sequencing strategy. We selected 142 bacteria isolated from dairy products belonging to 137 different species and 67 genera, and succeeded to reconstruct the draft genome of 117 of them at a standard or high quality level, including isolates from the genera Kluyvera, Luteococcus and Marinilactibacillus, still missing from public database. To demonstrate the potential of this catalog, we analysed the microbial composition of the surface of two smear cheeses and one blue-veined cheese, and showed that a significant part of the microbiota of these traditional cheeses was composed of microorganisms newly sequenced in our study.

Conclusions

Our study provides data, which combined with publicly available genome references, represents the most expansive catalog to date of cheese-associated bacteria. Using this extended dairy catalog, we revealed the presence in traditional cheese of dominant microorganisms not deliberately inoculated, mainly Gram-negative genera such as Pseudoalteromonas haloplanktis or Psychrobacter immobilis, that may contribute to the characteristics of cheese produced through traditional methods.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-1101) contains supplementary material, which is available to authorized users. 相似文献

20.

Predicting phenotypic traits of prokaryotes from protein domain frequencies

Thomas Lingner Stefanie Mühlhausen Toni Gabaldón Cedric Notredame Peter Meinicke 《BMC bioinformatics》2010,11(1):481

Background

Establishing the relationship between an organism's genome sequence and its phenotype is a fundamental challenge that remains largely unsolved. Accurately predicting microbial phenotypes solely based on genomic features will allow us to infer relevant phenotypic characteristics when the availability of a genome sequence precedes experimental characterization, a scenario that is favored by the advent of novel high-throughput and single cell sequencing techniques. 相似文献