期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

M-GCAT: interactively and efficiently constructing large-scale multiple genome comparison frameworks in closely related species

Todd J Treangen Xavier Messeguer 《BMC bioinformatics》2006,7(1):433

Background

Due to recent advances in whole genome shotgun sequencing and assembly technologies, the financial cost of decoding an organism's DNA has been drastically reduced, resulting in a recent explosion of genomic sequencing projects. This increase in related genomic data will allow for in depth studies of evolution in closely related species through multiple whole genome comparisons. 相似文献

2.

Low-pass sequencing for microbial comparative genomics

Goo YA Roach J Glusman G Baliga NS Deutsch K Pan M Kennedy S DasSarma S Ng WV Hood L 《BMC genomics》2004,5(1):3-19

Background

We studied four extremely halophilic archaea by low-pass shotgun sequencing: (1) the metabolically versatile Haloarcula marismortui; (2) the non-pigmented Natrialba asiatica; (3) the psychrophile Halorubrum lacusprofundi and (4) the Dead Sea isolate Halobaculum gomorrense. Approximately one thousand single pass genomic sequences per genome were obtained. The data were analyzed by comparative genomic analyses using the completed Halobacterium sp. NRC-1 genome as a reference. Low-pass shotgun sequencing is a simple, inexpensive, and rapid approach that can readily be performed on any cultured microbe. 相似文献

3.

Characterization of microsatellites and gene contents from genome shotgun sequences of mungbean (Vigna radiata (L.) Wilczek)

Sithichoke Tangphatsornruang Prakit Somta Pichahpuk Uthaipaisanwong Juntima Chanprasert Duangjai Sangsrakru Worapa Seehalak Warunee Sommanas Somvong Tragoonrung Peerasak Srinives 《BMC plant biology》2009,9(1):137

Background

Mungbean is an important economical crop in Asia. However, genomic research has lagged behind other crop species due to the lack of polymorphic DNA markers found in this crop. The objective of this work is to develop and characterize microsatellite or simple sequence repeat (SSR) markers from genome shotgun sequencing of mungbean. 相似文献

4.

<Emphasis Type="Italic">Tracembler</Emphasis> – software for <Emphasis Type="Italic">in-silico</Emphasis> chromosome walking in unassembled genomes

Qunfeng Dong Matthew D Wilkerson Volker Brendel 《BMC bioinformatics》2007,8(1):151

Background

Whole genome shotgun sequencing produces increasingly higher coverage of a genome with random sequence reads. Progressive whole genome assembly and eventual finishing sequencing is a process that typically takes several years for large eukaryotic genomes. In the interim, all sequence reads of public sequencing projects are made available in repositories such as the NCBI Trace Archive. For a particular locus, sequencing coverage may be high enough early on to produce a reliable local genome assembly. We have developed software, Tracembler, that facilitates in silico chromosome walking by recursively assembling reads of a selected species from the NCBI Trace Archive starting with reads that significantly match sequence seeds supplied by the user. 相似文献

5.

An algorithm for automated closure during assembly

Sergey Koren Jason R Miller Brian P Walenz Granger Sutton 《BMC bioinformatics》2010,11(1):457

Background

Finishing is the process of improving the quality and utility of draft genome sequences generated by shotgun sequencing and computational assembly. Finishing can involve targeted sequencing. Finishing reads may be incorporated by manual or automated means. One automated method uses targeted addition by local re-assembly of gap regions. An obvious alternative uses de novo assembly of all the reads. 相似文献

6.

GO Explorer: A gene-ontology tool to aid in the interpretation of shotgun proteomics data

Paulo C Carvalho Juliana SG Fischer Emily I Chen Gilberto B Domont Maria GC Carvalho Wim M Degrave John R Yates III Valmir C Barbosa 《Proteome science》2009,7(1):6-11

Background

Spectral counting is a shotgun proteomics approach comprising the identification and relative quantitation of thousands of proteins in complex mixtures. However, this strategy generates bewildering amounts of data whose biological interpretation is a challenge. 相似文献

7.

Unsupervised statistical clustering of environmental shotgun sequences

Andrey Kislyuk Srijak Bhatnagar Jonathan Dushoff Joshua S Weitz 《BMC bioinformatics》2009,10(1):316

Background

The development of effective environmental shotgun sequence binning methods remains an ongoing challenge in algorithmic analysis of metagenomic data. While previous methods have focused primarily on supervised learning involving extrinsic data, a first-principles statistical model combined with a self-training fitting method has not yet been developed. 相似文献

8.

PatternLab for proteomics: a tool for differential shotgun proteomics

Paulo C Carvalho Juliana SG Fischer Emily I Chen John R YatesIII Valmir C Barbosa 《BMC bioinformatics》2008,9(1):316

Background

A goal of proteomics is to distinguish between states of a biological system by identifying protein expression differences. Liu et al. demonstrated a method to perform semi-relative protein quantitation in shotgun proteomics data by correlating the number of tandem mass spectra obtained for each protein, or "spectral count", with its abundance in a mixture; however, two issues have remained open: how to normalize spectral counting data and how to efficiently pinpoint differences between profiles. Moreover, Chen et al. recently showed how to increase the number of identified proteins in shotgun proteomics by analyzing samples with different MS-compatible detergents while performing proteolytic digestion. The latter introduced new challenges as seen from the data analysis perspective, since replicate readings are not acquired. 相似文献

9.

Unlocking the mystery of the hard-to-sequence phage genome: PaP1 methylome and bacterial immunity

Shuguang Lu Shuai Le Yinling Tan Ming Li Chang Liu Kebin Zhang Jianjun Huang Haimei Chen Xiancai Rao Junmin Zhu Lingyun Zou Qingshan Ni Shu Li Jing Wang Xiaolin Jin Qiwen Hu Xinyue Yao Xia Zhao Lin Zhang Guangtao Huang Fuquan Hu 《BMC genomics》2014,15(1)

Background

Whole-genome sequencing is an important method to understand the genetic information, gene function, biological characteristics and survival mechanisms of organisms. Sequencing large genomes is very simple at present. However, we encountered a hard-to-sequence genome of Pseudomonas aeruginosa phage PaP1. Shotgun sequencing method failed to complete the sequence of this genome.

Results

After persevering for 10 years and going over three generations of sequencing techniques, we successfully completed the sequence of the PaP1 genome with a length of 91,715 bp. Single-molecule real-time sequencing results revealed that this genome contains 51 N-6-methyladenines and 152 N-4-methylcytosines. Three significant modified sequence motifs were predicted, but not all of the sites found in the genome were methylated in these motifs. Further investigations revealed a novel immune mechanism of bacteria, in which host bacteria can recognise and repel modified bases containing inserts in a large scale. This mechanism could be accounted for the failure of the shotgun method in PaP1 genome sequencing. This problem was resolved using the nfi^- mutant of Escherichia coli DH5α as a host bacterium to construct a shotgun library.

Conclusions

This work provided insights into the hard-to-sequence phage PaP1 genome and discovered a new mechanism of bacterial immunity. The methylome of phage PaP1 is responsible for the failure of shotgun sequencing and for bacterial immunity mediated by enzyme Endo V activity; this methylome also provides a valuable resource for future studies on PaP1 genome replication and modification, as well as on gene regulation and host interaction.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-803) contains supplementary material, which is available to authorized users. 相似文献

10.

DecGPU: distributed error correction on massively parallel graphics processing units using CUDA and MPI

Yongchao Liu Bertil Schmidt Douglas L Maskell 《BMC bioinformatics》2011,12(1):85

Background

Next-generation sequencing technologies have led to the high-throughput production of sequence data (reads) at low cost. However, these reads are significantly shorter and more error-prone than conventional Sanger shotgun reads. This poses a challenge for the de novo assembly in terms of assembly quality and scalability for large-scale short read datasets. 相似文献

11.

A BAC-based integrated linkage map of the silkworm Bombyx mori 总被引：3，自引：0，他引：3

Yamamoto K Nohata J Kadono-Okuda K Narukawa J Sasanuma M Sasanuma S Minami H Shimomura M Suetsugu Y Banno Y Osoegawa K de Jong PJ Goldsmith MR Mita K 《Genome biology》2008,9(1):R21-14

Background

In 2004, draft sequences of the model lepidopteran Bombyx mori were reported using whole-genome shotgun sequencing. Because of relatively shallow genome coverage, the silkworm genome remains fragmented, hampering annotation and comparative genome studies. For a more complete genome analysis, we developed extended scaffolds combining physical maps with improved genetic maps.

Results

We mapped 1,755 single nucleotide polymorphism (SNP) markers from bacterial artificial chromosome (BAC) end sequences onto 28 linkage groups using a recombining male backcross population, yielding an average inter-SNP distance of 0.81 cM (about 270 kilobases). We constructed 6,221 contigs by fingerprinting clones from three BAC libraries digested with different restriction enzymes, and assigned a total of 724 single copy genes to them by BLAST (basic local alignment search tool) search of the BAC end sequences and high-density BAC filter hybridization using expressed sequence tags as probes. We assigned 964 additional expressed sequence tags to linkage groups by restriction fragment length polymorphism analysis of a nonrecombining female backcross population. Altogether, 361.1 megabases of BAC contigs and singletons were integrated with a map containing 1,688 independent genes. A test of synteny using Oxford grid analysis with more than 500 silkworm genes revealed six versus 20 silkworm linkage groups containing eight or more orthologs of Apis versus Tribolium, respectively.

Conclusion

The integrated map contains approximately 10% of predicted silkworm genes and has an estimated 76% genome coverage by BACs. This provides a new resource for improved assembly of whole-genome shotgun data, gene annotation and positional cloning, and will serve as a platform for comparative genomics and gene discovery in Lepidoptera and other insects. 相似文献

12.

A robust linear regression based algorithm for automated evaluation of peptide identifications from shotgun proteomics by use of reversed-phase liquid chromatography retention time

Hua Xu Lanhao Yang Michael A Freitas 《BMC bioinformatics》2008,9(1):347

Background

Rejection of false positive peptide matches in database searches of shotgun proteomic experimental data is highly desirable. Several methods have been developed to use the peptide retention time as to refine and improve peptide identifications from database search algorithms. This report describes the implementation of an automated approach to reduce false positives and validate peptide matches. 相似文献

13.

A Dynamic Noise Level Algorithm for Spectral Screening of Peptide MS/MS Spectra

Hua Xu Michael A Freitas 《BMC bioinformatics》2010,11(1):436

Background

High-throughput shotgun proteomics data contain a significant number of spectra from non-peptide ions or spectra of too poor quality to obtain highly confident peptide identifications. These spectra cannot be identified with any positive peptide matches in some database search programs or are identified with false positives in others. Removing these spectra can improve the database search results and lower computational expense. 相似文献

14.

Caryoscope: An Open Source Java application for viewing microarray data in a genomic context

Ihab?AB?Awad Christian?A?Rees Tina?Hernandez-Boussard Catherine?A?Ball Gavin?Sherlock Email author 《BMC bioinformatics》2004,5(1):151

Background

Microarray-based comparative genome hybridization experiments generate data that can be mapped onto the genome. These data are interpreted more easily when represented graphically in a genomic context. 相似文献

15.

Automated FingerPrint Background removal: FPB

Simone Scalabrin Michele Morgante Alberto Policriti 《BMC bioinformatics》2009,10(1):127-7

Background

The construction of a whole-genome physical map has been an essential component of numerous genome projects initiated since the inception of the Human Genome Project. Its usefulness has been proved for whole-genome shotgun projects as a post-assembly validation and recently it has also been used in the assembly step to constrain on BACs positions. Fingerprinting is usually the method of choice for construction of physical maps. A clone fingerprint is composed of true peaks representing real fragments and background peaks, mainly composed of E. coli genomic DNA, partial digestions, star activity by-products, and machine background. High-throughput fingerprinting leads to the production of thousands of BAC clone fingerprints per day. That is why background peaks removal has become an important issue and needs to be automatized, especially in capillary electrophoresis based fingerprints. 相似文献

16.

A whole-genome assembly of the domestic cow, Bos taurus 总被引：4，自引：0，他引：4

Aleksey V Zimin Arthur L Delcher Liliana Florea David R Kelley Michael C Schatz Daniela Puiu Finnian Hanrahan Geo Pertea Curtis P Van Tassell Tad S Sonstegard Guillaume Marçais Michael Roberts Poorani Subramanian James A Yorke Steven L Salzberg 《Genome biology》2009,10(4):R42-10

Background

The genome of the domestic cow, Bos taurus, was sequenced using a mixture of hierarchical and whole-genome shotgun sequencing methods.

Results

We have assembled the 35 million sequence reads and applied a variety of assembly improvement techniques, creating an assembly of 2.86 billion base pairs that has multiple improvements over previous assemblies: it is more complete, covering more of the genome; thousands of gaps have been closed; many erroneous inversions, deletions, and translocations have been corrected; and thousands of single-nucleotide errors have been corrected. Our evaluation using independent metrics demonstrates that the resulting assembly is substantially more accurate and complete than alternative versions.

Conclusions

By using independent mapping data and conserved synteny between the cow and human genomes, we were able to construct an assembly with excellent large-scale contiguity in which a large majority (approximately 91%) of the genome has been placed onto the 30 B. taurus chromosomes. We constructed a new cow-human synteny map that expands upon previous maps. We also identified for the first time a portion of the B. taurus Y chromosome. 相似文献

17.

Average genome size estimation improves comparative metagenomics and sheds light on the functional ecology of the human microbiome

Stephen Nayfach Katherine S Pollard 《Genome biology》2015,16(1)

Average genome size is an important, yet often overlooked, property of microbial communities. We developed MicrobeCensus to rapidly and accurately estimate average genome size from shotgun metagenomic data and applied our tool to 1,352 human microbiome samples. We found that average genome size differs significantly within and between body sites and tracks with major functional and taxonomic differences. In the gut, average genome size is positively correlated with the abundance of Bacteroides and genes related to carbohydrate metabolism. Importantly, we found that average genome size variation can bias comparative analyses, and that normalization improves detection of differentially abundant genes.

Electronic supplementary material

The online version of this article (doi:10.1186/s13059-015-0611-7) contains supplementary material, which is available to authorized users. 相似文献

18.

rSW-seq: Algorithm for detection of copy number alterations in deep sequencing data

Tae-Min Kim Lovelace J Luquette Ruibin Xi Peter J Park 《BMC bioinformatics》2010,11(1):432

Background

Recent advances in sequencing technologies have enabled generation of large-scale genome sequencing data. These data can be used to characterize a variety of genomic features, including the DNA copy number profile of a cancer genome. A robust and reliable method for screening chromosomal alterations would allow a detailed characterization of the cancer genome with unprecedented accuracy. 相似文献

19.

Integrated functional visualization of eukaryotic genomes

Rohit Ghai Hannes Lindemann Trinad Chakraborty 《BMC bioinformatics》2006,7(1):348-9

Background

Increasing amounts of data from large scale whole genome analysis efforts demands convenient tools for manipulation, visualization and investigation. Whole genome plots offer an intuitive window to the analysis. We describe two applications that enable users to easily plot and explore whole genome data from their own or other researchers' experiments. 相似文献

20.

A high-throughput <Emphasis Type="Italic">de novo</Emphasis> sequencing approach for shotgun proteomics using high-resolution tandem mass spectrometry

Chongle Pan Byung H Park William H McDonald Patricia A Carey Jillian F Banfield Nathan C VerBerkmoes Robert L Hettich Nagiza F Samatova 《BMC bioinformatics》2010,11(1):118

Background

High-resolution tandem mass spectra can now be readily acquired with hybrid instruments, such as LTQ-Orbitrap and LTQ-FT, in high-throughput shotgun proteomics workflows. The improved spectral quality enables more accurate de novo sequencing for identification of post-translational modifications and amino acid polymorphisms. 相似文献