首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.

Background

Metagenomics can reveal the vast majority of microbes that have been missed by traditional cultivation-based methods. Due to its extremely wide range of application areas, fast metagenome sequencing simulation systems with high fidelity are in great demand to facilitate the development and comparison of metagenomics analysis tools.

Results

We present here a customizable metagenome simulation system: NeSSM (Next-generation Sequencing Simulator for Metagenomics). Combining complete genomes currently available, a community composition table, and sequencing parameters, it can simulate metagenome sequencing better than existing systems. Sequencing error models based on the explicit distribution of errors at each base and sequencing coverage bias are incorporated in the simulation. In order to improve the fidelity of simulation, tools are provided by NeSSM to estimate the sequencing error models, sequencing coverage bias and the community composition directly from existing metagenome sequencing data. Currently, NeSSM supports single-end and pair-end sequencing for both 454 and Illumina platforms. In addition, a GPU (graphics processing units) version of NeSSM is also developed to accelerate the simulation. By comparing the simulated sequencing data from NeSSM with experimental metagenome sequencing data, we have demonstrated that NeSSM performs better in many aspects than existing popular metagenome simulators, such as MetaSim, GemSIM and Grinder. The GPU version of NeSSM is more than one-order of magnitude faster than MetaSim.

Conclusions

NeSSM is a fast simulation system for high-throughput metagenome sequencing. It can be helpful to develop tools and evaluate strategies for metagenomics analysis and it’s freely available for academic users at http://cbb.sjtu.edu.cn/~ccwei/pub/software/NeSSM.php.  相似文献   

2.
A major challenge in the field of shotgun metagenomics is the accurate identification of organisms present within a microbial community, based on classification of short sequence reads. Though existing microbial community profiling methods have attempted to rapidly classify the millions of reads output from modern sequencers, the combination of incomplete databases, similarity among otherwise divergent genomes, errors and biases in sequencing technologies, and the large volumes of sequencing data required for metagenome sequencing has led to unacceptably high false discovery rates (FDR). Here, we present the application of a novel, gene-independent and signature-based metagenomic taxonomic profiling method with significantly and consistently smaller FDR than any other available method. Our algorithm circumvents false positives using a series of non-redundant signature databases and examines Genomic Origins Through Taxonomic CHAllenge (GOTTCHA). GOTTCHA was tested and validated on 20 synthetic and mock datasets ranging in community composition and complexity, was applied successfully to data generated from spiked environmental and clinical samples, and robustly demonstrates superior performance compared with other available tools.  相似文献   

3.
The molecular size of mu and pi symbionts of Parameciumaurelia has been calculated from renaturation kinetic data. Observed values were 0.78 × 109 daltons for mu particle DNA and 0.81 × 109 daltons for pi particle DNA. Estimates of analytical complexity were 4.45 × 109 and 5.05 × 109 daltons respectively. Based on these data, mu and pi symbionts appear to possess multiple genomes and contain a minimum of 5 or 6 copies of each DNA sequence.  相似文献   

4.
5.
Next generation sequencing (NGS) of PCR amplicons is a standard approach to detect genetic variations in personalized medicine such as cancer diagnostics. Computer programs used in the NGS community often miss insertions and deletions (indels) that constitute a large part of known human mutations. We have developed HeurAA, an open source, heuristic amplicon aligner program. We tested the program on simulated datasets as well as experimental data from multiplex sequencing of 40 amplicons in 12 oncogenes collected on a 454 Genome Sequencer from lung cancer cell lines. We found that HeurAA can accurately detect all indels, and is more than an order of magnitude faster than previous programs. HeurAA can compare reads and reference sequences up to several thousand base pairs in length, and it can evaluate data from complex mixtures containing reads of different gene-segments from different samples. HeurAA is written in C and Perl for Linux operating systems, the code and the documentation are available for research applications at http://sourceforge.net/projects/heuraa/  相似文献   

6.
7.
8.

Background

The B3 DNA binding domain includes five families: auxin response factor (ARF), abscisic acid-insensitive3 (ABI3), high level expression of sugar inducible (HSI), related to ABI3/VP1 (RAV) and reproductive meristem (REM). The release of the complete genomes of the angiosperm eudicots Arabidopsis thaliana and Populus trichocarpa, the monocot Orysa sativa, the bryophyte Physcomitrella patens,the green algae Chlamydomonas reinhardtii and Volvox carteri and the red algae Cyanidioschyzon melorae provided an exceptional opportunity to study the evolution of this superfamily.

Methodology

In order to better understand the origin and the diversification of B3 domains in plants, we combined comparative phylogenetic analysis with exon/intron structure and duplication events. In addition, we investigated the conservation and divergence of the B3 domain during the origin and evolution of each family.

Conclusions

Our data indicate that showed that the B3 containing genes have undergone extensive duplication events, and that the REM family B3 domain has a highly diverged DNA binding. Our results also indicate that the founding member of the B3 gene family is likely to be similar to the ABI3/HSI genes found in C. reinhardtii and V. carteri. Among the B3 families, ABI3, HSI, RAV and ARF are most structurally conserved, whereas the REM family has experienced a rapid divergence. These results are discussed in light of their functional and evolutionary roles in plant development.  相似文献   

9.

Background

Wolfram syndrome (WFS) is a recessive neurologic and endocrinologic degenerative disorder, and is also known as DIDMOAD (Diabetes Insipidus, early-onset Diabetes Mellitus, progressive Optic Atrophy and Deafness) syndrome. Most affected individuals carry recessive mutations in the Wolfram syndrome 1 gene (WFS1). However, the phenotypic pleiomorphism, rarity and molecular complexity of this disease complicate our efforts to understand WFS. To address this limitation, we aimed to describe complications and to elucidate the contributions of WFS1 mutations to clinical manifestations in Japanese patients with WFS.

Methodology

The minimal ascertainment criterion for diagnosing WFS was having both early onset diabetes mellitus and bilateral optic atrophy. Genetic analysis for WFS1 was performed by direct sequencing.

Principal Findings

Sixty-seven patients were identified nationally for a prevalence of one per 710,000, with 33 patients (49%) having all 4 components of DIDMOAD. In 40 subjects who agreed to participate in this investigation from 30 unrelated families, the earliest manifestation was DM at a median age of 8.7 years, followed by OA at a median age of 15.8 years. However, either OA or DI was the first diagnosed feature in 6 subjects. In 10, features other than DM predated OA. Twenty-seven patients (67.5%) had a broad spectrum of recessive mutations in WFS1. Two patients had mutations in only one allele. Eleven patients (27.5%) had intact WFS1 alleles. Ages at onset of both DM and OA in patients with recessive WFS1 mutations were indistinguishable from those in patients without WFS1 mutations. In the patients with predicted complete loss-of-function mutations, ages at the onsets of both DM and OA were significantly earlier than those in patients with predicted partial-loss-of function mutations.

Conclusion/Significance

This study emphasizes the clinical and genetic heterogeneity in patients with WFS. Genotype-phenotype correlations may exist in patients with WFS1 mutations, as demonstrated by the disease onset.  相似文献   

10.
Sarcolemmal membrane-associated protein (SLMAP) is a tail-anchored protein involved in fundamental cellular processes, such as myoblast fusion, cell cycle progression, and chromosomal inheritance. Further, SLMAP misexpression is associated with endothelial dysfunctions in diabetes and cancer. SLMAP is part of the conserved striatin-interacting phosphatase and kinase (STRIPAK) complex required for specific signaling pathways in yeasts, filamentous fungi, insects, and mammals. In filamentous fungi, STRIPAK was initially discovered in Sordaria macrospora, a model system for fungal differentiation. Here, we functionally characterize the STRIPAK subunit PRO45, a homolog of human SLMAP. We show that PRO45 is required for sexual propagation and cell-to-cell fusion and that its forkhead-associated (FHA) domain is essential for these processes. Protein-protein interaction studies revealed that PRO45 binds to STRIPAK subunits PRO11 and SmMOB3, which are also required for sexual propagation. Superresolution structured-illumination microscopy (SIM) further established that PRO45 localizes to the nuclear envelope, endoplasmic reticulum, and mitochondria. SIM also showed that localization to the nuclear envelope requires STRIPAK subunits PRO11 and PRO22, whereas for mitochondria it does not. Taken together, our study provides important insights into fundamental roles of the fungal SLMAP homolog PRO45 and suggests STRIPAK-related and STRIPAK-unrelated functions.  相似文献   

11.
There is a need for techniques capable of identifying the antigenic epitopes targeted by polyclonal antibody responses during deliberate or natural immunization. Although successful, traditional phage library screening is laborious and can map only some of the epitopes. To accelerate and improve epitope identification, we have employed massive sequencing of phage-displayed antigen-specific libraries using the Illumina MiSeq platform. This enabled us to precisely identify the regions of a model antigen, the meningococcal NadA virulence factor, targeted by serum antibodies in vaccinated individuals and to rank hundreds of antigenic fragments according to their immunoreactivity. We found that next generation sequencing can significantly empower the analysis of antigen-specific libraries by allowing simultaneous processing of dozens of library/serum combinations in less than two days, including the time required for antibody-mediated library selection. Moreover, compared with traditional plaque picking, the new technology (named Phage-based Representation OF Immuno-Ligand Epitope Repertoire or PROFILER) provides superior resolution in epitope identification. PROFILER seems ideally suited to streamline and guide rational antigen design, adjuvant selection, and quality control of newly produced vaccines. Furthermore, this method is also susceptible to find important applications in other fields covered by traditional quantitative serology.  相似文献   

12.
The Ubiquitin-fold modifier 1 (Ufm1) is a ubiquitin-like protein that can also be conjugated to protein substrates and subsequently alter their fates. Both UFMylation and de-UFMylation are mediated by Ufm1-specific proteases (UFSPs). In humans, it is widely believed that UFSP2 is the only active Ufm1 protease involved in Ufm1 maturation and de-UFMylation, whereas UFSP1 is thought to be inactive. Here, Liang et al. provide strong evidence showing that human UFSP1 is also an active Ufm1 protease. These results solve an age-old mystery in the human Ufm1 conjugation system and could have a greater impact not only on Ufm1 biology but also on the translation of genes employing nontraditional start codons.  相似文献   

13.
Triggering receptor expressed on myeloid cells-2 (TREM-2) is rapidly emerging as a key regulator of the innate immune response via its regulation of macrophage inflammatory responses. Here we demonstrate that proximal TREM-2 signaling parallels other DAP12-based receptor systems in its use of Syk and Src-family kinases. However, we find that the linker for activation of T cells (LAT) is severely reduced as monocytes differentiate into macrophages and that TREM-2 exclusively uses the linker for activation of B cells (LAB encoded by the gene Lat2−/−) to mediate downstream signaling. LAB is required for TREM-2-mediated activation of Erk1/2 and dampens proximal TREM-2 signals through a novel LAT-independent mechanism resulting in macrophages with proinflammatory properties. Thus, Lat2−/− macrophages have increased TREM-2-induced proximal phosphorylation, and lipopolysaccharide stimulation of these cells leads to increased interleukin-10 (IL-10) and decreased IL-12p40 production relative to wild type cells. Together these data identify LAB as a critical, LAT-independent regulator of TREM-2 signaling and macrophage development capable of controlling subsequent inflammatory responses.  相似文献   

14.
Small RNAs undergo maturation events that precisely determine the length and structure required for their function. CRISPRs (clustered regularly interspaced short palindromic repeats) encode small RNAs (crRNAs) that together with CRISPR-associated (cas) genes constitute a sequence-specific prokaryotic immune system for anti-viral and anti-plasmid defense. crRNAs are subject to multiple processing events during their biogenesis, and little is known about the mechanism of the final maturation step. We show that in the Staphylococcus epidermidis type III CRISPR-Cas system, mature crRNAs are measured in a Cas10·Csm ribonucleoprotein complex to yield discrete lengths that differ by 6-nucleotide increments. We looked for mutants that impact this crRNA size pattern and found that an alanine substitution of a conserved aspartate residue of Csm3 eliminates the 6-nucleotide increments in the length of crRNAs. In vitro, recombinant Csm3 binds RNA molecules at multiple sites, producing gel-shift patterns that suggest that each protein binds 6 nucleotides of substrate. In vivo, changes in the levels of Csm3 modulate the crRNA size distribution without disrupting the 6-nucleotide periodicity. Our data support a model in which multiple Csm3 molecules within the Cas10·Csm complex bind the crRNA with a 6-nucleotide periodicity to function as a ruler that measures the extent of crRNA maturation.  相似文献   

15.
The marine-derived Scopulariopsis brevicaulis strain LF580 produces scopularides A and B, which have anticancerous properties. We carried out genome sequencing using three next-generation DNA sequencing methods. De novo hybrid assembly yielded 621 scaffolds with a total size of 32.2 Mb and 16298 putative gene models. We identified a large non-ribosomal peptide synthetase gene (nrps1) and supporting pks2 gene in the same biosynthetic gene cluster. This cluster and the genes within the cluster are functionally active as confirmed by RNA-Seq. Characterization of carbohydrate-active enzymes and major facilitator superfamily (MFS)-type transporters lead to postulate S. brevicaulis originated from a soil fungus, which came into contact with the marine sponge Tethya aurantium. This marine sponge seems to provide shelter to this fungus and micro-environment suitable for its survival in the ocean. This study also builds the platform for further investigations of the role of life-style and secondary metabolites from S. brevicaulis.  相似文献   

16.
The Thailand-Cambodia border is the epicenter for drug-resistant falciparum malaria. Previous studies have shown that chloroquine (CQ) and pyrimethamine resistance originated in this region and eventually spread to other Asian countries and Africa. However, there is a dearth in understanding the origin and evolution of dhps alleles associated with sulfadoxine resistance. The present study was designed to reveal the origin(s) of sulfadoxine resistance in Cambodia and its evolutionary relationship to African and South American dhps alleles. We sequenced 234 Cambodian Plasmodium falciparum isolates for the dhps codons S436A/F, A437G, K540E, A581G and A613S/T implicated in sulfadoxine resistance. We also genotyped 10 microsatellite loci around dhps to determine the genetic backgrounds of various alleles and compared them with the backgrounds of alleles prevalent in Africa and South America. In addition to previously known highly-resistant triple mutant dhps alleles SGEGA and AGEAA (codons 436, 437, 540, 581, 613 are sequentially indicated), a large proportion of the isolates (19.3%) contained a 540N mutation in association with 437G/581G yielding a previously unreported triple mutant allele, SGNGA. Microsatellite data strongly suggest the strength of selection was greater on triple mutant dhps alleles followed by the double and single mutants. We provide evidence for at least three independent origins for the double mutants, one each for the SGKGA, AGKAA and SGEAA alleles. Our data suggest that the triple mutant allele SGEGA and the novel allele SGNGA have common origin on the SGKGA background, whereas the AGEAA triple mutant was derived from AGKAA on multiple, albeit limited, genetic backgrounds. The SGEAA did not share haplotypes with any of the triple mutants. Comparative analysis of the microsatellite haplotypes flanking dhps alleles from Cambodia, Kenya, Cameroon and Venezuela revealed an independent origin of sulfadoxine resistant alleles in each of these regions.  相似文献   

17.
C-C chemokine receptor 5 (CCR5) is a receptor for chemokines and a co-receptor for HIV-1 entry into the target CD4+ cells. CCR5 delta 32 deletion is a loss-of-function mutation, resistant to HIV-1 infection. We tried to induce the CCR5 delta 32 mutation harnessing the genome editing technique, CRISPR-Cas9 (Clustered Regularly Interspaced Short Palindromic Repeats, CRISPR and CRISPR associated protein 9, Cas9) in the commonly used cell line human embryonic kidney HEK 293T cells. Surprisingly, we found that HEK293T cells are heterozygous for CCR5 delta 32 mutation, in contrast to the wild type CCR5 cells, human acute T cell leukemia cell line Jurkat and human breast adenocarcinoma cell line MDA-MB-231 cells. This finding indicates that at least one human cell line is heterozygous for the CCR5 delta 32 mutation. We also found that in PCR amplification, wild type CCR5 DNA and mutant delta 32 DNA can form mismatched heteroduplex and move slowly in gel electrophoresis.  相似文献   

18.
Shewanella oneidensis couples oxidation of lactate to respiration of many substrates. Here we report that llpR (l-lactate-positive regulator, SO_3460) encodes a positive regulator of l-lactate utilization distinct from previously studied regulators. We also demonstrate d-lactate inhibition of l-lactate utilization in S. oneidensis, resulting in preferential utilization of the d isomer.  相似文献   

19.

Background

Nesprins (Nuclear envelope spectrin-repeat proteins) are a novel family of giant spectrin-repeat containing proteins. The nesprin-1 and nesprin-2 genes consist of 146 and 116 exons which encode proteins of ∼1mDa and ∼800 kDa is size respectively when all the exons are utilised in translation. However emerging data suggests that the nesprins have multiple alternative start and termination sites throughout their genes allowing the generation of smaller isoforms.

Results

In this study we set out to identify novel alternatively transcribed nesprin variants by screening the EST database and by using RACE analysis to identify cDNA ends. These two methods provided potential hits for alternative start and termination sites that were validated by PCR and DNA sequencing. We show that these alternative sites are not only expressed in a tissue specific manner but by combining different sites together it is possible to create a wide array of nesprin variants. By cloning and expressing small novel nesprin variants into human fibroblasts and U2OS cells we show localization to actin stress-fibres, focal adhesions, microtubules, the nucleolus, nuclear matrix and the nuclear envelope (NE). Furthermore we show that the sub-cellular localization of individual nesprin variants can vary depending on the cell type, suggesting any single nesprin variant may have different functions in different cell types.

Conclusions

These studies suggest nesprins act as highly versatile tissue specific intracellular protein scaffolds and identify potential novel functions for nesprins beyond cytoplasmic-nuclear coupling. These alternate functions may also account for the diverse range of disease phenotypes observed when these genes are mutated.  相似文献   

20.
A mass spectral method for the quantitation of the percentages of deoxyadenosine, deoxyguanosine, deoxycytidine, and thymidine in intact DNAs has been devised. Standard curves for each nucleoside have been constructed which are based upon the observation that a direct correlation exists between the heights (% deflection) of diagnostic peaks from these nucleosides in a mass spectrum and the published percent composition of specific DNAs. Analyses of DNA from Clostridiumperfringens, Micrococcusluteus, Escherichiacoli, Bacillussubtilis, Pseudomonasfluorescens, Drosophilamelanogaster, salmon sperm, and bacteriophage lambda were used to determine standard curves. The validity of the method was demonstrated by comparison of the results from the mass spectral procedure with results from the chemical analyses of the DNAs from calf thymus and wheat germ. Analysis of ØX-174 DNA yielded values consistent with the published values obtained via sequence analysis and indicated that the method is applicable to both single and double-stranded DNAs. Results from T2 DNA, which contains no cytidine, exhibited artificially high values for adenosine, guanosine and thymidine with concomitant alteration in the A/T and G/C molar ratios. Such skewed results are useful in predicting the presence of modified nucleosides. The extreme sensitivity of the method has been exploited in the analysis of subnanogram quantities of restriction endonuclease fragments from DNA.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号