Similar literature
 20 similar documents found (search time: 15 ms)
1.
G Sun  Y Wang  G Tao  Q Shen  W Cao  X Chang  W Zhang  C Shao  M Yi  S Shao  Y Yang 《Journal of virology》2012,86(16):8892-8893
Human parechoviruses (HPeVs) are a species in the Parechovirus genus of the Picornaviridae family. We report a complete genome sequence of a novel HPeV strain, CH-ZJ1, that was found in an infant with gastroenteritis in Zhenjiang City, China. The complete genome consists of 7,298 nucleotides (nt), excluding the 3' poly(A) tail; the open reading frame is mapped between nucleotide positions 654 and 7211 and encodes a 2,185-amino acid (aa) polyprotein. The phylogenetic tree obtained for the complete genome of this HPeV strain and the other HPeV strains available in GenBank indicated that CH-ZJ1 falls between HPeV type 4 (HPeV4) and HPeV5. Phylogenetic analysis based on the 3D and VP1 genes reveals two incongruent trees. Recombination detection indicated that CH-ZJ1 might be a recombinant produced by more than one genomic recombination event among HPeV1, HPeV4, and HPeV3 strains.
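
The coordinates quoted in the abstract above can be checked with a little arithmetic. The sketch below is illustrative only: the function name `orf_protein_length` is hypothetical, and it assumes the reported positions are 1-based, inclusive, and include the stop codon, which the abstract does not state explicitly.

```python
def orf_protein_length(start: int, end: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an ORF spanning start..end.

    Assumes 1-based inclusive nucleotide coordinates (an assumption,
    not something the abstract confirms).
    """
    length_nt = end - start + 1
    if length_nt % 3 != 0:
        raise ValueError(f"ORF length {length_nt} nt is not a multiple of 3")
    codons = length_nt // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    # CH-ZJ1 ORF as reported: positions 654 to 7211
    print(orf_protein_length(654, 7211))  # 2185
```

Under those assumptions the span of 6,558 nt gives 2,186 codons, that is, 2,185 amino acids plus a stop codon, which agrees with the reported polyprotein length.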

2.
3.
The circulation of human parechoviruses (HPeVs) in the population was studied by environmental surveillance comprising molecular analyses of sewage samples (n = 89) that were collected from 15 different locations in the Netherlands. Samples were taken from sewage originating from schools (n = 9) or from parts of municipalities (n = 6) during the Dutch school year 2010-2011. At 13/15 locations HPeV1, HPeV3, or HPeV6 RNA was detected at least once; however, sequence diversity did not reflect associations in time or place. A higher percentage of positives was observed in the samples originating from the municipalities. It was demonstrated that HPeV circulated in the studied population to a higher extent than would be expected from the current knowledge on infections predominating in young children.

4.
Extrachromosomal genomes of the adeleorinid parasite Hepatozoon canis infecting an Israeli dog were investigated using next-generation and standard sequencing technologies. A complete apicoplast genome and several mitochondrion-associated sequences were generated. The apicoplast genome (31,869 bp) possessed two copies of both large subunit (23S) and small subunit (16S) ribosomal RNA genes (rDNA) within an inverted repeat region, as well as 22 protein-coding sequences, 25 transfer RNA genes (tDNA) and seven open reading frames of unknown function. Although circular-mapping, the apicoplast genome was physically linear according to next-generation data. Unlike other apicoplast genomes, genes encoding ribosomal protein S19 and tDNAs for alanine, aspartic acid, histidine, threonine and valine were not identified. No complete mitochondrial genome was recovered using next-generation data or directed PCR amplifications. Eight mitochondrion-associated (215–3523 bp) contigs assembled from next-generation data encoded a complete cytochrome c oxidase subunit I coding sequence, a complete cytochrome c oxidase subunit III coding sequence, two complete cytochrome B coding sequences, a non-coding pseudogene for cytochrome B and multiple fragmented mitochondrial rDNA genes (SSUA, SSUB, SSUD, LSUC, LSUG, RNA6, RNA10, RNA14, RNA18). The paucity of NGS reads generating each of the mitochondrion-like sequences suggested that a complete mitochondrial genome at typically high copy number was absent in H. canis. In contrast, the complete nuclear rDNA unit sequence of H. canis (18S rDNA to 28S rDNA, 6977 bp) had >1000-fold next-generation coverage. Multiple divergent (from 93.6% to 99.9% pairwise identities) nuclear 18S rDNA contigs were generated (three types with 10 subtypes total). To our knowledge this is the first apicoplast genome sequenced from any adeleorinid coccidium and the first mitochondrion-associated sequences from this serious pathogen of wild and domestic canids. These newly generated sequences may provide useful genetic loci for high-resolution species-level genotyping that is currently impossible using existing nuclear rDNA targets.

5.
In this study, the chloroplast (cp) genome sequences from three early diverged leptosporangiate ferns were completed and analyzed in order to understand the evolution of the genome of the fern lineages. The complete cp genome sequence of Osmunda cinnamomea (Osmundales) was 142,812 base pairs (bp). The cp genome structure was similar to that of eusporangiate ferns. The gene/intron losses that frequently occurred in the cp genome of leptosporangiate ferns were not found in the cp genome of O. cinnamomea. In addition, putative RNA editing sites in the cp genome were rare in O. cinnamomea, even though the sites were frequently predicted to be present in leptosporangiate ferns. The complete cp genome sequence of Diplopterygium glaucum (Gleicheniales) was 151,007 bp and has a 9.7 kb inversion between the trnL-CAA and trnV-GCA genes when compared to O. cinnamomea. Several repeated sequences were detected around the inversion break points. The complete cp genome sequence of Lygodium japonicum (Schizaeales) was 157,142 bp and a deletion of the rpoC1 intron was detected. This intron loss was shared by all of the studied species of the genus Lygodium. The GC contents and the effective numbers of codons (ENCs) in ferns varied significantly when compared to seed plants. The ENC values of the early diverged leptosporangiate ferns showed intermediate levels between eusporangiate and core leptosporangiate ferns. However, our phylogenetic tree based on all of the cp gene sequences clearly indicated that the cp genome similarities between O. cinnamomea (Osmundales) and eusporangiate ferns are symplesiomorphies, rather than synapomorphies. Therefore, our data are in agreement with the view that Osmundales is a distinct early diverged lineage in the leptosporangiate ferns.

6.
We determined the complete mitochondrial DNA (mtDNA) sequence of a fluke, Paramphistomum cervi (Digenea: Paramphistomidae). This genome (14,014 bp) is slightly larger than that of Clonorchis sinensis (13,875 bp), but smaller than those of other digenean species. The mt genome of P. cervi contains 12 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes and 2 non-coding regions (NCRs), a complement consistent with those of other digeneans. The arrangement of protein-coding and ribosomal RNA genes in the P. cervi mitochondrial genome is identical to that of other digeneans except for a group of Schistosoma species that exhibit a derived arrangement. The positions of some transfer RNA genes differ. Bayesian phylogenetic analyses, based on concatenated nucleotide sequences and amino-acid sequences of the 12 protein-coding genes, placed P. cervi within the Order Plagiorchiida, but relationships depicted within that order were not quite as expected from previous studies. The complete mtDNA sequence of P. cervi provides important genetic markers for diagnostics, ecological and evolutionary studies of digeneans.

7.
Simple sequence repeats (SSRs) can be derived from the complete genome sequence. These markers are important for gene mapping as well as marker-assisted selection (MAS). To develop SSRs for cotton gene mapping, we selected the complete genome sequence of Gossypium raimondii, which consisted of 4447 non-redundant scaffolds. Out of 775.2 Mb sequence examined, a total of 136,345 microsatellites were identified with a density of 5.69 kb per SSR in the G. raimondii genome, leading to the development of 112,177 primer pairs. The distributions of SSRs in the genome were non-random. Among the different motifs ranging from 1 to 6 bp, penta-nucleotide repeats were most abundant (30.5%), followed by tetra-nucleotide repeats (18.2%) and di-nucleotide repeats (16.9%). Among all 457 identified motif types, the most frequently occurring repeat motifs were poly-AT/TA, which accounted for 79.8% of the total di-nucleotide SSRs, followed by AAAT/TTTA with 51.5% of the total tetra-nucleotide SSRs. Further, 18,834 microsatellites were detected from the protein-coding genes, and the frequency of genes containing SSRs was 46.0% in 40,976 genes of G. raimondii. These genome-based SSRs developed in the present study will lay the groundwork for developing large numbers of SSR markers for genetic mapping, gene discovery, genetic diversity analysis, and MAS breeding in cotton.
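
The reported SSR density follows directly from the two totals given in the abstract. A minimal sketch of that calculation, assuming 1 Mb = 1,000 kb and that density is simply sequence length divided by SSR count (the function name `ssr_density_kb` is hypothetical):

```python
def ssr_density_kb(sequence_mb: float, ssr_count: int) -> float:
    """Average sequence length (in kb) per SSR.

    Assumes 1 Mb = 1,000 kb and a plain length/count ratio; the paper's
    exact definition of density is not given in the abstract.
    """
    if ssr_count <= 0:
        raise ValueError("ssr_count must be positive")
    return sequence_mb * 1000.0 / ssr_count


if __name__ == "__main__":
    # 775.2 Mb examined, 136,345 microsatellites identified
    print(round(ssr_density_kb(775.2, 136_345), 2))  # 5.69
```

The result rounds to the 5.69 kb per SSR quoted above, so the two totals and the density are mutually consistent.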

8.
This is the first de novo assembly and annotation of a complete mitochondrial genome in the Ericales order from the American cranberry (Vaccinium macrocarpon Ait.). Moreover, only four complete Asterid mitochondrial genomes have been made publicly available. The cranberry mitochondrial genome was assembled and reconstructed from whole genome 454 Roche GS-FLX and Illumina shotgun sequences. Compared with other Asterids, the reconstruction of the genome revealed an average-size mitochondrial genome (459,678 nt) with relatively little repetitive sequence and DNA of plastid origin. Annotation of the complete cranberry mitochondrial genome yielded a total of 34 genes classified based on their putative function, plus three ribosomal RNAs and 17 transfer RNAs. Maternal organellar cranberry inheritance was inferred by analyzing gene variation in the cranberry mitochondrial and plastid genomes. The annotation of the cranberry mitochondrial genome revealed the presence of two copies of tRNA-Sec and a selenocysteine insertion sequence (SECIS) element, which were lost in plants during evolution. This is the first report of a land plant possessing selenocysteine insertion machinery at the sequence level.

9.
Radish (Raphanus sativus L.) is an edible root vegetable crop that is cultivated worldwide and whose genome has been sequenced. Here we report the complete nucleotide sequence of the radish cultivar WK10039 chloroplast (cp) genome, along with a de novo assembly strategy using whole genome shotgun sequence reads obtained by next generation sequencing. The radish cp genome is 153,368 bp in length and has a typical quadripartite structure, composed of a pair of inverted repeat regions (26,217 bp each), a large single copy region (83,170 bp), and a small single copy region (17,764 bp). The radish cp genome contains 87 predicted protein-coding genes, 37 tRNA genes, and 8 rRNA genes. Sequence analysis revealed the presence of 91 simple sequence repeats (SSRs) in the radish cp genome.
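
In a quadripartite chloroplast genome the total length is the large single copy region plus the small single copy region plus two copies of the inverted repeat. The sketch below checks the radish figures against that identity; the function name `quadripartite_length` is hypothetical and the code is only an illustration, not part of the cited work.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total cp genome length from its four parts: LSC + SSC + 2 x IR."""
    return lsc + ssc + 2 * ir


if __name__ == "__main__":
    # Radish cp genome: LSC 83,170 bp, SSC 17,764 bp, IR 26,217 bp each
    total = quadripartite_length(lsc=83_170, ssc=17_764, ir=26_217)
    print(total == 153_368)  # True
```

The same identity holds for the Salvia miltiorrhiza (82,695 + 17,555 + 2 × 25,539 = 151,328 bp) and Magnolia grandiflora (87,757 + 18,740 + 2 × 26,563 = 159,623 bp) genomes described in entries 16 and 20.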

10.
The changing profile of infection over time for Human Parechoviruses (HPeVs) is not well known and no detailed study has been reported to date in China. This investigation on HPeV infection in hospitalized children in Lanzhou, China revealed variations in epidemiological characteristics after a three-year interval. To assess the changes that had occurred, epidemiological and clinical characteristics of HPeVs were characterized and compared with data previously reported by our group. A comparable positivity rate (25.3%, 73/289) was revealed after the three-year interval, with the majority of the infected children (95.9%, 70/73) being younger than two years of age. A temporal change in the seasonal distribution was noted: in the current study HPeVs were more frequently detected from July to November, compared with September to December in the previous study. Changes in HPeV genotype patterns, a temporal change in the prevalence of HPeV1, a younger susceptible age to HPeV3 compared with HPeV1 and a tendency of older children to be infected with HPeV4 are in contrast to our previous report. HPeV2, a rarely reported genotype, was identified for the first time in China. In addition, an exclusive trinucleotide (GAT) insertion in the HPeV4 nucleotide sequence was identified. However, the profiles of co-infection with other enteric related viruses were similar to our previous findings. In summary, these data suggest temporal variation in the seasonal distribution of HPeV and changing patterns of HPeV genotypes over time in the study region.

11.
12.
De novo microbial genome sequencing reached a turning point with third-generation sequencing (TGS) platforms, and several microbial genomes have been improved by TGS long reads. Bacillus subtilis natto is closely related to the laboratory standard strain B. subtilis Marburg 168, and it has a function in the production of the traditional Japanese fermented food “natto.” The B. subtilis natto BEST195 genome was previously sequenced with short reads, but it included some incomplete regions. We resequenced the BEST195 genome using a PacBio RS sequencer and successfully obtained a complete genome sequence from one scaffold without any gaps; we also applied Illumina MiSeq short reads to enhance quality. Comparing it with the previous BEST195 draft genome and the Marburg 168 genome, we found that incomplete regions in the previous genome sequence were attributable to GC-bias and repetitive sequences, and we also identified some novel genes that are found only in the new genome.

13.
The complete plastid genome sequence of the American cranberry (Vaccinium macrocarpon Ait.) was reconstructed using next-generation sequencing data by in silico procedures. We used Roche 454 shotgun sequence data to isolate cranberry plastid-specific sequences of “HyRed” via homology comparisons with complete sequences from several species available at the National Center for Biotechnology Information database. Eleven cranberry plastid contigs were selected for the construction of the plastid genome based on homologies and on raw reads flowing through contigs and connection information. We assembled and annotated a cranberry plastid genome (82,284 reads; 185x coverage) with a length of 176 kb and the typical structure found in plants, but with several structural rearrangements in the large single-copy region when compared to other plastid asterid genomes. To evaluate the reliability of the sequence data, phylogenetic analysis of 30 species outside the order Ericales (with 54 genes) showed Vaccinium inside the clade Asteridae, as reported in other studies using single genes. The cranberry plastid genome sequence will allow the accumulation of critical data useful for breeding and a suite of other genetic studies.

14.
The complete genome sequence and the gene organization of a novel insect picorna-like virus, Spodoptera exigua virus (SeV), were determined. The genomic RNA of the SeV was 9501 nt in length excluding the poly(A) tail and contained a single, large open reading frame (nt 392–9424) encoding a 3010 aa polyprotein. Sequence comparisons with other viral polyproteins revealed that the consensus sequences for picornavirus RNA helicase, cysteine protease, and RNA-dependent RNA polymerase (RdRp) proteins are found on the genome in that order from the 5′ to the 3′ end. In terms of sequence similarity, identity, and genome organization, SeV resembled insect picorna-like viruses belonging to the genus Iflavirus. A phylogenetic analysis based on the eight conserved domains in the RdRp sequence showed that SeV was most closely related to the Perina nuda virus and Ectropis obliqua picorna-like virus, suggesting that these three insect picorna-like viruses might share a common ancestor.

15.
Little is known about the variations of nematode mitogenomes (mtDNA). Sequencing a complete mtDNA using a PCR approach remains a challenge due to frequent genome reorganizations and low sequence similarities between divergent nematode lineages. Here, a genome skimming approach based on HiSeq sequencing (shotgun) was used to assemble de novo the first complete mtDNA sequence of a root-knot nematode (Meloidogyne graminicola). An AT-rich genome (84.3%) of 20,030 bp was obtained with a mean sequencing depth greater than 300. Thirty-six genes were identified with a semi-automated approach. A comparison with a gene map of the M. javanica mitochondrial genome indicates that the gene order is conserved within this nematode lineage. However, deep genome rearrangements were observed when comparing with other species of the superfamily Hoplolaimoidea. Repeat elements of 111 bp and 94 bp were found in a long non-coding region of 7.5 kb, as similarly reported in M. javanica and M. hapla. This study points out the power of next generation sequencing to produce complete mitochondrial genomes, even without a reference sequence, possibly opening new avenues for species/race identification, phylogenetics and population genetics of nematodes.

16.
Salvia miltiorrhiza is an important medicinal plant with great economic and medicinal value. The complete chloroplast (cp) genome sequence of Salvia miltiorrhiza, the first sequenced member of the Lamiaceae family, is reported here. The genome is 151,328 bp in length and exhibits a typical quadripartite structure of the large (LSC, 82,695 bp) and small (SSC, 17,555 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,539 bp). It contains 114 unique genes, including 80 protein-coding genes, 30 tRNAs and four rRNAs. The genome structure, gene order, GC content and codon usage are similar to the typical angiosperm cp genomes. Four forward, three inverted and seven tandem repeats were detected in the Salvia miltiorrhiza cp genome. Simple sequence repeat (SSR) analysis among the 30 asterid cp genomes revealed that most SSRs are AT-rich, which contributes to the overall AT richness of these cp genomes. Additionally, fewer SSRs are distributed in the protein-coding sequences compared to the non-coding regions, indicating an uneven distribution of SSRs within the cp genomes. Entire cp genome comparison of Salvia miltiorrhiza and three other Lamiales cp genomes showed a high degree of sequence similarity and a relatively high divergence of intergenic spacers. Sequence divergence analysis discovered the ten most divergent and ten most conserved genes as well as their length variation, which will be helpful for phylogenetic studies in asterids. Our analysis also supports that both regional and functional constraints affect gene sequence evolution. Further, phylogenetic analysis demonstrated a sister relationship between Salvia miltiorrhiza and Sesamum indicum. The complete cp genome sequence of Salvia miltiorrhiza reported in this paper will facilitate population, phylogenetic and cp genetic engineering studies of this medicinal plant.

17.
Francisella tularensis subspecies holarctica FTNF002-00 strain was originally obtained from the first known clinical case of bacteremic F. tularensis pneumonia in Southern Europe isolated from an immunocompetent individual. The FTNF002-00 complete genome contains the RD23 deletion and represents a type strain for a clonal population from the first epidemic tularemia outbreak in Spain in 1997–1998. Here, we present the complete sequence analysis of the FTNF002-00 genome. The complete genome sequence of FTNF002-00 revealed several large as well as small genomic differences with respect to two other published complete genome sequences of F. tularensis subsp. holarctica strains, LVS and OSU18. The FTNF002-00 genome shares >99.9% sequence similarity with LVS and OSU18, and is also ∼5 MB smaller by comparison. The overall organization of the FTNF002-00 genome is remarkably identical to those of LVS and OSU18, except for a single 3.9 kb inversion in FTNF002-00. Twelve regions of difference ranging from 0.1–1.5 kb and forty-two small insertions and deletions were identified in a comparative analysis of FTNF002-00, LVS, and OSU18 genomes. Two small deletions appear to inactivate two genes in FTNF002-00, causing them to become pseudogenes; the intact genes encode a protein of unknown function and a drug:H+ antiporter. In addition, we identified ninety-nine proteins in FTNF002-00 containing amino acid mutations compared to LVS and OSU18. Several non-conserved amino acid replacements were identified, one of which occurs in the virulence-associated intracellular growth locus subunit D protein. Many of these changes in FTNF002-00 are likely the consequence of direct selection that increases the fitness of this subsp. holarctica clone within its endemic population. Our complete genome sequence analyses lay the foundation for experimental testing of these possibilities.

18.
Despite major advances in next-generation sequencing, assembly of sequencing data, especially data from novel microorganisms or re-emerging pathogens, remains constrained by the lack of suitable reference sequences. De novo assembly is the best approach to achieve an accurate finished sequence, but multiple sequencing platforms or paired-end libraries are often required to achieve full genome coverage. In this study, we demonstrated a method to assemble complete bacterial genome sequences by integrating shotgun Roche 454 pyrosequencing with optical whole genome mapping (WGM). The whole genome restriction map (WGRM) was used as the reference to scaffold de novo assembled sequence contigs through a stepwise process. Large de novo contigs were placed in the correct order and orientation through alignment to the WGRM. De novo contigs that were not aligned to the WGRM were merged into scaffolds using contig branching structure information. These extended scaffolds were then aligned to the WGRM to identify the overlaps to be eliminated and the gaps and mismatches to be resolved with unused contigs. The process was repeated until a sequence with full coverage and alignment with the whole genome map was achieved. Using this method we were able to achieve 100% WGRM coverage without a paired-end library. We assembled complete sequences for three distinct genetic components of a clinical isolate of Providencia stuartii: a bacterial chromosome, a novel bla NDM-1 plasmid, and a novel bacteriophage, without separately purifying them to homogeneity.

19.
The complete genome sequence of Bacillus subtilis strain QB928 was constructed to facilitate studies in the evolution of the genetic code. With a widespread use of the strain in Bacillus subtilis genetics studies, its complete genome sequence would facilitate deeper understanding of Bacillus subtilis genetics.

20.
Magnolia grandiflora is an important medicinal, ornamental and horticultural plant species. The chloroplast (cp) genome of M. grandiflora was sequenced using a 454 sequencing platform and the genome structure was compared with other related species. The complete cp genome of M. grandiflora was 159623 bp in length and contained a pair of inverted repeats (IR) of 26563 bp separated by large and small single copy (LSC, SSC) regions of 87757 and 18740 bp, respectively. A total of 129 genes were successfully annotated, 18 of which included introns. The identity, number and GC content of M. grandiflora cp genes were similar to those of other Magnoliaceae species genomes. Analysis revealed 218 simple sequence repeat (SSR) loci, most composed of A or T, contributing to a bias in base composition. The types and abundances of repeat units in Magnoliaceae species were relatively conserved and these loci will be useful for developing M. grandiflora cp genome vectors. In addition, results indicated that the cp genome size in Magnoliaceae species and the position of the IR border were closely related to the length of the ycf1 gene. Phylogenetic analyses based on 66 shared genes from 30 species using maximum parsimony (MP) and maximum likelihood (ML) methods provided strong support for the phylogenetic position of Magnolia. The availability of the complete cp genome sequence of M. grandiflora provides valuable information for breeding of desirable varieties, cp genetic engineering, developing useful molecular markers and phylogenetic analyses in Magnoliaceae.

