首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 140 毫秒
1.
Synechocystis sp. PCC 6803 is a widely used model cyanobacterium for studying photosynthesis, phototaxis, the production of biofuels and many other aspects. Here we present a re-sequencing study of the genome and seven plasmids of one of the most widely used Synechocystis sp. PCC 6803 substrains, the glucose tolerant and motile Moscow or ‘PCC-M’ strain, revealing considerable evidence for recent microevolution. Seven single nucleotide polymorphisms (SNPs) specifically shared between ‘PCC-M’ and the ‘PCC-N and PCC-P’ substrains indicate that ‘PCC-M’ belongs to the ‘PCC’ group of motile strains. The identified indels and SNPs in ‘PCC-M’ are likely to affect glucose tolerance, motility, phage resistance, certain stress responses as well as functions in the primary metabolism, potentially relevant for the synthesis of alkanes. Three SNPs in intergenic regions could affect the promoter activities of two protein-coding genes and one cis-antisense RNA. Two deletions in ‘PCC-M’ affect parts of clustered regularly interspaced short palindrome repeats-associated spacer-repeat regions on plasmid pSYSA, in one case by an unusual recombination between spacer sequences.  相似文献   

2.
To gain genetic insights into the early-flowering phenotype of ornamental cherry, also known as sakura, we determined the genome sequences of two early-flowering cherry (Cerasus × kanzakura) varieties, ‘Kawazu-zakura’ and ‘Atami-zakura’. Because the two varieties are interspecific hybrids, likely derived from crosses between Cerasus campanulata (early-flowering species) and Cerasus speciosa, we employed the haplotype-resolved sequence assembly strategy. Genome sequence reads obtained from each variety by single-molecule real-time sequencing (SMRT) were split into two subsets, based on the genome sequence information of the two probable ancestors, and assembled to obtain haplotype-phased genome sequences. The resultant genome assembly of ‘Kawazu-zakura’ spanned 519.8 Mb with 1,544 contigs and an N50 value of 1,220.5 kb, while that of ‘Atami-zakura’ totalled 509.6 Mb with 2,180 contigs and an N50 value of 709.1 kb. A total of 72,702 and 69,528 potential protein-coding genes were predicted in the genome assemblies of ‘Kawazu-zakura’ and ‘Atami-zakura’, respectively. Gene clustering analysis identified 2,634 clusters uniquely presented in the C. campanulata haplotype sequences, which might contribute to its early-flowering phenotype. Genome sequences determined in this study provide fundamental information for elucidating the molecular and genetic mechanisms underlying the early-flowering phenotype of ornamental cherry tree varieties and their relatives.  相似文献   

3.
Scrub typhus (‘Tsutsugamushi’ disease in Japanese) is a mite-borne infectious disease. The causative agent is Orientia tsutsugamushi, an obligate intracellular bacterium belonging to the family Rickettsiaceae of the subdivision alpha-Proteobacteria. In this study, we determined the complete genome sequence of O. tsutsugamushi strain Ikeda, which comprises a single chromosome of 2 008 987 bp and contains 1967 protein coding sequences (CDSs). The chromosome is much larger than those of other members of Rickettsiaceae, and 46.7% of the sequence was occupied by repetitive sequences derived from an integrative and conjugative element, 10 types of transposable elements, and seven types of short repeats of unknown origins. The massive amplification and degradation of these elements have generated a huge number of repeated genes (1196 CDSs, categorized into 85 families), many of which are pseudogenes (766 CDSs), and also induced intensive genome shuffling. By comparing the gene content with those of other family members of Rickettsiacea, we identified the core gene set of the family Rickettsiaceae and found that, while much more extensive gene loss has taken place among the housekeeping genes of Orientia than those of Rickettsia, O. tsutsugamushi has acquired a large number of foreign genes. The O. tsutsugamushi genome sequence is thus a prominent example of the high plasticity of bacterial genomes, and provides the genetic basis for a better understanding of the biology of O. tsutsugamushi and the pathogenesis of ‘Tsutsugamushi’ disease.Key words: Orientia tsutsugamushi, genome sequencing, obligate intracellular bacterium, repetitive sequence, IS element, integrative and conjugative element, gene amplification, genome reduction  相似文献   

4.
In this work, it is described the sequencing and annotation of the genome of the yeast strain ISA1307, isolated from a sparkling wine continuous production plant. This strain, formerly considered of the Zygosaccharomyces bailii species, has been used to study Z. bailii physiology, in particular, its extreme tolerance to acetic acid stress at low pH. The analysis of the genome sequence described in this work indicates that strain ISA1307 is an interspecies hybrid between Z. bailii and a closely related species. The genome sequence of ISA1307 is distributed through 154 scaffolds and has a size of around 21.2 Mb, corresponding to 96% of the genome size estimated by flow cytometry. Annotation of ISA1307 genome includes 4385 duplicated genes (∼90% of the total number of predicted genes) and 1155 predicted single-copy genes. The functional categories including a higher number of genes are ‘Metabolism and generation of energy’, ‘Protein folding, modification and targeting’ and ‘Biogenesis of cellular components’. The knowledge of the genome sequence of the ISA1307 strain is expected to contribute to accelerate systems-level understanding of stress resistance mechanisms in Z. bailii and to inspire and guide novel biotechnological applications of this yeast species/strain in fermentation processes, given its high resilience to acidic stress. The availability of the ISA1307 genome sequence also paves the way to a better understanding of the genetic mechanisms underlying the generation and selection of more robust hybrid yeast strains in the stressful environment of wine fermentations.  相似文献   

5.
Ipomoea trifida (H. B. K.) G. Don. is the most likely diploid ancestor of the hexaploid sweet potato, I. batatas (L.) Lam. To assist in analysis of the sweet potato genome, de novo whole-genome sequencing was performed with two lines of I. trifida, namely the selfed line Mx23Hm and the highly heterozygous line 0431-1, using the Illumina HiSeq platform. We classified the sequences thus obtained as either ‘core candidates’ (common to the two lines) or ‘line specific’. The total lengths of the assembled sequences of Mx23Hm (ITR_r1.0) was 513 Mb, while that of 0431-1 (ITRk_r1.0) was 712 Mb. Of the assembled sequences, 240 Mb (Mx23Hm) and 353 Mb (0431-1) were classified into core candidate sequences. A total of 62,407 (62.4 Mb) and 109,449 (87.2 Mb) putative genes were identified, respectively, in the genomes of Mx23Hm and 0431-1, of which 11,823 were derived from core sequences of Mx23Hm, while 28,831 were from the core candidate sequence of 0431-1. There were a total of 1,464,173 single-nucleotide polymorphisms and 16,682 copy number variations (CNVs) in the two assembled genomic sequences (under the condition of log2 ratio of >1 and CNV size >1,000 bases). The results presented here are expected to contribute to the progress of genomic and genetic studies of I. trifida, as well as studies of the sweet potato and the genus Ipomoea in general.  相似文献   

6.
Prevotella intermedia is a pathogenic bacterium involved in periodontal diseases. Here, we present the complete genome sequence of a clinical strain, OMA14, of this bacterium along with the results of comparative genome analysis with strain 17 of the same species whose genome has also been sequenced, but not fully analysed yet. The genomes of both strains consist of two circular chromosomes: the larger chromosomes are similar in size and exhibit a high overall linearity of gene organizations, whereas the smaller chromosomes show a significant size variation and have undergone remarkable genome rearrangements. Unique features of the Pre. intermedia genomes are the presence of a remarkable number of essential genes on the second chromosomes and the abundance of conjugative and mobilizable transposons (CTns and MTns). The CTns/MTns are particularly abundant in the second chromosomes, involved in its extensive genome rearrangement, and have introduced a number of strain-specific genes into each strain. We also found a novel 188-bp repeat sequence that has been highly amplified in Pre. intermedia and are specifically distributed among the Pre. intermedia-related species. These findings expand our understanding of the genetic features of Pre. intermedia and the roles of CTns and MTns in the evolution of bacteria.  相似文献   

7.
The whole-genome sequence of carnation (Dianthus caryophyllus L.) cv. ‘Francesco’ was determined using a combination of different new-generation multiplex sequencing platforms. The total length of the non-redundant sequences was 568 887 315 bp, consisting of 45 088 scaffolds, which covered 91% of the 622 Mb carnation genome estimated by k-mer analysis. The N50 values of contigs and scaffolds were 16 644 bp and 60 737 bp, respectively, and the longest scaffold was 1 287 144 bp. The average GC content of the contig sequences was 36%. A total of 1050, 13, 92 and 143 genes for tRNAs, rRNAs, snoRNA and miRNA, respectively, were identified in the assembled genomic sequences. For protein-encoding genes, 43 266 complete and partial gene structures excluding those in transposable elements were deduced. Gene coverage was ∼98%, as deduced from the coverage of the core eukaryotic genes. Intensive characterization of the assigned carnation genes and comparison with those of other plant species revealed characteristic features of the carnation genome. The results of this study will serve as a valuable resource for fundamental and applied research of carnation, especially for breeding new carnation varieties. Further information on the genomic sequences is available at http://carnation.kazusa.or.jp.  相似文献   

8.
The development of the emerging field of ‘paleovirology’ allows biologists to reconstruct the evolutionary history of fossil endogenous retroviral sequences integrated within the genome of living organisms and has led to the retrieval of conserved, ancient retroviral genes ‘exapted’ by ancestral hosts to fulfil essential physiological roles, syncytin genes being undoubtedly among the most remarkable examples of such a phenomenon. Indeed, syncytins are ‘new’ genes encoding proteins derived from the envelope protein of endogenous retroviral elements that have been captured and domesticated on multiple occasions and independently in diverse mammalian species, through a process of convergent evolution. Knockout of syncytin genes in mice provided evidence for their absolute requirement for placenta development and embryo survival, via formation by cell–cell fusion of syncytial cell layers at the fetal–maternal interface. These genes of exogenous origin, acquired ‘by chance’ and yet still ‘necessary’ to carry out a basic function in placental mammals, may have been pivotal in the emergence of mammalian ancestors with a placenta from egg-laying animals via the capture of a founding retroviral env gene, subsequently replaced in the diverse mammalian lineages by new env-derived syncytin genes, each providing its host with a positive selective advantage.  相似文献   

9.
10.
Toxoplasma gondii, an obligate intracellular protozoan parasite of the phylum Apicomplexa, can infect all warm-blooded vertebrates, including humans, livestock, and marine mammals. The aim of this study was to investigate whether superoxide dismutase (SOD) of T. gondii can be used as a new marker for genetic study or a potential vaccine candidate. The partial genome region of the SOD gene was amplified and sequenced from 10 different T. gondii isolates from different parts of the world, and all the sequences were examined by PCR-RFLP, sequence analysis, and phylogenetic reconstruction. The results showed that partial SOD gene sequences ranged from 1,702 bp to 1,712 bp and A + T contents varied from 50.1% to 51.1% among all examined isolates. Sequence alignment analysis identified total 43 variable nucleotide positions, and these results showed that 97.5% sequence similarity of SOD gene among all examined isolates. Phylogenetic analysis revealed that these SOD sequences were not an effective molecular marker for differential identification of T. gondii strains. The research demonstrated existence of low sequence variation in the SOD gene among T. gondii strains of different genotypes from different hosts and geographical regions.  相似文献   

11.
Klebsiella pneumoniae U25 is a multidrug resistant strain isolated from a tertiary care hospital in Chennai, India. Here, we report the complete annotated genome sequence of strain U25 obtained using PacBio RSII. This is the first report of the whole genome of K. pneumoniaespecies from Chennai. It consists of a single circular chromosome of size 5,491,870-bp and two plasmids of size 211,813 and 172,619-bp. The genes associated with multidrug resistance were identified. The chromosome of U25 was found to have eight antibiotic resistant genes [blaOXA-1,blaSHV-28, aac(6’)1b-cr,catB3, oqxAB, dfrA1]. The plasmid pMGRU25-001 was found to have only one resistant gene (catA1) while plasmid pMGRU25-002 had 20 resistant genes [strAB, aadA1,aac(6’)-Ib, aac(3)-IId,sul1,2, blaTEM-1A,1B,blaOXA-9, blaCTX-M-15,blaSHV-11, cmlA1, erm(B),mph(A)]. A mutation in the porin OmpK36 was identified which is likely to be associated with the intermediate resistance to carbapenems in the absence of carbapenemase genes. U25 is one of the few K. pneumoniaestrains to harbour clustered regularly interspaced short palindromic repeats (CRISPR) systems. Two CRISPR arrays corresponding to Cas3 family helicase were identified in the genome. When compared to K. pneumoniaeNTUHK2044, a transposase gene InsH of IS5-13 was found inserted.  相似文献   

12.

Background and Aims

The production of triploid banana and plantain (Musa spp.) cultivars with improved characteristics (e.g. greater disease resistance or higher yield), while still preserving the main features of current popular cultivars (e.g. taste and cooking quality), remains a major challenge for Musa breeders. In this regard, breeders require a sound knowledge of the lineage of the current sterile triploid cultivars, to select diploid parents that are able to transmit desirable traits, together with a breeding strategy ensuring final triploidization and sterility. Highly polymorphic single sequence repeats (SSRs) are valuable markers for investigating phylogenetic relationships.

Methods

Here, the allelic distribution of each of 22 SSR loci across 561 Musa accessions is analysed.

Key Results and Conclusions

We determine the closest diploid progenitors of the triploid ‘Cavendish’ and ‘Gros Michel’ subgroups, valuable information for breeding programmes. Nevertheless, in establishing the likely monoclonal origin of the main edible triploid banana subgroups (i.e. ‘Cavendish’, ‘Plantain’ and ‘Mutika-Lujugira’), we postulated that the huge phenotypic diversity observed within these subgroups did not result from gamete recombination, but rather from epigenetic regulations. This emphasizes the need to investigate the regulatory mechanisms of genome expression on a unique model in the plant kingdom. We also propose experimental standards to compare additional and independent genotyping data for reference.  相似文献   

13.
Sequence Analysis of the Genome of an Oil-Bearing Tree, Jatropha curcas L.   总被引:2,自引:0,他引:2  
《DNA research》2011,18(1):65-76
The whole genome of Jatropha curcas was sequenced, using a combination of the conventional Sanger method and new-generation multiplex sequencing methods. Total length of the non-redundant sequences thus obtained was 285 858 490 bp consisting of 120 586 contigs and 29 831 singlets. They accounted for ∼95% of the gene-containing regions with the average G + C content was 34.3%. A total of 40 929 complete and partial structures of protein encoding genes have been deduced. Comparison with genes of other plant species indicated that 1529 (4%) of the putative protein-encoding genes are specific to the Euphorbiaceae family. A high degree of microsynteny was observed with the genome of castor bean and, to a lesser extent, with those of soybean and Arabidopsis thaliana. In parallel with genome sequencing, cDNAs derived from leaf and callus tissues were subjected to pyrosequencing, and a total of 21 225 unigene data have been generated. Polymorphism analysis using microsatellite markers developed from the genomic sequence data obtained was performed with 12 J. curcas lines collected from various parts of the world to estimate their genetic diversity. The genomic sequence and accompanying information presented here are expected to serve as valuable resources for the acceleration of fundamental and applied research with J. curcas, especially in the fields of environment-related research such as biofuel production. Further information on the genomic sequences and DNA markers is available at http://www.kazusa.or.jp/jatropha/.  相似文献   

14.
2005年多国合作的国际水稻(Oryza sativa)基因组测序项目绘制了粳稻(O.sativa subsp.japonica)品种日本晴的参考基因组序列。最近,中国科学家发布了2个籼稻(O.sativa subsp.indica)品种(明恢63和珍汕97)的高质量参考基因组序列,为籼稻的功能基因组学研究和分子育种应用提供了便利。  相似文献   

15.
植原体寄主种类多, 危害范围广, 开展其遗传多样性、关键基因调控等方面研究有助于提高该病害综合防治水平。通过长片段PCR引物扩增我国PaWB-sdyz、PaWB-fjfz和LY-fjya1植原体株系tuf基因及其上游6个基因的片段, 进行植原体基因启动子保守区域序列特征和多位点序列分析。利用启动子探针载体pSUPV4检测植原体tuf基因上游序列的启动子活性。扩增获得PaWB-sdyz、PaWB-fjfz、LY-fjya1株系tuf基因上游12,745-12,748 bp序列, 比较分析发现PaWB-sdyz、PaWB-fjfz、LY-fjya1、OY-M、AYWB、PAa、SLY、AT植原体株系tuf与其上游6个基因的结构顺序皆为5’-rplL-rpoB-rpoC-rps12-rps7-fusA-tuf-3’。推测出可能的植原体启动子保守区域模式序列: T90T100G92T75G67A85 (-35区); T90A96T92A98T73T90 (-10区)。基于8个植原体株系的rplL-tuf核苷酸序列编码基因、非编码序列、氨基酸序列的多位点序列分析可将不同植原体株系以较高的支持率清晰地区分, 不同植原体株系rplL-tuf核苷酸非编码区变异水平更高。16SrI组植原体tuf基因上游序列存在3种变异类型, 其代表株系PaWB-fjfz、LY-fjya1 tuf基因上游130 bp片段和CWB-hnsy1 tuf基因上游129 bp片段皆具有启动子活性。  相似文献   

16.
Evolvulus alsinoides, belonging to the family Convolvulaceae, is an important medicinal plant widely used as a nootropic in the Indian traditional medicine system. In the genus Evolvulus, no research on the chloroplast genome has been published. Hence, the present study focuses on annotation, characterization, identification of mutational hotspots, and phylogenetic analysis in the complete chloroplast genome (cp) of E. alsinoides. Genome comparison and evolutionary dynamics were performed with the species of Solanales. The cp genome has 114 genes (80 protein-coding genes, 30 transfer RNA, and 4 ribosomal RNA genes) that were unique with total genome size of 157,015 bp. The cp genome possesses 69 RNA editing sites and 44 simple sequence repeats (SSRs). Predicted SSRs were randomly selected and validated experimentally. Six divergent hotspots such as trnQ-UUG, trnF-GAA, psaI, clpP, ndhF, and ycf1 were discovered from the cp genome. These microsatellites and divergent hot spot sequences of the Taxa ‘Evolvulus’ could be employed as molecular markers for species identification and genetic divergence investigations. The LSC area was found to be more conserved than the SSC and IR region in genome comparison. The IR contraction and expansion studies show that nine genes rpl2, rpl23, ycf1, ycf2, ycf1, ndhF, ndhA, matK, and psbK were present in the IR-LSC and IR-SSC boundaries of the cp genome. Fifty-four protein-coding genes in the cp genome were under negative selection pressure, indicating that they were well conserved and were undergoing purifying selection. The phylogenetic analysis reveals that E. alsinoides is closely related to the genus Cressa with some divergence from the genus Ipomoea. This is the first time the chloroplast genome of the genus Evolvulus has been published. The findings of the present study and chloroplast genome data could be a valuable resource for future studies in population genetics, genetic diversity, and evolutionary relationship of the family Convolvulaceae.Supplementary InformationThe online version contains supplementary material available at 10.1007/s12298-021-01051-w.  相似文献   

17.

Background

Pseudomonas aeruginosa is an important opportunistic pathogen responsible for many infections in hospitalized and immunocompromised patients. Previous reports estimated that approximately 10% of its 6.6 Mbp genome varies from strain to strain and is therefore referred to as “accessory genome”. Elements within the accessory genome of P. aeruginosa have been associated with differences in virulence and antibiotic resistance. As whole genome sequencing of bacterial strains becomes more widespread and cost-effective, methods to quickly and reliably identify accessory genomic elements in newly sequenced P. aeruginosa genomes will be needed.

Results

We developed a bioinformatic method for identifying the accessory genome of P. aeruginosa. First, the core genome was determined based on sequence conserved among the completed genomes of twelve reference strains using Spine, a software program developed for this purpose. The core genome was 5.84 Mbp in size and contained 5,316 coding sequences. We then developed an in silico genome subtraction program named AGEnt to filter out core genomic sequences from P. aeruginosa whole genomes to identify accessory genomic sequences of these reference strains. This analysis determined that the accessory genome of P. aeruginosa ranged from 6.9-18.0% of the total genome, was enriched for genes associated with mobile elements, and was comprised of a majority of genes with unknown or unclear function. Using these genomes, we showed that AGEnt performed well compared to other publically available programs designed to detect accessory genomic elements. We then demonstrated the utility of the AGEnt program by applying it to the draft genomes of two previously unsequenced P. aeruginosa strains, PA99 and PA103.

Conclusions

The P. aeruginosa genome is rich in accessory genetic material. The AGEnt program accurately identified the accessory genomes of newly sequenced P. aeruginosa strains, even when draft genomes were used. As P. aeruginosa genomes become available at an increasingly rapid pace, this program will be useful in cataloging the expanding accessory genome of this bacterium and in discerning correlations between phenotype and accessory genome makeup. The combination of Spine and AGEnt should be useful in defining the accessory genomes of other bacterial species as well.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-737) contains supplementary material, which is available to authorized users.  相似文献   

18.
Liu YH  Cao JS  Li GJ  Wu XH  Wang BG  Xu P  Hu TT  Lu ZF  Patrick JW  Ruan YL 《Annals of botany》2012,109(7):1277-1284

Background and Aims

Coordination of sugar transport and metabolism between developing seeds and their enclosing fruit tissues is little understood. In this study the physiological mechanism is examined using two genotypes of asparagus bean (Vigna unguiculata ssp. sesquipedialis) differing in pod wall and seed growth rates. Pod growth dominates over seed growth in genotype ‘Zhijiang 121’ but not in ‘Zhijiang 282’ in which a ‘bulging pod’ phenotype is apparent from 8 d post-anthesis (dpa) onward.

Methods

Seed and pod wall growth rates and degree of pod-bulging were measured in the two genotypes together with assays of activities of sucrose-degrading enzymes and sugar content in pod wall and seed and evaluation of cellular pathways of phloem unloading in seed coat using a symplasmic fluorescent dye, 5(6)-carboxyfluorescein (CF).

Key Results

Activities of cell wall, cytoplasmic and vacuolar invertases (CWIN, CIN and VIN) were significantly smaller in pod walls of ‘282’ than in ‘121’ at 10 dpa onwards. Low INV activities were associated with weak pod wall growth of ‘282’. In seed coats, CF was confined within the vasculature in ‘282’ but moved beyond the vasculature in ‘121’, indicating apoplasmic and symplasmic phloem unloading, respectively. Higher CWIN activity in ‘282’ seed coats at 6–8 dpa correlated with high hexose concentration in embryos and enhanced early seed growth. However, CWIN activity in ‘282’ decreased significantly compared with ‘121’ from 10 dpa onwards, coinciding with earlier commencement of nuclei endoreduplication in their embryos.

Conclusions

The study shows genotypic differences between ‘bulging pod’ and ‘non-bulging’ phenotypes of asparagus bean in sucrose metabolism in relation to the pathway of phloem unloading in developing seed coats, and to pod and seed growth. Low INV activity in pod wall corresponds to its shortened and weak growth period; by contrast, the apoplasmic path in the seed coat is associated with high CWIN activity and strong early seed growth.  相似文献   

19.
An important step in ‘metagenomics’ analysis is the assembly of multiple genomes from mixed sequence reads of multiple species in a microbial community. Most conventional pipelines use a single-genome assembler with carefully optimized parameters. A limitation of a single-genome assembler for de novo metagenome assembly is that sequences of highly abundant species are likely misidentified as repeats in a single genome, resulting in a number of small fragmented scaffolds. We extended a single-genome assembler for short reads, known as ‘Velvet’, to metagenome assembly, which we called ‘MetaVelvet’, for mixed short reads of multiple species. Our fundamental concept was to first decompose a de Bruijn graph constructed from mixed short reads into individual sub-graphs, and second, to build scaffolds based on each decomposed de Bruijn sub-graph as an isolate species genome. We made use of two features, the coverage (abundance) difference and graph connectivity, for the decomposition of the de Bruijn graph. For simulated datasets, MetaVelvet succeeded in generating significantly higher N50 scores than any single-genome assemblers. MetaVelvet also reconstructed relatively low-coverage genome sequences as scaffolds. On real datasets of human gut microbial read data, MetaVelvet produced longer scaffolds and increased the number of predicted genes.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号