首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The complete nucleotide sequence of the duckweed (Lemna minor) chloroplast genome (cpDNA) was determined. The cpDNA is a circular molecule of 165,955 bp containing a pair of 31,223-bp inverted repeat regions (IRs), which are separated by small and large single-copy regions of 89,906 and 13,603 bp, respectively. The entire gene pool and relative positions of 112 genes (78 protein-encoding genes, 30 tRNA genes, and 4 rRNA genes) are almost identical to those of Amborella trichopoda cpDNA; the minor difference is the absence of infA and ycf15 genes in the duckweed cpDNA. The inverted repeat is expanded to include ycf1 and rps15 genes; this pattern is unique and does not occur in any other sequenced cpDNA of land plants. As in basal angiosperms and eudicots, but not in other monocots, the borders between IRs and a large single-copy region are located upstream of rps19 and downstream of trnH, so that trnH is not included in IRs. The model of rearrangements of the chloroplast genome during the evolution of monocots is proposed as the result of the comparison of cpDNA structures in duckweed and other monocots. The phylogenetic analyses of 61 protein-coding genes from 38 plastid genome sequences provided strong support for the monophyly of monocots and position of Lemna as the next diverging lineage of monocots after Acorales. Our analyses also provided support for Amborella as a sister to all other angiosperms, but in the bayesian phylogeny inference based on the first two codon positions Amborella united with Nymphaeales.  相似文献   

2.
This work reports the complete plastid (pt) DNA sequence of Seseli montanum L. of the Apiaceae family, determined using next-generation sequencing technology. The complete genome sequence has been deposited in GenBank with accession No. KM035851. The S. montanum plastome is 147,823 bp in length. The plastid genome has a typical structure for angiosperms and contains a large single-copy region (LSC) of 92,620 bp and a small single-copy region (SSC) of 17,481 bp separated by a pair of 18,861 bp inverted repeats (IRa and IRb). The composition, gene order, and AT-content in the S. montanum plastome are similar to that of a typical flowering plant pt DNA. One hundred fourteen unique genes have been identified, including 30 tRNA genes, four rRNA genes, and 80 protein genes. Of 18 intron-containing genes found, 16 genes have one intron, and two genes (ycf3, clpP) have two introns. Comparative analysis of Apiaceae plastomes reveals in the S. montanum plastome a LSC/IRb junction shift, so that the part of the ycf2 (4980 bp) gene is located in the LSC, but the other part of ycf2 (1301 bp) is within the inverted repeat. Thus, structural rearrangements in the plastid genome of S. montanum result in an enlargement of the LSC region by means of capture of a large part of ycf2, in contrast to eight Apiaceae plastomes where the complete ycf2 gene sequence is located in the inverted repeat.  相似文献   

3.
Chloroplast genome organization, gene order, and content are highly conserved among land plants. We sequenced the chloroplast genome of Trachelium caeruleum L. (Campanulaceae), a member of an angiosperm family known for highly rearranged genomes. The total genome size is 162,321 bp, with an inverted repeat (IR) of 27,273 bp, large single-copy (LSC) region of 100,114 bp, and small single-copy (SSC) region of 7,661 bp. The genome encodes 112 different genes, with 17 duplicated in the IR, a tRNA gene (trnI-cau) duplicated once in the LSC region, and a protein-coding gene (psbJ) with two duplicate copies, for a total of 132 putatively intact genes. ndhK may be a pseudogene with internal stop codons, and clpP, ycf1, and ycf2 are so highly diverged that they also may be pseudogenes. ycf15, rpl23, infA, and accD are truncated and likely nonfunctional. The most conspicuous feature of the Trachelium genome is the presence of 18 internally unrearranged blocks of genes inverted or relocated within the genome relative to the ancestral gene order of angiosperm chloroplast genomes. Recombination between repeats or tRNA genes has been suggested as a mechanism of chloroplast genome rearrangements. The Trachelium chloroplast genome shares with Pelargonium and Jasminum both a higher number of repeats and larger repeated sequences in comparison to eight other angiosperm chloroplast genomes, and these are concentrated near rearrangement endpoints. Genes for tRNAs occur at many but not all inversion endpoints, so some combination of repeats and tRNA genes may have mediated these rearrangements.  相似文献   

4.
The plastid genome of Trifolium subterraneum is 144,763 bp, about 20 kb longer than those of closely related legumes, which also lost one copy of the large inverted repeat (IR). The genome has undergone extensive genomic reconfiguration, including the loss of six genes (accD, infA, rpl22, rps16, rps18, and ycf1) and two introns (clpP and rps12) and numerous gene order changes, attributable to 14–18 inversions. All endpoints of rearranged gene clusters are flanked by repeated sequences, tRNAs, or pseudogenes. One unusual feature of the Trifolium subterraneum genome is the large number of dispersed repeats, which comprise 19.5% (ca. 28 kb) of the genome (versus about 4% for other angiosperms) and account for part of the increase in genome size. Nine genes (psbT, rbcL, clpP, rps3, rpl23, atpB, psbN, trnI-cau, and ycf3) have also been duplicated either partially or completely. rpl23 is the most highly duplicated gene, with portions of this gene duplicated six times. Comparisons of the Trifolium plastid genome with the Plant Repeat Database and searches for flanking inverted repeats suggest that the high incidence of dispersed repeats and rearrangements is not likely the result of transposition. Trifolium has 19.5 kb of unique DNA distributed among 160 fragments ranging in size from 30 to 494 bp, greatly surpassing the other five sequenced legume plastid genomes in novel DNA content. At least some of this unique DNA may represent horizontal transfer from bacterial genomes. These unusual features provide direction for the development of more complex models of plastid genome evolution. Electronic supplementary material  The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

5.
Joinvilleaceae is a family of tropical grass-like monocots that comprises only the genus Joinvillea. Previous studies have placed Joinvilleaceae in close phylogenetic proximity to the well-studied grass family. A full plastome sequence was determined and characterized for J. ascendens. The plastome was sequenced with next generation methods, fully assembled de novo and annotated. The assembly revealed two novel inversions specific to the Joinvilleaceae lineage and at least one novel plastid inversion in the Joinvilleaceae-Poaceae lineage. Two previously documented inversions in the Joinvilleaceae-Poaceae lineage and one previously documented inversion in the Poaceae lineage were also verified. Inversion events were identified visually and verified computationally by simulation mutations. Additionally, the loss and subsequent degradation of the accD gene in order Poales was explored extensively in Poaceae and J. ascendens. The two novel inversions along with changes in gene composition between families better delimited lineages in the Poales. The presence of large inversions and subsequent reversals in this small family suggested a high potential for large-scale rearrangements to occur in plastid genomes.  相似文献   

6.
Ma PF  Guo ZH  Li DZ 《PloS one》2012,7(1):e30297

Background

Compared to their counterparts in animals, the mitochondrial (mt) genomes of angiosperms exhibit a number of unique features. However, unravelling their evolution is hindered by the few completed genomes, of which are essentially Sanger sequenced. While next-generation sequencing technologies have revolutionized chloroplast genome sequencing, they are just beginning to be applied to angiosperm mt genomes. Chloroplast genomes of grasses (Poaceae) have undergone episodic evolution and the evolutionary rate was suggested to be correlated between chloroplast and mt genomes in Poaceae. It is interesting to investigate whether correlated rate change also occurred in grass mt genomes as expected under lineage effects. A time-calibrated phylogenetic tree is needed to examine rate change.

Methodology/Principal Findings

We determined a largely completed mt genome from a bamboo, Ferrocalamus rimosivaginus (Poaceae), through Illumina sequencing of total DNA. With combination of de novo and reference-guided assembly, 39.5-fold coverage Illumina reads were finally assembled into scaffolds totalling 432,839 bp. The assembled genome contains nearly the same genes as the completed mt genomes in Poaceae. For examining evolutionary rate in grass mt genomes, we reconstructed a phylogenetic tree including 22 taxa based on 31 mt genes. The topology of the well-resolved tree was almost identical to that inferred from chloroplast genome with only minor difference. The inconsistency possibly derived from long branch attraction in mtDNA tree. By calculating absolute substitution rates, we found significant rate change (∼4-fold) in mt genome before and after the diversification of Poaceae both in synonymous and nonsynonymous terms. Furthermore, the rate change was correlated with that of chloroplast genomes in grasses.

Conclusions/Significance

Our result demonstrates that it is a rapid and efficient approach to obtain angiosperm mt genome sequences using Illumina sequencing technology. The parallel episodic evolution of mt and chloroplast genomes in grasses is consistent with lineage effects.  相似文献   

7.

Background

The number of completely sequenced plastid genomes available is growing rapidly. This array of sequences presents new opportunities to perform comParative analyses. In comParative studies, it is often useful to compare across wide phylogenetic spans and, within angiosperms, to include representatives from basally diverging lineages such as the genomes reported here: Nuphar advena (from a basal-most lineage) and Ranunculus macranthus (a basal eudicot). We report these two new plastid genome sequences and make comparisons (within angiosperms, seed plants, or all photosynthetic lineages) to evaluate features such as the status of ycf15 and ycf68 as protein coding genes, the distribution of simple sequence repeats (SSRs) and longer dispersed repeats (SDR), and patterns of nucleotide composition.

Results

The Nuphar [GenBank:NC_008788] and Ranunculus [GenBank:NC_008796] plastid genomes share characteristics of gene content and organization with many other chloroplast genomes. Like other plastid genomes, these genomes are A+T-rich, except for rRNA and tRNA genes. Detailed comparisons of Nuphar with Nymphaea, another Nymphaeaceae, show that more than two-thirds of these genomes exhibit at least 95% sequence identity and that most SSRs are shared. In broader comparisons, SSRs vary among genomes in s of abundance and length and most contain repeat motifs based on A and T nucleotides.

Conclusion

SSR and SDR abundance varies by genome and, for SSRs, is proportional to genome size. Long SDRs are rare in the genomes assessed. SSRs occur less frequently than predicted and, although the majority of the repeat motifs do include A and T nucleotides, the A+T bias in SSRs is less than that predicted from the underlying genomic nucleotide composition. In codon usage third positions show an A+T bias, however variation in codon usage does not correlate with differences in A+T-richness. Thus, although plastome nucleotide composition shows "A+T richness", an A+T bias is not apparent upon more in-depth analysis, at least in these aspects. The pattern of evolution in the sequences identified as ycf15 and ycf68 is not consistent with them being protein-coding genes. In fact, these regions show no evidence of sequence conservation beyond what is normal for non-coding regions of the IR.  相似文献   

8.
Comparative chloroplast genome analyses are mostly carried out at lower taxonomic levels, such as the family and genus levels. At higher taxonomic levels, chloroplast genomes are generally used to reconstruct phylogenies. However, little attention has been paid to chloroplast genome evolution within orders. Here, we present the chloroplast genome of Sedum sarmentosum and take advantage of several available (or elucidated) chloroplast genomes to examine the evolution of chloroplast genomes in Saxifragales. The chloroplast genome of S. sarmentosum is 150,448 bp long and includes 82,212 bp of a large single-copy (LSC) region, 16.670 bp of a small single-copy (SSC) region, and a pair of 25,783 bp sequences of inverted repeats (IRs).The genome contains 131 unique genes, 18 of which are duplicated within the IRs. Based on a comparative analysis of chloroplast genomes from four representative Saxifragales families, we observed two gene losses and two pseudogenes in Paeonia obovata, and the loss of an intron was detected in the rps16 gene of Penthorum chinense. Comparisons among the 72 common protein-coding genes confirmed that the chloroplast genomes of S. sarmentosum and Paeonia obovata exhibit accelerated sequence evolution. Furthermore, a strong correlation was observed between the rates of genome evolution and genome size. The detected genome size variations are predominantly caused by the length of intergenic spacers, rather than losses of genes and introns, gene pseudogenization or IR expansion or contraction. The genome sizes of these species are negatively correlated with nucleotide substitution rates. Species with shorter duration of the life cycle tend to exhibit shorter chloroplast genomes than those with longer life cycles.  相似文献   

9.
Sesame (Sesamum indicum L.) is one of the oldest oilseed crops. In order to investigate the evolutionary characters according to the Sesame Genome Project, apart from sequencing its nuclear genome, we sequenced the complete chloroplast genome of S. indicum cv. Yuzhi 11 (white seeded) using Illumina and 454 sequencing. Comparisons of chloroplast genomes between S. indicum and the 18 other higher plants were then analyzed. The chloroplast genome of cv. Yuzhi 11 contains 153,338 bp and a total of 114 unique genes (KC569603). The number of chloroplast genes in sesame is the same as that in Nicotiana tabacum, Vitis vinifera and Platanus occidentalis. The variation in the length of the large single-copy (LSC) regions and inverted repeats (IR) in sesame compared to 18 other higher plant species was the main contributor to size variation in the cp genome in these species. The 77 functional chloroplast genes, except for ycf1 and ycf2, were highly conserved. The deletion of the cp ycf1 gene sequence in cp genomes may be due either to its transfer to the nuclear genome, as has occurred in sesame, or direct deletion, as has occurred in Panax ginseng and Cucumis sativus. The sesame ycf2 gene is only 5,721 bp in length and has lost about 1,179 bp. Nucleotides 1–585 of ycf2 when queried in BLAST had hits in the sesame draft genome. Five repeats (R10, R12, R13, R14 and R17) were unique to the sesame chloroplast genome. We also found that IR contraction/expansion in the cp genome alters its rate of evolution. Chloroplast genes and repeats display the signature of convergent evolution in sesame and other species. These findings provide a foundation for further investigation of cp genome evolution in Sesamum and other higher plants.  相似文献   

10.
Tilia is an ecologically and economically important genus in the family Malvaceae. However, there is no complete plastid genome of Tilia sequenced to date, and the taxonomy of Tilia is difficult owing to frequent hybridization and polyploidization. A well-supported interspecific relationships of this genus is not available due to limited informative sites from the commonly used molecular markers. We report here the complete plastid genome sequences of four Tilia species determined by the Illumina technology. The Tilia plastid genome is 162,653 bp to 162,796 bp in length, encoding 113 unique genes and a total number of 130 genes. The gene order and organization of the Tilia plastid genome exhibits the general structure of angiosperms and is very similar to other published plastid genomes of Malvaceae. As other long-lived tree genera, the sequence divergence among the four Tilia plastid genomes is very low. And we analyzed the nucleotide substitution patterns and the evolution of insertions and deletions in the Tilia plastid genomes. Finally, we build a phylogeny of the four sampled Tilia species with high supports using plastid phylogenomics, suggesting that it is an efficient way to resolve the phylogenetic relationships of this genus.  相似文献   

11.
12.
We sequenced to completion the circular plastid genome of the red alga Gracilaria tenuistipitata var. liui. This is the first plastid genome sequence from the subclass Florideophycidae (Rhodophyta). The genome is composed of 183,883 bp and contains 238 predicted genes, including a single copy of the ribosomal RNA operon. Comparisons with the plastid genome of Porphyra pupurea reveal strong conservation of gene content and order, but we found major genomic rearrangements and the presence of coding regions that are specific to Gracilaria. Phylogenetic analysis of a data set of 41 concatenated proteins from 23 plastid and two cyanobacterial genomes support red algal plastid monophyly and a specific evolutionary relationship between the Florideophycidae and the Bangiales. Gracilaria maintains a surprisingly ancient gene content in its plastid genome and, together with other Rhodophyta, contains the most complete repertoire of plastid genes known in photosynthetic eukaryotes.Supplementary material () is available for this article.[Reviewing Editor: Dr. W. Ford Doolittle]  相似文献   

13.
14.
In this study, we fully sequenced the circular plastid genome of a brown alga, Undaria pinnatifida. The genome is 130,383 base pairs (bp) in size; it contains a large single-copy (LSC, 76,598 bp) and a small single-copy region (SSC, 42,977 bp), separated by two inverted repeats (IRa and IRb: 5,404 bp). The genome contains 139 protein-coding, 28 tRNA, and 6 rRNA genes; none of these genes contains introns. Organization and gene contents of the U. pinnatifida plastid genome were similar to those of Saccharina japonica. There is a co-linear relationship between the plastid genome of U. pinnatifida and that of three previously sequenced large brown algal species. Phylogenetic analyses of 43 taxa based on 23 plastid protein-coding genes grouped all plastids into a red or green lineage. In the large brown algae branch, U. pinnatifida and S. japonica formed a sister clade with much closer relationship to Ectocarpus siliculosus than to Fucus vesiculosus. For the first time, the start codon ATT was identified in the plastid genome of large brown algae, in the atpA gene of U. pinnatifida. In addition, we found a gene-length change induced by a 3-bp repetitive DNA in ycf35 and ilvB genes of the U. pinnatifida plastid genome.  相似文献   

15.
We have applied a two-gene system based on the sequences of nuclear genes encoding multi-domain plastid acetyl-CoA carboxylase (ACCase) and plastid 3-phosphoglycerate kinase (PGK) to study grass evolution. Our analysis revealed that these genes are single-copy in most of the grass species studied, allowing the establishment of orthologous relationships between them. These relationships are consistent with the known facts of their evolution: the eukaryotic origin of the plastid ACCase, created by duplication of a gene encoding the cytosolic multi-domain ACCase gene early in grass evolution, and the prokaryotic (endosymbiont) origin of the plastid PGK. The major phylogenetic relationships among grasses deduced from the nucleotide sequence comparisons of ACCase and PGK genes are consistent with each other and with the milestones of grass evolution revealed by other methods. Nucleotide substitution rates were calculated based on multiple pairwise sequence comparisons. On a relative basis, with the divergence of the Pooideae and Panicoideae subfamilies set at 60 million years ago (MYA), events leading to the Triticum/Aegilops complex occurred at the following intervals: divergence of Lolium (Lolium rigidum) at 35 MYA, divergence of Hordeum (Hordeum vulgare) at 11 MYA and divergence of Secale (Secale cereale) at 7 MYA. On the same scale, gene duplication leading to the multi-domain plastid ACCase in grasses occurred at 129 MYA, divergence of grass and dicot plastid PGK genes at 137 MYA, and divergence of grass and dicot cytosolic PGK genes at 155 MYA. The ACCase and PGK genes provide a well-understood two-locus system to study grass phylogeny, evolution and systematics.  相似文献   

16.
17.
The complete nucleotide sequence of the cucumber (C. sativus L. var. Borszczagowski) chloroplast genome has been determined. The genome is composed of 155,293 bp containing a pair of inverted repeats of 25,191 bp, which are separated by two single-copy regions, a small 18,222-bp one and a large 86,688-bp one. The chloroplast genome of cucumber contains 130 known genes, including 89 protein-coding genes, 8 ribosomal RNA genes (4 rRNA species), and 37 tRNA genes (30 tRNA species), with 18 of them located in the inverted repeat region. Of these genes, 16 contain one intron, and two genes and one ycf contain 2 introns. Twenty-one small inversions that form stem-loop structures, ranging from 18 to 49 bp, have been identified. Eight of them show similarity to those of other species, while eight seem to be cucumber specific. Detailed comparisons of ycf2 and ycf15, and the overall structure to other chloroplast genomes were performed.  相似文献   

18.
The grass family (Poaceae) includes all commercial cereal crops and is a major contributor to biomass in various terrestrial ecosystems. The ancestry of all grass genomes includes a shared whole-genome duplication (WGD), named rho (ρ) WGD, but the evolutionary significance of ρ-WGD remains elusive. We sequenced the genome of Pharus latifolius, a grass species (producing a true spikelet) in the subfamily Pharoideae, a sister lineage to the core Poaceae including the (Panicoideae, Arundinoideae, Chloridoideae, Micrairoideae, Aristidoideae, and Danthonioideae (PACMAD) and Bambusoideae, Oryzoideae, and Pooideae (BOP) clades. Our results indicate that the P. latifolius genome has evolved slowly relative to cereal grass genomes, as reflected by moderate rates of molecular evolution, limited chromosome rearrangements and a low rate of gene loss for duplicated genes. We show that the ρ-WGD event occurred approximately 98.2 million years ago (Ma) in a common ancestor of the Pharoideae and the PACMAD and BOP grasses. This was followed by contrasting patterns of diploidization in the Pharus and core Poaceae lineages. The presence of two FRIZZY PANICLE-like genes in P. latifolius, and duplicated MADS-box genes, support the hypothesis that the ρ-WGD may have played a role in the origin and functional diversification of the spikelet, an adaptation in grasses related directly to cereal yields. The P. latifolius genome sheds light on the origin and early evolution of grasses underpinning the biology and breeding of cereals.

The Pharus genome fills an important genomic gap, providing numerous insights into how whole-genome duplication contributed to the origin and diversification of the grass family.  相似文献   

19.
The complete plastid genome sequence of the American cranberry (Vaccinium macrocarpon Ait.) was reconstructed using next-generation sequencing data by in silico procedures. We used Roche 454 shotgun sequence data to isolate cranberry plastid-specific sequences of “HyRed” via homology comparisons with complete sequences from several species available at the National Center for Biotechnology Information database. Eleven cranberry plastid contigs were selected for the construction of the plastid genome-based homologies and on raw reads flowing through contigs and connection information. We assembled and annotated a cranberry plastid genome (82,284 reads; 185x coverage) with a length of 176 kb and the typical structure found in plants, but with several structural rearrangements in the large single-copy region when compared to other plastid asterid genomes. To evaluate the reliability of the sequence data, phylogenetic analysis of 30 species outside the order Ericales (with 54 genes) showed Vaccinium inside the clade Asteridae, as reported in other studies using single genes. The cranberry plastid genome sequence will allow the accumulation of critical data useful for breeding and a suite of other genetic studies.  相似文献   

20.
Although plastid genome (plastome) structure is highly conserved across most seed plants, investigations during the past two decades have revealed several disparately related lineages that experienced substantial rearrangements. Most plastomes contain a large inverted repeat and two single-copy regions, and a few dispersed repeats; however, the plastomes of some taxa harbour long repeat sequences (>300 bp). These long repeats make it challenging to assemble complete plastomes using short-read data, leading to misassemblies and consensus sequences with spurious rearrangements. Single-molecule, long-read sequencing has the potential to overcome these challenges, yet there is no consensus on the most effective method for accurately assembling plastomes using long-read data. We generated a pipeline, plastid Genome Assembly Using Long-read data (ptGAUL), to address the problem of plastome assembly using long-read data from Oxford Nanopore Technologies (ONT) or Pacific Biosciences platforms. We demonstrated the efficacy of the ptGAUL pipeline using 16 published long-read data sets. We showed that ptGAUL quickly produces accurate and unbiased assemblies using only ~50× coverage of plastome data. Additionally, we deployed ptGAUL to assemble four new Juncus (Juncaceae) plastomes using ONT long reads. Our results revealed many long repeats and rearrangements in Juncus plastomes compared with basal lineages of Poales. The ptGAUL pipeline is available on GitHub: https://github.com/Bean061/ptgaul .  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号