Similar literature
Found 20 similar articles (search time: 15 ms)
1.
The sequence of the chloroplast genome, which is inherited maternally, contains useful information for many scientific fields such as plant systematics, biogeography and biotechnology because its characteristics are highly conserved among species. The number of sequenced angiosperm chloroplast genomes has increased in recent years. In this study, the complete nucleotide sequence of the chloroplast genome (cpDNA) of Veratrum patulum Loes. (Melanthiaceae, Liliales) was analyzed. The circular double-stranded DNA of 153,699 bp consists of two inverted repeat (IR) regions of 26,360 bp each, a large single copy region of 83,372 bp, and a small single copy region of 17,607 bp. This plastome contains 81 protein-coding genes, 30 distinct tRNA genes and four rRNA genes. In addition, there are six hypothetical coding regions (ycf1, ycf2, ycf3, ycf4, ycf15 and ycf68) and two open reading frames (ORF42 and ORF56), which are also found in the chloroplast genomes of the other species. The gene order and gene content of the V. patulum plastid genome are similar to those of Smilax china, Lilium longiflorum and Alstroemeria aurea, members of the Smilacaceae, Liliaceae and Alstroemeriaceae (Liliales), respectively. However, the loss of rps16 exon 2 in V. patulum results in a difference in the large single copy region in comparison with the other species. The base substitution rate is quite similar among genes of these species. Additionally, the base substitution rate of the inverted repeat region was smaller than that of the single copy regions in all observed species of Liliales. The IR regions were expanded to trnH-GUG in V. patulum, to a part of rps19 in L. longiflorum and A. aurea, and to the whole sequence of rps19 in S. china. Furthermore, the IGS lengths of the rbcL-accD-psaI region were variable among Liliales species, suggesting that this region might be a hotspot of indel events and an informative site for phylogenetic studies in Liliales. In general, the whole chloroplast genome of V. patulum, a potential medicinal plant, will contribute to research on the genetic applications of this genus.
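The region lengths quoted in the abstract above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not taken from the paper: it assumes the usual quadripartite layout, in which the total length equals LSC + SSC + 2 × IR, and the function name `plastome_length` is a hypothetical helper.

```python
# Region lengths (bp) as reported in the abstract for Veratrum patulum.
LSC = 83_372  # large single copy region
SSC = 17_607  # small single copy region
IR = 26_360   # one inverted repeat; the plastome carries two copies


def plastome_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a quadripartite plastome: LSC + SSC + 2 * IR."""
    return lsc + ssc + 2 * ir


total = plastome_length(LSC, SSC, IR)
# Share of the genome occupied by the two IR copies together.
ir_fraction = 2 * IR / total

print(total)                  # compare with the reported 153,699 bp
print(round(ir_fraction, 3))  # fraction of the plastome inside the IRs
```

The same helper can be applied to the other plastomes in this list whose LSC, SSC and IR lengths are reported, as a quick consistency check on the quoted totals.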

2.
Sesame (Sesamum indicum L.) is one of the oldest oilseed crops. To investigate its evolutionary characteristics as part of the Sesame Genome Project, we sequenced, in addition to its nuclear genome, the complete chloroplast genome of S. indicum cv. Yuzhi 11 (white seeded) using Illumina and 454 sequencing. The chloroplast genomes of S. indicum and 18 other higher plants were then compared. The chloroplast genome of cv. Yuzhi 11 contains 153,338 bp and a total of 114 unique genes (KC569603). The number of chloroplast genes in sesame is the same as that in Nicotiana tabacum, Vitis vinifera and Platanus occidentalis. The variation in the length of the large single-copy (LSC) regions and inverted repeats (IR) in sesame compared to 18 other higher plant species was the main contributor to size variation in the cp genome in these species. The 77 functional chloroplast genes, except for ycf1 and ycf2, were highly conserved. The deletion of the cp ycf1 gene sequence in cp genomes may be due either to its transfer to the nuclear genome, as has occurred in sesame, or to direct deletion, as has occurred in Panax ginseng and Cucumis sativus. The sesame ycf2 gene is only 5,721 bp in length and has lost about 1,179 bp. Nucleotides 1–585 of ycf2, when queried in BLAST, had hits in the sesame draft genome. Five repeats (R10, R12, R13, R14 and R17) were unique to the sesame chloroplast genome. We also found that IR contraction/expansion in the cp genome alters its rate of evolution. Chloroplast genes and repeats display the signature of convergent evolution in sesame and other species. These findings provide a foundation for further investigation of cp genome evolution in Sesamum and other higher plants.

3.

Background

Camellia, comprising more than 200 species, is a valuable economic commodity due to its enormously popular commercial products: tea leaves, flowers, and high-quality edible oils. It is the largest and most important genus in the family Theaceae. However, phylogenetic resolution of the species has proven to be difficult. Consequently, the interspecies relationships of the genus Camellia are still hotly debated. Phylogenomics is an attractive avenue that can be used to reconstruct the tree of life, especially at low taxonomic levels.

Methodology/Principal Findings

Seven complete chloroplast (cp) genomes were sequenced from six species representing different subdivisions of the genus Camellia using Illumina sequencing technology. Four junctions between the single-copy segments and the inverted repeats were confirmed, and genome assemblies were validated by PCR-based product sequencing using 123 pairs of primers covering preliminary cp genome assemblies. The Camellia cp genome was found to be about 157 kb long and to contain 123 unique genes, of which 23 were duplicated in the IR regions. We determined that the complete Camellia cp genome was relatively well conserved, but contained enough genetic differences to provide useful phylogenetic information. Phylogenetic relationships were analyzed using seven complete cp genomes of six Camellia species. We also identified rapidly evolving regions of the cp genome that have the potential to be used for further species identification and phylogenetic resolution.

Conclusions/Significance

In this study, we wanted to determine whether analyzing completely sequenced cp genomes could help settle these controversies over interspecies relationships in Camellia. The results demonstrate that cp genome data are beneficial in resolving species definition because they indicate that organelle-based "barcodes" can be established for a species and then used to unmask interspecies phylogenetic relationships. They also reveal that phylogenomics based on cp genomes is an effective approach for achieving phylogenetic resolution between Camellia species.

4.
5.

Background

The number of completely sequenced plastid genomes available is growing rapidly. This array of sequences presents new opportunities to perform comparative analyses. In comparative studies, it is often useful to compare across wide phylogenetic spans and, within angiosperms, to include representatives from basally diverging lineages such as the genomes reported here: Nuphar advena (from a basal-most lineage) and Ranunculus macranthus (a basal eudicot). We report these two new plastid genome sequences and make comparisons (within angiosperms, seed plants, or all photosynthetic lineages) to evaluate features such as the status of ycf15 and ycf68 as protein coding genes, the distribution of simple sequence repeats (SSRs) and longer dispersed repeats (SDR), and patterns of nucleotide composition.

Results

The Nuphar [GenBank:NC_008788] and Ranunculus [GenBank:NC_008796] plastid genomes share characteristics of gene content and organization with many other chloroplast genomes. Like other plastid genomes, these genomes are A+T-rich, except for rRNA and tRNA genes. Detailed comparisons of Nuphar with Nymphaea, another member of the Nymphaeaceae, show that more than two-thirds of these genomes exhibit at least 95% sequence identity and that most SSRs are shared. In broader comparisons, SSRs vary among genomes in terms of abundance and length, and most contain repeat motifs based on A and T nucleotides.

Conclusion

SSR and SDR abundance varies by genome and, for SSRs, is proportional to genome size. Long SDRs are rare in the genomes assessed. SSRs occur less frequently than predicted and, although the majority of the repeat motifs do include A and T nucleotides, the A+T bias in SSRs is less than that predicted from the underlying genomic nucleotide composition. In codon usage, third positions show an A+T bias; however, variation in codon usage does not correlate with differences in A+T-richness. Thus, although plastome nucleotide composition shows "A+T richness", an A+T bias is not apparent upon more in-depth analysis, at least in these aspects. The pattern of evolution in the sequences identified as ycf15 and ycf68 is not consistent with them being protein-coding genes. In fact, these regions show no evidence of sequence conservation beyond what is normal for non-coding regions of the IR.
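To make the two quantities discussed in this abstract concrete, the following is a minimal sketch of how mononucleotide SSRs and overall A+T content might be tallied for a sequence. It is not the authors' method: the minimum run length of 8, the function names `find_mono_ssrs` and `at_fraction`, and the toy sequence are all assumptions chosen for illustration.

```python
import re


def find_mono_ssrs(seq: str, min_repeats: int = 8):
    """Return (start, base, length) for each mononucleotide run of at least
    `min_repeats` bases. Starts are 0-based. The threshold is an assumption."""
    pattern = re.compile(r"([ACGT])\1{%d,}" % (min_repeats - 1))
    return [(m.start(), m.group(1), len(m.group(0)))
            for m in pattern.finditer(seq.upper())]


def at_fraction(seq: str) -> float:
    """Fraction of A and T bases in a non-empty sequence."""
    if not seq:
        raise ValueError("sequence must not be empty")
    seq = seq.upper()
    return (seq.count("A") + seq.count("T")) / len(seq)


# Toy sequence (not real plastid data): one A run of 10 and one T run of 8.
toy = "GCGC" + "A" * 10 + "CGTACG" + "T" * 8 + "GGGCC"
ssrs = find_mono_ssrs(toy)

# A+T share among SSR bases versus the sequence as a whole.
ssr_at = sum(n for _, base, n in ssrs if base in "AT") / sum(n for _, _, n in ssrs)
print(ssrs, round(at_fraction(toy), 3), ssr_at)
```

Comparing the A+T share of the SSR bases with the A+T share of the whole sequence is the kind of contrast the abstract draws; a real analysis would also cover di- and longer-motif repeats and a null expectation, which this sketch omits.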

6.
7.
The chloroplast genomes of the pennate diatom Phaeodactylum tricornutum and the centric diatom Thalassiosira pseudonana have been completely sequenced and are compared with those of other secondary plastids of the red lineage: the centric diatom Odontella sinensis, the haptophyte Emiliania huxleyi, and the cryptophyte Guillardia theta. All five chromist genomes are compact, with small intergenic regions and no introns. The three diatom genomes are similar in gene content with 127-130 protein-coding genes, and genes for 27 tRNAs, three ribosomal RNAs and two small RNAs (tmRNA and signal recognition particle RNA). All three genomes have open-reading frames corresponding to ORFs148, 355 and 380 of O. sinensis, which have been assigned the names ycf88, ycf89 and ycf90. Gene order is not strictly conserved, but there are a number of conserved gene clusters showing remnants of red algal origin. The acpP, tsf and psb28 genes appear to be on the way from the plastid to the host nucleus, indicating that endosymbiotic gene transfer is a continuing process.

8.
9.
Genomics, 2020, 112(1): 659-668
The NCBI database has >15 chloroplast (cp) genome sequences available for different Camellia species but none for C. assamica. There is no report of any mitochondrial (mt) genome in the Camellia genus or the Theaceae family. In the strong belief that these organelle genomes can serve as valuable tools for taxonomic and phylogenetic analysis, we assembled and analyzed the cp and mt genomes of C. assamica. We assembled the complete mt genome of C. assamica as a single circular contig of 707,441 bp comprising a total of 66 annotated genes, including 35 protein-coding genes, 29 tRNAs and two rRNAs. The first cp genome of C. assamica was assembled as a circular contig of 157,353 bp with a typical quadripartite structure. Phylogenetic analysis based on these organelle genomes showed that C. assamica was closely related to C. sinensis and C. leptophylla. The analysis also supports the placement of Caryophyllales in the Superasterids.

10.
The complete nucleotide sequence of the cucumber (C. sativus L. var. Borszczagowski) chloroplast genome has been determined. The genome is composed of 155,293 bp containing a pair of inverted repeats of 25,191 bp, which are separated by two single-copy regions, a small 18,222-bp one and a large 86,688-bp one. The chloroplast genome of cucumber contains 130 known genes, including 89 protein-coding genes, 8 ribosomal RNA genes (4 rRNA species), and 37 tRNA genes (30 tRNA species), with 18 of them located in the inverted repeat region. Of these genes, 16 contain one intron, and two genes and one ycf contain 2 introns. Twenty-one small inversions that form stem-loop structures, ranging from 18 to 49 bp, have been identified. Eight of them show similarity to those of other species, while eight seem to be cucumber specific. Detailed comparisons of ycf2 and ycf15, and the overall structure to other chloroplast genomes were performed.

11.
Chloroplast genome organization, gene order, and content are highly conserved among land plants. We sequenced the chloroplast genome of Trachelium caeruleum L. (Campanulaceae), a member of an angiosperm family known for highly rearranged genomes. The total genome size is 162,321 bp, with an inverted repeat (IR) of 27,273 bp, large single-copy (LSC) region of 100,114 bp, and small single-copy (SSC) region of 7,661 bp. The genome encodes 112 different genes, with 17 duplicated in the IR, a tRNA gene (trnI-cau) duplicated once in the LSC region, and a protein-coding gene (psbJ) with two duplicate copies, for a total of 132 putatively intact genes. ndhK may be a pseudogene with internal stop codons, and clpP, ycf1, and ycf2 are so highly diverged that they also may be pseudogenes. ycf15, rpl23, infA, and accD are truncated and likely nonfunctional. The most conspicuous feature of the Trachelium genome is the presence of 18 internally unrearranged blocks of genes inverted or relocated within the genome relative to the ancestral gene order of angiosperm chloroplast genomes. Recombination between repeats or tRNA genes has been suggested as a mechanism of chloroplast genome rearrangements. The Trachelium chloroplast genome shares with Pelargonium and Jasminum both a higher number of repeats and larger repeated sequences in comparison to eight other angiosperm chloroplast genomes, and these are concentrated near rearrangement endpoints. Genes for tRNAs occur at many but not all inversion endpoints, so some combination of repeats and tRNA genes may have mediated these rearrangements.

12.
The plastid genomes of early-diverging angiosperms were among the first land plant plastomes investigated. Despite their importance to understanding angiosperm evolution, no investigation has so far compared gene content or gene synteny of these plastid genomes with a focus on the Nymphaeales. Here, we report an evaluation and comparison of gene content, gene synteny and inverted repeat length for a set of 15 plastid genomes of early-diverging angiosperms. Seven plastid genomes of the Nymphaeales were newly sequenced for this investigation. We compare gene order and inverted repeat (IR) length across all genomes, review the gene annotations of previously published genomes, generate a multi-gene alignment of 77 plastid-encoded genes and reconstruct the phylogenetic relationships of the taxa under study. Our results show that gene content and synteny are highly conserved across early-diverging angiosperms: All species analyzed display complete gene synteny when accounting for expansions and contractions of the IRs. This conservation was initially obscured by ambiguous and potentially incorrect gene annotations in previously published genomes. We also report the presence of intact open reading frames across all taxa analyzed. The multi-gene phylogeny displays maximum support for the families Cabombaceae and Hydatellaceae, but no support for a clade of all Nymphaeaceae. It further indicates that the genus Victoria is embedded within Nymphaea. Plastid genomes of Trithuria were found to deviate by numerous substitutions and length changes in the IRs. Phylogenetic analyses further indicate that a previously published plastome named Nymphaea mexicana falls into a clade of N. odorata and should be re-evaluated.

13.
This study presents, for the first time, the complete chloroplast genomes of two Cleomaceae species, Dipterygium glaucum and Cleome chrysantha, in order to evaluate their evolutionary relationship. The cp genome is 158,576 bp in length with 35.74% GC content in D. glaucum and 158,111 bp with 35.96% GC content in C. chrysantha. The inverted repeats (IR) are 26,209 bp and 26,251 bp each, the LSC is 87,738 bp and 87,184 bp, and the SSC is 18,420 bp and 18,425 bp, respectively. Both chloroplast genomes contain 136 genes, including 80 protein-coding genes, 31 tRNA genes and four rRNA genes; 117 genes are unique, while the remaining 19 are duplicated in the IR regions. Repeat analysis shows that the cp genomes include all types of repeats, with palindromic repeats occurring most frequently. The total number of simple sequence repeats (SSRs) was 323 in D. glaucum and 313 in C. chrysantha; the majority of the SSRs in these plastid genomes were mononucleotide A/T repeats located in the intergenic spacers. Moreover, comparative analysis of the four cp sequences revealed four hotspot genes (atpF, rpoC2, rps19, and ycf1); these variable regions could be used as molecular markers for species authentication as well as resources for inferring phylogenetic relationships of the species. All relationships in the phylogenetic tree are highly supported, indicating that the complete chloroplast genome is a useful source of data for inferring phylogenetic relationships within the Cleomaceae and other families. The simple sequence repeats identified will be useful for identification, genetic diversity, and other evolutionary studies of the species. This study reports the first cp genomes of the genera Dipterygium and Cleome. The findings will be beneficial for biological disciplines such as evolutionary and genetic diversity studies of species within the core Cleomaceae.

14.
15.
16.
17.
This work reports the complete plastid (pt) DNA sequence of Seseli montanum L. of the Apiaceae family, determined using next-generation sequencing technology. The complete genome sequence has been deposited in GenBank with accession No. KM035851. The S. montanum plastome is 147,823 bp in length. The plastid genome has a typical structure for angiosperms and contains a large single-copy region (LSC) of 92,620 bp and a small single-copy region (SSC) of 17,481 bp separated by a pair of 18,861 bp inverted repeats (IRa and IRb). The composition, gene order, and AT-content in the S. montanum plastome are similar to those of a typical flowering plant pt DNA. One hundred fourteen unique genes have been identified, including 30 tRNA genes, four rRNA genes, and 80 protein genes. Of 18 intron-containing genes found, 16 genes have one intron, and two genes (ycf3, clpP) have two introns. Comparative analysis of Apiaceae plastomes reveals an LSC/IRb junction shift in the S. montanum plastome, so that one part of the ycf2 gene (4980 bp) is located in the LSC, while the other part (1301 bp) lies within the inverted repeat. Thus, structural rearrangements in the plastid genome of S. montanum result in an enlargement of the LSC region by means of capture of a large part of ycf2, in contrast to eight Apiaceae plastomes where the complete ycf2 gene sequence is located in the inverted repeat.
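The junction shift described above amounts to splitting one gene interval at a region boundary. The sketch below illustrates that bookkeeping under stated assumptions: coordinates are 1-based and inclusive, the genome is numbered from the first base of the LSC so that the LSC ends at position 92,620, and the ycf2 start and end coordinates are hypothetical values constructed to match the 4980 bp and 1301 bp parts reported in the abstract. They are not taken from the GenBank record.

```python
def split_at_junction(gene_start: int, gene_end: int, junction: int):
    """Split a gene interval (1-based, inclusive) at a region boundary.

    `junction` is the last coordinate of the upstream region (here the LSC).
    Returns (bp at or before the junction, bp after the junction).
    """
    if gene_start > gene_end:
        raise ValueError("gene_start must not exceed gene_end")
    upstream = max(0, min(gene_end, junction) - gene_start + 1)
    downstream = max(0, gene_end - max(gene_start, junction + 1) + 1)
    return upstream, downstream


LSC_END = 92_620                  # LSC length from the abstract (assumed numbering origin)
YCF2_START = LSC_END - 4_980 + 1  # hypothetical coordinate, for illustration only
YCF2_END = LSC_END + 1_301        # hypothetical coordinate, for illustration only

in_lsc, in_ir = split_at_junction(YCF2_START, YCF2_END, LSC_END)
print(in_lsc, in_ir, in_lsc + in_ir)
```

For a plastome in which ycf2 lies entirely inside the inverted repeat, as in the eight other Apiaceae plastomes mentioned, the first returned value would be zero.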

18.
The latest crystallographic model of the cyanobacterial photosystem II (PS II) core complex added one transmembrane low molecular weight (LMW) component to the previous model, suggesting the presence of an unknown transmembrane LMW component in PS II. We have investigated the polypeptide composition in highly purified intact PS II core complexes from Thermosynechococcus elongatus, the species which yielded the PS II crystallographic models described above, to identify the unknown component. Using an electrophoresis system specialized for separation of LMW hydrophobic proteins, a novel protein of ~5 kDa was identified as a PS II component. Its N-terminal amino acid sequence was identical to that of Ycf12. The corresponding gene is known as one of the ycf (hypothetical chloroplast reading frame) genes, ycf12, and is widely conserved in chloroplast and cyanobacterial genomes. Nonetheless, the localization and function of the gene product have never been assigned. Our finding shows, for the first time, that ycf12 is actually expressed as a component of the PS II complex in the cell, revealing that a previously unidentified transmembrane protein exists in the PS II core complex.

19.
The complete nucleotide sequence of the mulberry (Morus indica cv. K2) chloroplast genome (158,484 bp) has been determined using a combination of long PCR and shotgun-based approaches. This is the third angiosperm tree species whose plastome sequence has been completely deciphered. The circular double-stranded molecule comprises two identical inverted repeats (25,678 bp each) separating a large and a small single-copy region of 87,386 bp and 19,742 bp, respectively. A total of 83 protein-coding genes including five genes duplicated in the inverted repeat regions, eight ribosomal RNA genes and 37 tRNA genes (30 gene species) representing 20 amino acids, were assigned on the basis of homology to predicted genes from other chloroplast genomes. The mulberry plastome lacks the genes infA, sprA, and rpl21 and contains two pseudogenes, ycf15 and ycf68. Comparative analysis based on sequence similarity, at both the gene and the genome level, indicates that Morus is phylogenetically closer to Cucumis and Lotus. However, at the genome level, inclusion of non-coding regions brings it closer to Eucalyptus, followed by Cucumis. This may reflect differential selection pressure operating on the genic and intergenic regions of the chloroplast genome. Communicated by Y. Tsumura.

20.
The chloroplast genomes of most higher plants contain two giant open reading frames designated ycf1 and ycf2. In tobacco, ycf1 potentially specifies a protein of 1901 amino acids. The putative gene product of the ycf2 reading frame is a protein of 2280 amino acids. In an attempt to determine the functions of ycf1 and ycf2, we have constructed several mutant alleles for targeted disruption and/or deletion of these two reading frames. The mutant alleles were introduced into the tobacco plastid genome by biolistic chloroplast transformation to replace the corresponding wild-type alleles by homologous recombination. Chloroplast transformants were obtained for all constructs and tested for their homoplastomic state. We report here that all transformed lines remained heteroplastomic even after repeated cycles of regeneration under high selective pressure. A balanced selection was observed in the presence of the antibiotic spectinomycin, resulting in maintenance of a fairly constant ratio of wild-type versus transformed genome copies. Upon removal of the antibiotic and therewith release of the selective pressure, sorting out towards the wild-type plastid genome occurred in all transplastomic lines. These findings suggest that ycf1 and ycf2 are functional genes and encode products that are essential for cell survival. The two reading frames are thus the first higher plant chloroplast genes identified as being indispensable.
