首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Saccharomyces cerevisiae is one of the premier model systems for studying the genomics and evolution of transposable elements. The availability of the S. cerevisiae genome led to unprecedented insights into its five known transposable element families (the LTR retrotransposons Ty1-Ty5) in the years shortly after its completion. However, subsequent advances in bioinformatics tools for analysing transposable elements and the recent availability of genome sequences for multiple strains and species of yeast motivates new investigations into Ty evolution in S. cerevisiae. Here we provide a comprehensive phylogenetic and population genetic analysis of all Ty families in S. cerevisiae based on a systematic re-annotation of Ty elements in the S288c reference genome. We show that previous annotation efforts have underestimated the total copy number of Ty elements for all known families. In addition, we identify a new family of Ty3-like elements related to the S. paradoxus Ty3p which is composed entirely of degenerate solo LTRs. Phylogenetic analyses of LTR sequences identified three families with short-branch, recently active clades nested among long branch, inactive insertions (Ty1, Ty3, Ty4), one family with essentially all recently active elements (Ty2) and two families with only inactive elements (Ty3p and Ty5). Population genomic data from 38 additional strains of S. cerevisiae show that the majority of Ty insertions in the S288c reference genome are fixed in the species, with insertions in active clades being predominantly polymorphic and insertions in inactive clades being predominantly fixed. Finally, we use comparative genomic data to provide evidence that the Ty2 and Ty3p families have arisen in the S. cerevisiae genome by horizontal transfer. Our results demonstrate that the genome of a single individual contains important information about the state of TE population dynamics within a species and suggest that horizontal transfer may play an important role in shaping the genomic diversity of transposable elements in unicellular eukaryotes.  相似文献   

2.
Single-read sequence analysis of the termini of eight randomly picked clones ofAshbya gossypii genomic DNA revealed seven sequences with homology toSaccharomyces cerevisiae genes (15% to 69% on the amino acid level). One of these sequences appeared to code for the carboxy-terminus of threonine synthase, the product of theS. cerevisiae THR4 gene (52.4% identity over 82 amino acids). We cloned and sequenced the complete putativeAgTHR4 gene ofA. gossypii. It comprises 512 codons, two less than theS. cerevisiae THR4 gene. Overall identity at the amino acid sequence level is 67.4%. A continuous stretch of 32 amino acids displaying complete identity between these two fungal threonine synthases presumably contains the pyridoxal phosphate attachment site. Disruption of theA. gossypii gene led to threonine auxotrophy, which could be complemented by transformation with replicating plasmids carrying theAgTHR4 gene and variousS. cerevisiae ARS elements. Using these plasmids only very weak complementation of aS. cerevisiae thr4 mutation was observed. Investigation of sequences adjacent to theAgTHR4 gene identified three additional ORFs. Surprisingly, the order and orientation of these four ORFs is conserved inA. gossypii andS. cerevisiae.  相似文献   

3.
DNA clones containing foldback sequences, derived from Physarum polycephalum nuclear DNA, can be classified according to their pattern of hydridisation to Southern blots of genomic DNA. One group of DNA clones map to unique DNA loci when used as a probe to restriction digests of Physarum nuclear DNA. These cloned segments appear to contain dispersed repetitive sequence elements located at many hundreds of sites in the genome. Similar patterns of hybridisation are generated when these cloned DNA probes are annealed to DNA restriction fragments of genomic DNA obtained from a number of different Physarum strains, indicating that no detectable alteration has occurred at these genomic loci subsequent to the divergence of the strains as a result of the introduction or deletion of mobile genetic elements. However, deletion of segments of some cloned DNA fragments occurs following their propagation in Escherichia coli. A second, distinct group of clones are shown to be derived from highly methylated segments of Physarum DNA which contain very abundant repetitive sequences with regular, though complex, arrangements of restriction sites at their various genomic locations. It is suggested that these DNA segments contain clustered repetitive sequence elements. The results lead to the conclusion that foldback elements in Physarum DNA are located in segments of the genome which display markedly different patterns of sequence organisation and degree of DNA methylation.  相似文献   

4.
Despite recent advances in genome editing capabilities for the model organism Saccharomyces cerevisiae, the chromosomal integration of large biochemical pathways for stable industrial production remains challenging. In this work, we developed a simple platform for high-efficiency, single-step, markerless, multi-copy chromosomal integration of full biochemical pathways in Saccharomyces cerevisiae. In this Di-CRISPR (delta integration CRISPR-Cas) platform based on the Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) and CRISPR-associated systems (Cas), we specifically designed guide RNA sequences to target multiple delta sites in the yeast genome. The generation of double stranded breaks at the delta sites allowed simultaneous integration of multiple copies of linearized donor DNA containing large biochemical pathways. With our newly developed Di-CRISPR platform, we were able to attain highly efficient and markerless integration of large biochemical pathways and achieve an unprecedented 18-copy genomic integration of a 24 kb combined xylose utilization and (R,R)-2,3-butanediol (BDO) production pathway in a single step, thus generating a strain that was able to produce BDO directly from xylose. The simplicity and high efficiency of the Di-CRISPR platform could provide a superior alternative to high copy plasmids and would render this platform an invaluable tool for genome editing and metabolic engineering in S. cerevisiae.  相似文献   

5.
Single-read sequence analysis of the termini of eight randomly picked clones ofAshbya gossypii genomic DNA revealed seven sequences with homology toSaccharomyces cerevisiae genes (15% to 69% on the amino acid level). One of these sequences appeared to code for the carboxy-terminus of threonine synthase, the product of theS. cerevisiae THR4 gene (52.4% identity over 82 amino acids). We cloned and sequenced the complete putativeAgTHR4 gene ofA. gossypii. It comprises 512 codons, two less than theS. cerevisiae THR4 gene. Overall identity at the amino acid sequence level is 67.4%. A continuous stretch of 32 amino acids displaying complete identity between these two fungal threonine synthases presumably contains the pyridoxal phosphate attachment site. Disruption of theA. gossypii gene led to threonine auxotrophy, which could be complemented by transformation with replicating plasmids carrying theAgTHR4 gene and variousS. cerevisiae ARS elements. Using these plasmids only very weak complementation of aS. cerevisiae thr4 mutation was observed. Investigation of sequences adjacent to theAgTHR4 gene identified three additional ORFs. Surprisingly, the order and orientation of these four ORFs is conserved inA. gossypii andS. cerevisiae.  相似文献   

6.
Due to the difficulty of multiple deletions using the Cre/loxP system, a simple, markerless multiple-deletion method based on a Cre/mutant lox system combining a right-element (RE) mutant lox site with a left-element (LE) mutant lox site was employed for large-scale genome rearrangements in Corynebacterium glutamicum. Eight distinct genomic regions that had been identified previously by comparative analysis of C. glutamicum R and C. glutamicum 13032 genomes were targeted for deletion. By homologous recombination, LE and RE mutant lox sites were integrated at each end of a target region. Highly efficient and accurate deletions between the two chromosomal mutant lox sites in the presence of Cre recombinase were realized. A deletion mutant lacking 190 kb of chromosomal regions, encoding a total of 188 open reading frames (ORFs), was obtained. These deletions represent the largest genomic excisions in C. glutamicum reported to date. Despite the loss of numerous predicted ORFs, the mutant exhibited normal growth under standard laboratory conditions. The Cre/loxP system using a pair of mutant lox sites provides a new, efficient genome rearrangement technique for C. glutamicum. It should facilitate the understanding of genome functions of microorganisms.  相似文献   

7.
The SED1 gene (YDR077W), coding for the major cell wall glycoprotein of Saccharomyces cerevisiae stationary-phase cells, contains two blocks of tandem repeat units located within two distinct regions of the nucleotide sequence. A PCR survey of the SED1 open reading frames (ORFs) of 186 previously uncharacterized grape must isolates of S. cerevisiae yielded 13 PCR profiles arising from different combinations of seven SED1 length variants in individuals homozygous or heterozygous for the gene. Comparison of the nucleotide sequences of a group of representatives of each of the seven length variants with those of S288C and the type strain, CBS1171, unequivocally identified them as SED1 alleles and provided evidence for the presence of two minisatellite-like sequences, variable in length, within the ORF of an S. cerevisiae gene. The segregation analyses of the SED1 length variants and other genetic markers in 13 isolates representative of each PCR profile suggested that molecular mechanisms involved in minisatellite expansion and contraction may be responsible for SED1 heterozygosities within a population of homothallic must isolates of S. cerevisiae.  相似文献   

8.
The organization of the 5S RNA gene cluster of Drosophila melanogaster is different in two Oregon R stocks that have been separated for a number of years. The Oregon R Yale population contains various different arrangements of the cluster. One of these is due to the insertion of a B104 element near one end of the cluster. Other arrangements lack the B104 insertion and have instead a variety of deletions originating in the vicinity of the B104 insertion site and removing from 0 to 60% of the 5S RNA genes without affecting nearby tRNA genes. In contrast, the Oregon R Heidelberg population has no B104 element in the 5S gene cluster and no heterogeneity in the arrangement of the cluster. We propose that transposable elements inserted at a genomic locus generate heterogeneity in a population at that locus due to excision of the element with and without accompanying deletions of flanking sequences. As a consequence, a fly population would accumulate a large number of deletions scattered throughout the genome in as many loci as contain transposable elements. We show further that D. melanogaster contains a large redundancy of 5S RNA genes since the 60% deletion of the cluster shows no visible phenotype when homozygous or when heterozygous against a total deletion of the entire 5S gene cluster.  相似文献   

9.
An unexpectedly high number of regulatory RNAs have been recently discovered that fine-tune the function of genes at all levels of expression. We employed Genomic SELEX, a method to identify protein-binding RNAs encoded in the genome, to search for further regulatory RNAs in Escherichia coli. We used the global regulator protein Hfq as bait, because it can interact with a large number of RNAs, promoting their interaction. The enriched SELEX pool was subjected to deep sequencing, and 8865 sequences were mapped to the E. coli genome. These short sequences represent genomic Hfq-aptamers and are part of potential regulatory elements within RNA molecules. The motif 5′-AAYAAYAA-3′ was enriched in the selected RNAs and confers low-nanomolar affinity to Hfq. The motif was confirmed to bind Hfq by DMS footprinting. The Hfq aptamers are 4-fold more frequent on the antisense strand of protein coding genes than on the sense strand. They were enriched opposite to translation start sites or opposite to intervening sequences between ORFs in operons. These results expand the repertoire of Hfq targets and also suggest that Hfq might regulate the expression of a large number of genes via interaction with cis-antisense RNAs.  相似文献   

10.
The characterization and public release of genome sequences from thousands of organisms is expanding the scope for genetic variation studies. However, understanding the phenotypic consequences of genetic variation remains a challenge in eukaryotes due to the complexity of the genotype-phenotype map. One approach to this is the intensive study of model systems for which diverse sources of information can be accumulated and integrated. Saccharomyces cerevisiae is an extensively studied model organism, with well-known protein functions and thoroughly curated phenotype data. To develop and expand the available resources linking genomic variation with function in yeast, we aim to model the pan-genome of S. cerevisiae. To initiate the yeast pan-genome, we newly sequenced or re-sequenced the genomes of 25 strains that are commonly used in the yeast research community using advanced sequencing technology at high quality. We also developed a pipeline for automated pan-genome analysis, which integrates the steps of assembly, annotation, and variation calling. To assign strain-specific functional annotations, we identified genes that were not present in the reference genome. We classified these according to their presence or absence across strains and characterized each group of genes with known functional and phenotypic features. The functional roles of novel genes not found in the reference genome and associated with strains or groups of strains appear to be consistent with anticipated adaptations in specific lineages. As more S. cerevisiae strain genomes are released, our analysis can be used to collate genome data and relate it to lineage-specific patterns of genome evolution. Our new tool set will enhance our understanding of genomic and functional evolution in S. cerevisiae, and will be available to the yeast genetics and molecular biology community.  相似文献   

11.
12.
A test for nucleotide sequence homology   总被引:3,自引:0,他引:3  
Two macromolecular sequences which have evolved from a common ancestor sequence will tend to include a large number of elements unaffected by replacement mutations in both sequences, as long as the evolutionary rate is not too high or the divergence time is not too great. The positions of corresponding elements may have changed in either daughter sequence due to deletion/insertion mutations involving other sequence elements, but their order can be expected to be the same in both sequences. These sets of correspondences, called matches, may be computed by a recursive algorithm which incorporates constraints on the number of deletion/insertion mutations hypothesized to have occurred. A test is developed which computes the significance of each deletion/insertion hypothesized, based on Monte-Carlo sampling of random sequences with the same base composition as the experimental sequences being tested. Applying the test to 5 S RNAs confirms the relation of Escherichia coli and KB carcinoma 5 S RNAs and establishes the previously undetected homology between Pseudomonas fluorescens and KB 5 S RNAs.  相似文献   

13.

Background

Spirodela polyrhiza is a species of the order Alismatales, which represent the basal lineage of monocots with more ancestral features than the Poales. Its complete sequence of the mitochondrial (mt) genome could provide clues for the understanding of the evolution of mt genomes in plant.

Methods

Spirodela polyrhiza mt genome was sequenced from total genomic DNA without physical separation of chloroplast and nuclear DNA using the SOLiD platform. Using a genome copy number sensitive assembly algorithm, the mt genome was successfully assembled. Gap closure and accuracy was determined with PCR products sequenced with the dideoxy method.

Conclusions

This is the most compact monocot mitochondrial genome with 228,493 bp. A total of 57 genes encode 35 known proteins, 3 ribosomal RNAs, and 19 tRNAs that recognize 15 amino acids. There are about 600 RNA editing sites predicted and three lineage specific protein-coding-gene losses. The mitochondrial genes, pseudogenes, and other hypothetical genes (ORFs) cover 71,783 bp (31.0%) of the genome. Imported plastid DNA accounts for an additional 9,295 bp (4.1%) of the mitochondrial DNA. Absence of transposable element sequences suggests that very few nuclear sequences have migrated into Spirodela mtDNA. Phylogenetic analysis of conserved protein-coding genes suggests that Spirodela shares the common ancestor with other monocots, but there is no obvious synteny between Spirodela and rice mtDNAs. After eliminating genes, introns, ORFs, and plastid-derived DNA, nearly four-fifths of the Spirodela mitochondrial genome is of unknown origin and function. Although it contains a similar chloroplast DNA content and range of RNA editing as other monocots, it is void of nuclear insertions, active gene loss, and comprises large regions of sequences of unknown origin in non-coding regions. Moreover, the lack of synteny with known mitochondrial genomic sequences shed new light on the early evolution of monocot mitochondrial genomes.  相似文献   

14.
The annotation of the well-studied organism, Saccharomyces cerevisiae, has been improving over the past decade while there are unresolved debates over the amount of biologically significant open reading frames (ORFs) in yeast genome. We revisited the total count of protein-coding genes in S. cerevisiae S288c genome using a theoretical approach by combining the Support Vector Machine (SVM) method with six widely used measurements of sequence statistical features. The accuracy of our method is over 99.5% in 10-fold cross-validation. Based on the annotation data in Saccharomyces Genome Database (SGD), we studied the coding capacity of all 1744 ORFs which lack experimental results and suggested that the overall number of chromosomal ORFs encoding proteins in yeast should be 6091 by removing 488 spurious ORFs. The importance of the present work lies in at least two aspects. First, cross-validation and retrospective examination showed the fidelity of our method in recognizing ORFs that likely encode proteins. Second, we have provided a web service that can be accessed at http://cobi.uestc.edu.cn/services/yeast/, which enables the prediction of protein-coding ORFs of the genus Saccharomyces with a high accuracy.  相似文献   

15.

Background

Rigorous study of mitochondrial functions and cell biology in the budding yeast, Saccharomyces cerevisiae has advanced our understanding of mitochondrial genetics. This yeast is now a powerful model for population genetics, owing to large genetic diversity and highly structured populations among wild isolates. Comparative mitochondrial genomic analyses between yeast species have revealed broad evolutionary changes in genome organization and architecture. A fine-scale view of recent evolutionary changes within S. cerevisiae has not been possible due to low numbers of complete mitochondrial sequences.

Results

To address challenges of sequencing AT-rich and repetitive mitochondrial DNAs (mtDNAs), we sequenced two divergent S. cerevisiae mtDNAs using a single-molecule sequencing platform (PacBio RS). Using de novo assemblies, we generated highly accurate complete mtDNA sequences. These mtDNA sequences were compared with 98 additional mtDNA sequences gathered from various published collections. Phylogenies based on mitochondrial coding sequences and intron profiles revealed that intraspecific diversity in mitochondrial genomes generally recapitulated the population structure of nuclear genomes. Analysis of intergenic sequence indicated a recent expansion of mobile elements in certain populations. Additionally, our analyses revealed that certain populations lacked introns previously believed conserved throughout the species, as well as the presence of introns never before reported in S. cerevisiae.

Conclusions

Our results revealed that the extensive variation in S. cerevisiae mtDNAs is often population specific, thus offering a window into the recent evolutionary processes shaping these genomes. In addition, we offer an effective strategy for sequencing these challenging AT-rich mitochondrial genomes for small scale projects.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1664-4) contains supplementary material, which is available to authorized users.  相似文献   

16.
Genome synthesis endows scientists the ability of de novo creating genomes absent in nature, by thorough redesigning DNA sequences and introducing numerous custom features. However, the genome synthesis is a labor‐ and time‐consuming work, and thus it is a challenge to verify and quantify the synthetic genome rapidly and precisely. Thus, specific DNA sequences different from native genomic sequences are designed into synthetic genomes during synthesis, namely genomic markers. Genomic markers can be easily detected by PCR reaction, whole‐genome sequencing (WGS) and a variety of methods to identify the synthetic genome from native one. Here, we review types and applications of genomic markers utilized in synthetic genomes, with the hope of providing a guidance for future works.  相似文献   

17.

Key message

Simple sequence repeat motifs were mined from the genome and EST sequences of Morus notabilis and archived in MulSatDB. Bioinformatics tools were integrated with the database for the analysis of genomic datasets.

Abstract

Mulberry is a crop of economic importance in sericulture, which shapes the lives of millions of rural people among different Eurasian and Latin American countries. Limited availability of genomic resources has constrained the molecular breeding efforts in mulberry, a poorly studied crop. Microsatellite or simple sequence repeat (SSR) has revolutionized the plant breeding and is used in linkage mapping, association studies, diversity, and parentage analysis, etc. Recent availability of mulberry whole genome assembly provided an opportunity for the development of mulberry-specific DNA markers. In this study, we mined a total of 217,312 microsatellites from whole genome and 961 microsatellites from EST sequences of Morus notabilis. Mono-repeats were predominant among both whole genome and EST sequences. The SSR containing EST sequences were functionally annotated, and SSRs mined from whole genome were mapped on chromosomes of the phylogenetically related genus—Fragaria vesca, to aid the selection of markers based on the function and location. All the mined markers were archived in the mulberry microsatellite database (MulSatDB), and the markers can be retrieved based on different criteria like marker location, repeat kind, motif type and size. Primer3plus and CMap tools are integrated with the database to design primers for PCR amplification and to visualize markers on F. vesca chromosomes, respectively. A blast tool is also integrated to collate new markers with the database. MulSatDB is the first and complete destination for mulberry researchers to browse SSR markers, design primers, and locate markers on strawberry chromosomes. MulSatDB is freely accessible at http://btismysore.in/mulsatdb.  相似文献   

18.
One of the defining features of the fermentation process used in the production of bioethanol from sugarcane feedstock is the dynamic nature of the yeast population. Minisatellite molecular markers are particularly useful for monitoring yeast communities because they produce polymorphic PCR products that typically display wide size variations. We compared the coding sequences derived from the genome of the sugarcane bioethanol strain JAY270/PE-2 to those of the reference Saccharomyces cerevisiae laboratory strain S288c, and searched for genes containing insertion or deletion polymorphisms larger than 24 bp. We then designed oligonucleotide primers flanking nine of these sites, and used them to amplify differentially sized PCR products. We analyzed the banding patterns in the most widely adopted sugarcane bioethanol strains and in several indigenous yeast contaminants, and found that our marker set had very good discriminatory power. Subsequently, these markers were used to successfully monitor the yeast cell populations in six sugarcane bioethanol distilleries. Additionally, we showed that most of the markers described here are also polymorphic among strains unrelated to bioethanol production, suggesting that they may be applied universally in S. cerevisiae. Because the relatively large polymorphisms are detectable in conventional agarose gels, our method is well suited to modestly equipped on-site laboratories at bioethanol distilleries, therefore providing both cost and time savings.  相似文献   

19.
Arbuscular mycorrhizal fungi (AMF) are common and important plant symbionts. They have coenocytic hyphae and form multinucleated spores. The nuclear genome of AMF is polymorphic and its organization is not well understood, which makes the development of reliable molecular markers challenging. In stark contrast, their mitochondrial genome (mtDNA) is homogeneous. To assess the intra- and inter-specific mitochondrial variability in closely related Glomus species, we performed 454 sequencing on total genomic DNA of Glomus sp. isolate DAOM-229456 and we compared its mtDNA with two G. irregulare isolates. We found that the mtDNA of Glomus sp. is homogeneous, identical in gene order and, with respect to the sequences of coding regions, almost identical to G. irregulare. However, certain genomic regions vary substantially, due to insertions/deletions of elements such as introns, mitochondrial plasmid-like DNA polymerase genes and mobile open reading frames. We found no evidence of mitochondrial or cytoplasmic plasmids in Glomus species, and mobile ORFs in Glomus are responsible for the formation of four gene hybrids in atp6, atp9, cox2, and nad3, which are most probably the result of horizontal gene transfer and are expressed at the mRNA level. We found evidence for substantial sequence variation in defined regions of mtDNA, even among closely related isolates with otherwise identical coding gene sequences. This variation makes it possible to design reliable intra- and inter-specific markers.  相似文献   

20.
InSaccharomyces cerevisiae the only known role of theCBP2 gene is the excision of the fifth intron of the mitochondrialcyt b gene (bI5). We have cloned theCBP2 gene fromSaccharomyces douglasii (a close relative ofS. cerevisiae). A comparison of theS. douglasii andS. cerevisiae sequences shows that there are 14% nucleotide substitutions in the coding region, with transitions being three times more frequent than transversions. At the protein level sequence identity is 87%. We have demonstrated that theS. douglasii CBP2 gene is essential for respiratory growth in the presence of a wild-typeS. douglasii mitochondrial genome, but not in the presence of an intronlessS. cerevisiae mitochondrial genome. Also theS. douglasii andS. cerevisiae CBP2 genes are completely interchangeable, even though the intron bI5 is absent from theS. douglasii mitochondrial genome.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号