首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 149 毫秒
1.
We here argue that data from comparative studies of genome size and karyotypes provide important information for planning comparative research on genome evolution. We document for 39 species of sepsids that there is a four‐fold difference in genome size (151–618 Mbp). Mapping genome sizes onto a phylogenetic hypothesis identifies that this range is the result of five genome expansions and four genome contractions that we here define as changes in genome size of more than 50 Mbp. We then generate karyotype data for 10 species and find no changes in chromosome number. The study reveals that the “Oriental” clade of sepsids is a promising system for studying genome evolution because it has experienced three genome expansion events. These events can be compared with an expansion in the “Neotropical” clade in order to reveal the mechanisms that underlie genome expansion in Sepsidae. A review of the literature on genome sizes and karyotypes reveals that they have been poorly documented in Metazoa. This means that researchers interested in the evolution of genome expansions and contractions are currently not being able to identify appropriate target taxa for genome sequencing. We thus argue for more comparative research on genome sizes and karyotypes and point out that historically species were chosen for genome sequencing for reasons not related to genome evolution (e.g. small genome size, model species status, phylogenetic position, interesting phenotypes). We believe that it is now time to use a more genome‐centric selection criterion, where species for whole genome sequencing are selected based on their importance for understanding genome evolution.  相似文献   

2.
高等植物基因组中,大部分序列为非表达序列,基因序列所占的比例很小,了解基因在基因组中的分布是研究基因组结构的一个重要方面。在美国能源部资助下,一个毛果杨无性系的基因组测序已经完成并对公众发布。杨树全基因组序列的完成,为我们了解林木基因组中基因的分布提供了一个特例。在本文中,我们利用泊松分析对杨树基因组中基因在各个染色体上的密度进行了检测,结果表明杨树基因组中各条染色体的基因含量存在显著差异。杨树全基因组测序项目揭示现代杨树基因组起源于一次古全基因组复制事件(称为杨柳科基因组复制),所以杨树基因组不同染色体间存在很大的同源复制片段。但是我们的研究显示,杨树基因组中大多数高度同源的染色体上基因的密度与染色体间的同源性没有明显关系,这说明杨柳科全基因组复制事件后,各个高度同源染色体上的基因发生了流失,且基因流失的速率是不一样的。同时本文还对近九万条毛果杨EST序列进行了比对分析,结果显示这些EST序列覆盖的基因仅占杨树基因组中基因总数的16.8%左右。EST测序虽然是发现基因的一个重要手段,但小规模EST测序对基因的覆盖度很低,所以小规模EST测序的应用价值是有限的。  相似文献   

3.
Venetis C  Theologidis I  Zouros E  Rodakis GC 《Gene》2007,406(1-2):79-90
Species of the marine mussel genus Mytilus are known to contain two mitochondrial genomes, one transmitted maternally (the F genome) and the other paternally (the M genome). The two genomes have diverged by more than 20% in DNA sequence. Here we present the complete sequence of a third genome, genome C, which we found in the sperm of a Mytilus galloprovincialis male. The coding part of the new genome resembles in sequence the F genome, from which it differs by about 2% on average, but differs from the M genome by as much as the F from the M. Its major control region (CR) is more than three times larger than that of the F or the M genome and consists of repeated sequence domains of the CR of the M genome flanked by domains of the CR of the F genome. We present a sequence of events that reconstruct most parsimoniously the derivation of the C genome from the F and M genomes. The sequence consists of a duplication of CR elements of the M genome and subsequent insertion of these tandemly repeated elements in the F genome by recombination. The fact that the C genome was found as the only mitochondrial genome in the sperm of the male from which it was extracted suggests that it is transmitted paternally.  相似文献   

4.
Repetitive DNA variation and pivotal-differential evolution of wild wheats.   总被引:1,自引:0,他引:1  
Several polyploid species in the genus Triticum contain a U genome derived from the diploid T. umbellulatum. In these species, the U genome is considered to be unmodified from the diploid based on chromosome pairing analysis, and it is referred to as pivotal. The additional genome(s) are considered to be modified, and they are thus referred to as differential genomes. The M genome derived from the diploid T. comosum is found in many U genome polyploids. In this study, we cloned three repetitive DNA sequences found primarily in the U genome and two repetitive DNA sequences found primarily in the M genome. We used these to monitor variation for these sequences in a large set of species containing U and M genomes. Investigation of sympatric and allopatric accessions of polyploid species did not show repetitive DNA similarities among sympatric species. This result does not support the idea that the polyploid species are continually exchanging genetic information through introgression. However, it is also possible that repetitive DNA is not a suitable means of addressing the question of introgression. The U genomes of both diploid and polyploid U genome species were similar regarding hybridization patterns observed with U genome probes. Much more variation was found both among diploid T. comosum accessions and polyploids containing M genomes. The observed variation supports the cytogenetic evidence that the M genome is more variable than the U genome. It also raises the possibility that the differential nature of the M genome may be due to variation within the diploid T. comosum, as well as among polyploid M genome species and accessions.  相似文献   

5.
Theoretical and practical advances in genome halving   总被引:4,自引:0,他引:4  
MOTIVATION: Duplication of an organism's entire genome is a rare but spectacular event, enabling the rapid emergence of multiple new gene functions. Over time, the parallel linkage of duplicated genes across chromosomes may be disrupted by reciprocal translocations, while the intra-chromosomal order of genes may be shuffled by inversions and transpositions. Some duplicate genes may evolve unrecognizably or be deleted. As a consequence, the only detectable signature of an ancient duplication event in a modern genome may be the presence of various chromosomal segments containing parallel paralogous genes, with each segment appearing exactly twice in the genome. The problem of reconstructing the linkage structure of an ancestral genome before duplication is known as genome halving with unordered chromosomes. RESULTS: In this paper, we derive a new upper bound on the genome halving distance that is tighter than the best known, and a new lower bound that is almost always tighter than the best known. We also define the notion of genome halving diameter, and obtain both upper and lower bounds for it. Our tighter bounds on genome halving distance yield a new algorithm for reconstructing an ancestral duplicated genome. We create a software package GenomeHalving based on this new algorithm and test it on the yeast genome, identifying a sequence of translocations for halving the yeast genome that is shorter than previously conjectured possible.  相似文献   

6.
Mutational equilibrium model of genome size evolution   总被引:22,自引:0,他引:22  
The paper describes a mutational equilibrium model of genome size evolution. This model is different from both adaptive and junk DNA models of genome size evolution in that it does not assume that genome size is maintained either by positive or stabilizing selection for the optimum genome size (as in adaptive theories) or by purifying selection against too much junk DNA (as in junk DNA theories). Instead the genome size is suggested to evolve until the loss of DNA through more frequent small deletions is equal to the rate of DNA gain through more frequent long insertions. The empirical basis for this theory is the finding of a strong correlation and of a clear power-function relationship between the rate of mutational DNA loss (per bp) through small deletions and genome size in animals. Genome size scales as a negative 1.3 power function of the deletion rate per nucleotide. Such a relationship is not predicted by either adaptive or junk DNA theories. However, if genome size is maintained at equilibrium by the balance of mutational forces, this empirilical relationship can be readily accommodated. Within this framework, this finding would imply that the rate of DNA gain through large insertions scales up a quarter-power function of genome size. On this view, as genome size grows, the rate of growth through large insertions is increasing as a quarter power function of genome size and the rate of DNA loss through small deletions increases linearly, until eventually, at the stable equilibrium genome size value, rates of growth and loss equal each other. The current data also suggest that the long-term variation is genome size in animals is brought about to a significant extent by changes in the intrinsic rates of DNA loss through small deletions. Both the origin of mutational biases and the adaptive consequences of such a mode of evolution of genome size are discussed.  相似文献   

7.
Angiosperms represent one of the key examples of evolutionary success, and their diversity dwarfs other land plants; this success has been linked, in part, to genome size and phenomena such as whole genome duplication events. However, while angiosperms exhibit a remarkable breadth of genome size, evidence linking overall genome size to diversity is equivocal, at best. Here, we show that the rates of speciation and genome size evolution are tightly correlated across land plants, and angiosperms show the highest rates for both, whereas very slow rates are seen in their comparatively species-poor sister group, the gymnosperms. No evidence is found linking overall genome size and rates of speciation. Within angiosperms, both the monocots and eudicots show the highest rates of speciation and genome size evolution, and these data suggest a potential explanation for the megadiversity of angiosperms. It is difficult to associate high rates of diversification with different types of polyploidy, but it is likely that high rates of evolution correlate with a smaller genome size after genome duplications. The diversity of angiosperms may, in part, be due to an ability to increase evolvability by benefiting from whole genome duplications, transposable elements and general genome plasticity.  相似文献   

8.
BACKGROUND: In studies looking at individual polyploid species, the most common patterns of genomic change are that either genome size in the polyploid is additive (i.e. the sum of parental genome donors) or there is evidence of genome downsizing. Reports showing an increase in genome size are rare. In a large-scale analysis of 3008 species, genome downsizing was shown to be a widespread biological response to polyploidy. Polyploidy in the genus Nicotiana (Solanaceae) is common with approx. 40 % of the approx. 75 species being allotetraploid. Recent advances in understanding phylogenetic relationships of Nicotiana species and dating polyploid formation enable a temporal dimension to be added to the analysis of genome size evolution in these polyploids. METHODS: Genome sizes were measured in 18 species of Nicotiana (nine diploids and nine polyploids) ranging in age from <200,000 years to approx. 4.5 Myr old, to determine the direction and extent of genome size change following polyploidy. These data were combined with data from genomic in situ hybridization and increasing amounts of information on sequence composition in Nicotiana to provide insights into the molecular basis of genome size changes. KEY RESULTS AND CONCLUSIONS: By comparing the expected genome size of the polyploid (based on summing the genome size of species identified as either a parent or most closely related to the diploid progenitors) with the observed genome size, four polyploids showed genome downsizing and five showed increases. There was no discernable pattern in the direction of genome size change with age of polyploids, although with increasing age the amount of genome size change increased. In older polyploids (approx. 4.5 million years old) the increase in genome size was associated with loss of detectable genomic in situ hybridization signal, whereas some hybridization signal was still detected in species exhibiting genome downsizing. The possible significance of these results is discussed.  相似文献   

9.
Sorting by weighted reversals, transpositions, and inverted transpositions.   总被引:1,自引:0,他引:1  
During evolution, genomes are subject to genome rearrangements that alter the ordering and orientation of genes on the chromosomes. If a genome consists of a single chromosome (like mitochondrial, chloroplast, or bacterial genomes), the biologically relevant genome rearrangements are (1) inversions--also called reversals--where a section of the genome is excised, reversed in orientation, and reinserted and (2) transpositions, where a section of the genome is excised and reinserted at a new position in the genome; if this also involves an inversion, one speaks of an inverted transposition. To reconstruct ancient events in the evolutionary history of organisms, one is interested in finding an optimal sequence of genome rearrangements that transforms a given genome into another genome. It is well known that this problem is equivalent to the problem of "sorting" a signed permutation into the identity permutation. In this paper, we provide a 1.5-approximation algorithm for sorting by weighted reversals, transpositions and inverted transpositions for biologically realistic weights.  相似文献   

10.
Infection with hepatitis delta virus (HDV) is an important cause of acute and chronic liver disease and can be rapidly fatal. Sequencing of the HDV RNA genome has revealed variability at the C-terminal end of the delta antigen reading frame. One genome type (termed the S genome) synthesizes a 24-kDa protein thought to be required for genome replication. Another genome type (termed the L genome) extends the reading frame by 19 amino acids as a result of a single base change. Replication of the S and L genomes was studied in cultured fibroblasts. While the S genome efficiently initiated genome replication, the L genome did not. Moreover, in a codelivery experiment, L genome RNA inhibited replication of the S genome. Potent trans inhibition was also observed following cotransfection of the S genome and a plasmid encoding the larger delta antigen. Mutational analysis indicated that the inhibitory activity was not a simple function of the large delta antigen reading frame's extra length. Implications for the viral life cycle, clinical infection, and potential treatment are discussed.  相似文献   

11.
Genome phylogenetic analysis based on extended gene contents   总被引:1,自引:0,他引:1  
With the rapid growth of entire genome data, whole-genome approaches such as gene content become popular for genome phylogeny inference, including the tree of life. However, the underlying model for genome evolution is unclear, and the proposed (ad hoc) genome distance measure may violate the additivity. In this article, we formulate a stochastic framework for genome evolution, which provides a basis for defining an additive genome distance. However, we show that it is difficult to utilize the typical gene content data-i.e., the presence or absence of gene families across genomes-to estimate the genome distance. We solve this problem by introducing the concept of extended gene content; that is, the status of a gene family in a given genome could be absence, presence as single copy, or presence as duplicates, any of which can be used to estimate the genome distance and phylogenetic inference. Computer simulation shows that the new tree-making method is efficient, consistent, and fairly robust. The example of 35 microbial complete genomes demonstrates that it is useful not only to study the universal tree of life but also to explore the evolutionary pattern of genomes.  相似文献   

12.
One of the most complex and computationally intensive tasks of genome sequence analysis is genome assembly. Even today, few centres have the resources, in both software and hardware, to assemble a genome from the thousands or millions of individual sequences generated in a whole-genome shotgun sequencing project. With the rapid growth in the number of sequenced genomes has come an increase in the number of organisms for which two or more closely related species have been sequenced. This has created the possibility of building a comparative genome assembly algorithm, which can assemble a newly sequenced genome by mapping it onto a reference genome. We describe here a novel algorithm for comparative genome assembly that can accurately assemble a typical bacterial genome in less than four minutes on a standard desktop computer. The software is available as part of the open-source AMOS project.  相似文献   

13.
Background and aimsGenome size varies considerably across the diversity of plant life. Although genome size is, by definition, affected by genetic presence/absence variants, which are ubiquitous in population sequencing studies, genome size is often treated as an intrinsic property of a species. Here, we studied intra- and interspecific genome size variation in taxonomically complex British eyebrights (Euphrasia, Orobanchaceae). Our aim is to document genome size diversity and investigate underlying evolutionary processes shaping variation between individuals, populations and species.MethodsWe generated genome size data for 192 individuals of diploid and tetraploid Euphrasia and analysed genome size variation in relation to ploidy, taxonomy, population affiliation and geography. We further compared the genomic repeat content of 30 samples.Key resultsWe found considerable intraspecific genome size variation, and observed isolation-by-distance for genome size in outcrossing diploids. Tetraploid Euphrasia showed contrasting patterns, with genome size increasing with latitude in outcrossing Euphrasia arctica, but with little genome size variation in the highly selfing Euphrasia micrantha. Interspecific differences in genome size and the genomic proportions of repeat sequences were small.ConclusionsWe show the utility of treating genome size as the outcome of polygenic variation. Like other types of genetic variation, such as single nucleotide polymorphisms, genome size variation may be affected by ongoing hybridization and the extent of population subdivision. In addition to selection on associated traits, genome size is predicted to be affected indirectly by selection due to pleiotropy of the underlying presence/absence variants.  相似文献   

14.
Accurate base-assignment in repeat regions of a whole genome shotgun assembly is an unsolved problem. Since reads in repeat regions cannot be easily attributed to a unique location in the genome, current assemblers may place these reads arbitrarily. As a result, the base-assignment error rate in repeats is likely to be much higher than that in the rest of the genome. We developed an iterative algorithm, EULER-AIR, that is able to correct base-assignment errors in finished genome sequences in public databases. The Wolbachia genome is among the best finished genomes. Using this genome project as an example, we demonstrated that EULER-AIR can 1) discover and correct base-assignment errors, 2) provide accurate read assignments, 3) utilize finishing reads for accurate base-assignment, and 4) provide guidance for designing finishing experiments. In the genome of Wolbachia, EULER-AIR found 16 positions with ambiguous base-assignment and two positions with erroneous bases. Besides Wolbachia, many other genome sequencing projects have significantly fewer finishing reads and, hence, are likely to contain more base-assignment errors in repeats. We demonstrate that EULER-AIR is a software tool that can be used to find and correct base-assignment errors in a genome assembly project  相似文献   

15.
Extraction of genome sequences from metagenomic data is crucial for reconstructing the metabolism of microbial communities that cannot be mimicked in the laboratory. A complete Methanococcus maripaludis genome was generated from metagenomic data derived from a thermophilic subsurface oil reservoir. M. maripaludis is a hydrogenotrophic methanogenic species that is common in mesophilic saline environments. Comparison of the genome from the thermophilic, subsurface environment with the genome of the type species will provide insight into the adaptation of a methanogenic genome to an oil reservoir environment.  相似文献   

16.
Whole genome engineering is now feasible with the aid of genome editing and synthesis tools. Synthesizing a genome from scratch allows modifications of the genomic structure and function to an extent that was hitherto not possible, which will finally lead to new insights into the basic principles of life and enable valuable applications. With several recent genome synthesis projects as examples, the technical details to synthesize a genome and applications of synthetic genome are addressed in this perspective. A series of ongoing or future synthetic genomics projects, including the different genomes to be synthesized in GP-write, synthetic minimal genome, massively recoded genome, chimeric genome and synthetic genome with expanded genetic alphabet, are also discussed here with a special focus on theoretical and technical impediments in the design and synthesis process. Synthetic genomics will become a commonplace to engineer pathways and genomes according to arbitrary sets of design principles with the development of high-efficient, low-cost genome synthesis and assembly technologies.  相似文献   

17.
Wheat is the most important cereal in the world in terms of acreage and productivity. We sequenced and assembled the plastid genome of one Egyptian wheat cultivar using next-generation sequence data. The size of the plastid genome is 133,873 bp, which is 672 bp smaller than the published plastid genome of “Chinese Spring” cultivar, due mainly to the presence of three sequences from the rice plastid genome. The difference in size between the previously published wheat plastid genome and the sequence reported here is due to contamination of the published genome with rice plastid DNA, most of which is present in three sequences of 332, 131 and 131 bp. The corrected plastid genome of wheat has been submitted to GenBank (accession number KJ592713) and can be used in future comparisons.  相似文献   

18.
LaJeunessse and colleagues (LaJeunesse et al. 2005) have recently documented small genome sizes of Symbiodinium and concluded that Symbiodinium is a dinoflagellate lineage with the smallest genome. The conclusion is inconsistent with recent discoveries of picoplanktonic dinoflagellates. The search for the smallest genome and the effort to understand the evolutionary history of dinoflagellate genome should be an area of research in the years to come, which can be greatly aided by an understanding on the current hypotheses regarding mechanisms of genome size evolution. Even the smallest dinoflagellate genome documented to date is too large to be sequenced with current technology, but sequencing of chromosomes or expressed genes of key representative species is feasible and can be very insightful for understanding genome composition and function in this important lineage of eukaryotes.  相似文献   

19.
Genome content analysis has been used as a source of phylogenetic information in large prokaryotic tree of life studies. Recently the sequencing of many eukaryotic genomes has allowed for the similar use of genome content analysis for these organisms too. In this communication we examine the utility of genome content analysis for recovering phylogenetic patterns in several eukaryotic groups. By constructing multiple matrices using different e value cutoffs we examine the dynamics of altering the e value cutoff on five eukaryotic genome data sets. Our analysis indicates that the e value cutoff that is used as a criterion in the construction of the genome content matrix is a critical factor in both the accuracy and information content of the analysis. Strikingly, genome content by itself is not a reliable or accurate source of characters for phylogenetic analysis of the taxa in the five data sets we analyzed. We discuss two problems--small genome attraction and genome duplications as being involved in the rather poor performance of genome content data in recovering eukaryotic phylogeny.  相似文献   

20.
Moso bamboo (Phyllostachys pubescens) is one of the world’s most important bamboo species. It has the largest area of all planted bamboo—over two-thirds of the total bamboo forest area—and the highest economic value in China. Moso bamboo is a tetraploid (4x=48) and a special member of the grasses family. Although several genomes have been sequenced or are being sequenced in the grasses family, we know little about the genome of the bambusoids (bamboos). In this study, the moso bamboo genome size was estimated to be about 2034 Mb by flow cytometry (FCM), using maize (cv. B73) and rice (cv. Nipponbare) as internal references. The rice genome has been sequenced and the maize genome is being sequenced. We found that the size of the moso bamboo genome was similar to that of maize but significantly larger than that of rice. To determine whether the bamboo genome had a high proportion of repeat elements, similar to that of the maize genome, approximately 1000 genome survey sequences (GSS) were generated. Sequence analysis showed that the proportion of repeat elements was 23.3% for the bamboo genome, which is significantly lower than that of the maize genome (65.7%). The bamboo repeat elements were mainly Gypsy/DIRS1 and Ty1/Copia LTR retrotransposons (14.7%), with a few DNA transposons. However, more genomic sequences are needed to confirm the above results due to several factors, such as the limitation of our GSS data. This study is the first to investigate sequence composition of the bamboo genome. Our results are valuable for future genome research of moso and other bamboos.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号