首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
X Zhao  Y Tian  R Yang  H Feng  Q Ouyang  Y Tian  Z Tan  M Li  Y Niu  J Jiang  G Shen  R Yu 《BMC genomics》2012,13(1):435
ABSTRACT: BACKGROUND: Relationship between the level of repetitiveness in genomic sequence and genome size has been investigated by making use of complete prokaryotic and eukaryotic genomes, but relevant studies have been rarely made in virus genomes. RESULTS: In this study, a total of 257 viruses were examined, which cover 90% of genera. The results showed that simple sequence repeats (SSRs) is strongly, positively and significantly correlated with genome size. Certain repeat class is distributed in a certain range of genome sequence length. Mono-, di- and tri- repeats are widely distributed in all virus genomes, tetra- SSRs as a common component consist in genomes which more than 100 kb in size; in the range of genome < 100 kb, genomes containing penta- and hexa- SSRs are not more than 50%. Principal components analysis (PCA) indicated that dinucleotide repeat affects the differences of SSRs most strongly among virus genomes. Results showed that SSRs tend to accumulate in larger virus genomes; and the longer genome sequence, the longer repeat units. CONCLUSIONS: We conducted this research standing on the height of the whole virus. We concluded that genome size is an important factor in affecting the occurrence of SSRs; hosts are also responsible for the variances of SSRs content to a certain degree.  相似文献   

2.
MOTIVATION: One of the major features of genomic DNA sequences, distinguishing them from texts in most spoken or artificial languages, is their high repetitiveness. Variation in the repetitiveness of genomic texts reflects the presence and density of different biologically important messages. Thus, deviation from an expected number of repeats in both directions indicates a possible presence of a biological signal. Linguistic complexity corresponds to repetitiveness of a genomic text, and potential regulatory sites may be discovered through construction of typical patterns of complexity distribution. RESULTS: We developed software for fast calculation of linguistic sequence complexity of DNA sequences. Our program utilizes suffix trees to compute the number of subwords present in genomic sequences, thereby allowing calculation of linguistic complexity in time linear in genome size. The measure of linguistic complexity was applied to the complete genome of Haemophilus influenzae. Maps of complexity along the entire genome were obtained using sliding windows of 40, 100, and 2000 nucleotides. This approach provided an efficient way to detect simple sequence repeats in this genome. In addition, local profiles of complexity distribution around the starts of translation were constructed for 21 complete prokaryotic genomes. We hypothesize that complexity profiles correspond to evolutionary relationships between organisms. We found principal differences in profiles of the GC-rich and other (non-GC-rich) genomes. We also found characteristic differences in profiles of AT genomes, which probably reflect individual species variations in translational regulation. AVAILABILITY: The program is available upon request from Alexander Bolshoy or at http://csweb.haifa.ac.il/library/#complex.  相似文献   

3.
Bosco G  Campbell P  Leiva-Neto JT  Markow TA 《Genetics》2007,177(3):1277-1290
The size of eukaryotic genomes can vary by several orders of magnitude, yet genome size does not correlate with the number of genes nor with the size or complexity of the organism. Although "whole"-genome sequences, such as those now available for 12 Drosophila species, provide information about euchromatic DNA content, they cannot give an accurate estimate of genome sizes that include heterochromatin or repetitive DNA content. Moreover, genome sequences typically represent only one strain or isolate of a single species that does not reflect intraspecies variation. To more accurately estimate whole-genome DNA content and compare these estimates to newly assembled genomes, we used flow cytometry to measure the 2C genome values, relative to Drosophila melanogaster. We estimated genome sizes for the 12 sequenced Drosophila species as well as 91 different strains of 38 species of Drosophilidae. Significant differences in intra- and interspecific 2C genome values exist within the Drosophilidae. Furthermore, by measuring polyploid 16C ovarian follicle cell underreplication we estimated the amount of satellite DNA in each of these species. We found a strong correlation between genome size and amount of satellite underreplication. Addition and loss of heterochromatin satellite repeat elements appear to have made major contributions to the large differences in genome size observed in the Drosophilidae.  相似文献   

4.
Analyses of diverse eukaryotes reveal that genomes are dynamic, sometimes dramatically so. In numerous lineages across the eukaryotic tree of life, DNA content varies within individuals throughout life cycles and among individuals within species. Discovery of examples of genome dynamism is accelerating as genome sequences are completed from diverse eukaryotes. Though much is known about genomes in animals, fungi, and plants, these lineages represent only 3 of the 60-200 lineages of eukaryotes. Here, we discuss diverse genomic strategies in exemplar eukaryotic lineages, including numerous microbial eukaryotes, to reveal dramatic variation that challenges established views of genome evolution. For example, in the life cycle of some members of the "radiolaria," ploidy increases from haploid (N) to approximately 1,000N, whereas intrapopulation variability of the enteric parasite Entamoeba ranges from 4N to 40N. Variation has also been found within our own species, with substantial differences in both gene content and chromosome lengths between individuals. Data on the dynamic nature of genomes shift the perception of the genome from being fixed and characteristic of a species (typological) to plastic due to variation within and between species.  相似文献   

5.
The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the ‘between gene’ GC content heterogeneity, which is linked to ‘isochores’, is a principal factor associated with the bias in substitution patterns in human, ‘within gene’ heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed.  相似文献   

6.
B chromosomes and genome size in flowering plants.   总被引:2,自引:0,他引:2  
B chromosomes are extra chromosomes found in some, but not all, individuals within a species, often maintained by giving themselves an advantage in transmission, i.e. they drive. Here we show that the presence of B chromosomes correlates to and varies strongly and positively with total genome size (excluding the Bs and corrected for ploidy) both at a global level and via a comparison of independent taxonomic contrasts. B chromosomes are largely absent from species with small genomes; however, species with large genomes are studied more frequently than species with small genomes and Bs are more likely to be reported in well-studied species. We controlled for intensity of study using logistic regression. This regression analysis also included effects of degree of outbreeding, which is positively associated with Bs and genome size, and chromosome number, which is negatively associated with Bs and genome size, as well as variable ploidy (more than one ploidy level in a species). Genome size, breeding system and chromosome number all contribute independently to the distribution of B chromosomes, while variable ploidy does not have a significant effect. The genome size correlates are consistent with reduced selection against extra DNA in species with large genomes and with increased generation of B sequences from large A genomes.  相似文献   

7.
Transposable elements and the evolution of genome size in eukaryotes   总被引:32,自引:2,他引:30  
Kidwell MG 《Genetica》2002,115(1):49-63
It is generally accepted that the wide variation in genome size observed among eukaryotic species is more closely correlated with the amount of repetitive DNA than with the number of coding genes. Major types of repetitive DNA include transposable elements, satellite DNAs, simple sequences and tandem repeats, but reliable estimates of the relative contributions of these various types to total genome size have been hard to obtain. With the advent of genome sequencing, such information is starting to become available, but no firm conclusions can yet be made from the limited data currently available. Here, the ways in which transposable elements contribute both directly and indirectly to genome size variation are explored. Limited evidence is provided to support the existence of an approximately linear relationship between total transposable element DNA and genome size. Copy numbers per family are low and globally constrained in small genomes, but vary widely in large genomes. Thus, the partial release of transposable element copy number constraints appears to be a major characteristic of large genomes.  相似文献   

8.
Genome size and complexity vary tremendously among eukaryotic species and their organelles. Comparisons across deeply divergent eukaryotic lineages have suggested that variation in mutation rates may explain this diversity, with increased mutational burdens favoring reduced genome size and complexity. The discovery that mitochondrial mutation rates can differ by orders of magnitude among closely related angiosperm species presents a unique opportunity to test this hypothesis. We sequenced the mitochondrial genomes from two species in the angiosperm genus Silene with recent and dramatic accelerations in their mitochondrial mutation rates. Contrary to theoretical predictions, these genomes have experienced a massive proliferation of noncoding content. At 6.7 and 11.3 Mb, they are by far the largest known mitochondrial genomes, larger than most bacterial genomes and even some nuclear genomes. In contrast, two slowly evolving Silene mitochondrial genomes are smaller than average for angiosperms. Consequently, this genus captures approximately 98% of known variation in organelle genome size. The expanded genomes reveal several architectural changes, including the evolution of complex multichromosomal structures (with 59 and 128 circular-mapping chromosomes, ranging in size from 44 to 192 kb). They also exhibit a substantial reduction in recombination and gene conversion activity as measured by the relative frequency of alternative genome conformations and the level of sequence divergence between repeat copies. The evolution of mutation rate, genome size, and chromosome structure can therefore be extremely rapid and interrelated in ways not predicted by current evolutionary theories. Our results raise the hypothesis that changes in recombinational processes, including gene conversion, may be a central force driving the evolution of both mutation rate and genome structure.  相似文献   

9.
Hemiascomycete yeasts cover an evolutionary span comparable to that of the entire phylum of chordates. Since this group currently contains the largest number of complete genome sequences it presents unique opportunities to understand the evolution of genome organization in eukaryotes. We inferred rates of genome instability on all branches of a phylogenetic tree for 11 species and calculated species-specific rates of genome rearrangements. We characterized all inversion events that occurred within synteny blocks between six representatives of the different lineages. We show that the rates of macro- and microrearrangements of gene order are correlated within individual lineages but are highly variable across different lineages. The most unstable genomes correspond to the pathogenic yeasts Candida albicans and Candida glabrata. Chromosomal maps have been intensively shuffled by numerous interchromosomal rearrangements, even between species that have retained a very high physical fraction of their genomes within small synteny blocks. Despite this intensive reshuffling of gene positions, essential genes, which cluster in low recombination regions in the genome of Saccharomyces cerevisiae, tend to remain syntenic during evolution. This work reveals that the high plasticity of eukaryotic genomes results from rearrangement rates that vary between lineages but also at different evolutionary times of a given lineage.  相似文献   

10.
Transposable elements (TEs) have been identified in every organism in which they have been looked for. The sequencing of large genomes, such as the human genome and those of Drosophila, Arabidopsis, Caenorhabditis, has also shown that they are a major constituent of these genomes, accounting for 15% of the genome of Drosophila, 45% of the human genome, and more than 70% in some plants and amphibians. Compared with the 1% of genomic DNA dedicated to protein-coding sequences in the human genome, this has prompted various researchers to suggest that the TEs and the other repetitive sequences that constitute the so-called "noncoding DNA", are where the most stimulating discoveries will be made in the future (Bromham, 2002). We are therefore getting further and further from the original idea that this DNA was simply "junk DNA", that owed its presence in the genome entirely to its capacity for selfish transposition. Our understanding of the structures of TEs, their distribution along the genomes, their sequence and insertion polymorphisms within genomes, and within and between populations and species, their impact on genes and on the regulatory mechanisms of genetic expression, their effects on exon shuffling and other phenomena that reshape the genome, and their impact on genome size has increased dramatically in recent years. This leads to a more general picture of the impact of TEs on genomes, though many copies are still mainly selfish or junk DNA. In this review we focus mainly on discoveries made in Drosophila, but we also use information about other genomes when this helps to elucidate the general processes involved in the organization, plasticity, and evolution of genomes.  相似文献   

11.
T Inukai  A Sako  H Y Hirano  Y Sano 《Génome》2000,43(4):589-596
In plant genomes as well as other eukaryotic genomes, meiotic recombination does not occur uniformly. At the level of the gene, high recombination frequencies are often observed within genetic loci in maize, but this feature of intragenic recombination is not seen at the csr1 locus in Arabidopsis. These observations suggest that meiotic recombination in plant genomes varies considerably among species. In the present study we investigated meiotic recombination at the wx locus in rice. The mutation sites of wx mutants induced by ethyl methanesulfonate (EMS) treatment or gamma-ray irradiation and a spontaneous wx mutant were physically characterized, and the genetic distances between those wx mutation sites were estimated by pollen analysis. Based on these results, the recombination frequency at the wx locus in rice was estimated as 27.3 kb/cM, which was about 10 times higher than the average for the genome, suggesting that there was a radically different rate of meiotic recombination for intra- and intergenic regions in the rice genome.  相似文献   

12.
The contribution of slippage-like processes to genome evolution   总被引:19,自引:0,他引:19  
Simple sequences present in long (>30 kb) sequences representative of the single-copy genome of five species (Homo sapiens, Caenorhabditis elegans Saccharomyces cerevisiae, E. coli, and Mycobacterium leprae) have been analyzed. A close relationship was observed between genome size and the overall level of sequence repetition. This suggested that the incorporation of simple sequences had accompanied increases of genome size during evolution. Densities of simple sequence motifs were higher in noncoding regions than in coding regions in eukaryotes but not in eubacteria. All five genomes showed very biased frequency distributions of simple sequence motifs in all species, particularly in eukaryotes where AAA and TTT predominated. Interspecific comparisons showed that noncoding sequences in eukaryotes showed highly significantly similar frequency distributions of simple sequence motifs but this was not true of coding sequences. ANOVA of the frequency distributions of simple sequence motifs indicated strong contributions from motif base composition and repeat unit length, but much of the variation remained unexplained by these parameters. The sequence composition of simple sequences therefore appears to reflect both underlying sequence biases in slippage-like processes and the action of selection. Frequency distributions of simple sequence motifs in coding sequences correlated weakly or not at all with those in noncoding sequences. Selection on coding sequences to eliminate undesirable sequences may therefore have been strong, particularly in the human lineage.  相似文献   

13.
Genome and protein evolution in eukaryotes   总被引:1,自引:0,他引:1  
The past year has seen the completion of the genome sequence of the flowering plant Arabidopsis thaliana and the initial sequence reports of the human genome. The availability of completely sequenced eukaryotic genomes from disparate phylogenetic lineages has opened the door to comparative analyses and a better understanding of the evolutionary processes shaping genomes. Complex many-to-many relationships between genes from different species appear to be the norm, suggesting that transfer of detailed functional annotation will not be straightforward. In addition to expansion and contraction of gene families, new genes evolve from recombination of pre-existing domains, although some domain families do appear to have evolved recently and to be specific to restricted phylogenetic lineages. The overall picture is of a huge diversity of gene content within eukaryotic genomes, reflecting different functional demands in different species.  相似文献   

14.
The two living groups of flying vertebrates, birds and bats, both have constricted genome sizes compared with their close relatives. But nothing is known about the genomic characteristics of pterosaurs, which took to the air over 70 Myr before birds and were the first group of vertebrates to evolve powered flight. Here, we estimate genome size for four species of pterosaurs and seven species of basal archosauromorphs using a Bayesian comparative approach. Our results suggest that small genomes commonly associated with flight in bats and birds also evolved in pterosaurs, and that the rate of genome-size evolution is proportional to genome size within amniotes, with the fastest rates occurring in lineages with the largest genomes. We examine the role that drift may have played in the evolution of genome size within tetrapods by testing for correlated evolution between genome size and body size, but find no support for this hypothesis. By contrast, we find evidence suggesting that a combination of adaptation and phylogenetic inertia best explains the correlated evolution of flight and genome-size contraction. These results suggest that small genome/cell size evolved prior to or concurrently with flight in pterosaurs. We predict that, similar to the pattern seen in theropod dinosaurs, genome-size contraction preceded flight in pterosaurs and bats.  相似文献   

15.
Genome compaction and stability in microsporidian intracellular parasites   总被引:13,自引:0,他引:13  
Microsporidian genomes are extraordinary among eukaryotes for their extreme reduction: although they are similar in form to other eukaryotic genomes, they are typically smaller than many prokaryotic genomes. At the same time, their rates of sequence evolution are among the highest for eukaryotic organisms. To explore the effects of compaction on nuclear genome evolution, we sequenced 685,000 bp of the Antonospora locustae genome (formerly Nosema locustae) and compared its organization with the recently completed genome of the human parasite Encephalitozoon cuniculi. Despite being very distantly related, the genomes of these two microsporidian species have retained an unexpected degree of synteny: 13% of genes are in the same context, and 30% of the genes were separated by a small number of short rearrangements. Microsporidian genomes are, therefore, paradoxically composed of rapidly evolving sequences harbored within a slowly evolving genome, although these two processes are sometimes considered to be coupled. Microsporidian genomes show that eukaryotic genomes (like genes) do not evolve in a clock-like fashion, and genome stability may result from compaction in addition to a lack of recombination, as has been traditionally thought to occur in bacterial and organelle genomes.  相似文献   

16.
Mitochondria are archetypal eukaryotic organelles that were acquired by endosymbiosis of an ancient species of alpha‐proteobacteria by the last eukaryotic common ancestor. The genetic information contained within the mitochondrial genome has been an important source of information for resolving relationships among eukaryotic taxa. In this study, we utilized mitochondrial and chloroplast genomes to explore relationships among prasinophytes. Prasinophytes are represented by diverse early‐diverging green algae whose physical structures and genomes have the potential to elucidate the traits of the last common ancestor of the Viridiplantae (or Chloroplastida). We constructed de novo mitochondrial genomes for two prasinophyte algal species, Pyramimonas parkeae and Cymbomonas tetramitiformis, representing the prasinophyte clade. Comparisons of genome structure and gene order between these species and to those of other prasinophytes revealed that the mitochondrial genomes of P. parkeae and C. tetramitiformis are more similar to each other than to other prasinophytes, consistent with other molecular inferences of the close relationship between these two species. Phylogenetic analyses using the inferred amino acid sequences of mitochondrial and chloroplast protein‐coding genes resolved a clade consisting of P. parkeae and C. tetramitiformis; and this group (representing the prasinophyte clade I) branched with the clade II, consistent with previous studies based on the use of nuclear gene markers.  相似文献   

17.

Background

Entamoeba histolytica is a significant cause of disease worldwide. However, little is known about the genetic diversity of the parasite. We re-sequenced the genomes of ten laboratory cultured lines of the eukaryotic pathogen Entamoeba histolytica in order to develop a picture of genetic diversity across the genome.

Results

The extreme nucleotide composition bias and repetitiveness of the E. histolytica genome provide a challenge for short-read mapping, yet we were able to define putative single nucleotide polymorphisms in a large portion of the genome. The results suggest a rather low level of single nucleotide diversity, although genes and gene families with putative roles in virulence are among the more polymorphic genes. We did observe large differences in coverage depth among genes, indicating differences in gene copy number between genomes. We found evidence indicating that recombination has occurred in the history of the sequenced genomes, suggesting that E. histolytica may reproduce sexually.

Conclusions

E. histolytica displays a relatively low level of nucleotide diversity across its genome. However, large differences in gene family content and gene copy number are seen among the sequenced genomes. The pattern of polymorphism indicates that E. histolytica reproduces sexually, or has done so in the past, which has previously been suggested but not proven.  相似文献   

18.
It is universally true in ecological communities, terrestrial or aquatic, temperate or tropical, that some species are very abundant, others are moderately common, and the majority are rare. Likewise, eukaryotic genomes also contain classes or “species” of genetic elements that vary greatly in abundance: DNA transposons, retrotransposons, satellite sequences, simple repeats and their less abundant functional sequences such as RNA or genes. Are the patterns of relative species abundance and diversity similar among ecological communities and genomes? Previous dynamical models of genomic diversity have focused on the selective forces shaping the abundance and diversity of transposable elements (TEs). However, ideally, models of genome dynamics should consider not only TEs, but also the diversity of all genetic classes or “species” populating eukaryotic genomes. Here, in an analysis of the diversity and abundance of genetic elements in >500 eukaryotic chromosomes, we show that the patterns are consistent with a neutral hypothesis of genome assembly in virtually all chromosomes tested. The distributions of relative abundance of genetic elements are quite precisely predicted by the dynamics of an ecological model for which the principle of functional equivalence is the main assumption. We hypothesize that at large temporal scales an overarching neutral or nearly neutral process governs the evolution of abundance and diversity of genetic elements in eukaryotic genomes.  相似文献   

19.
太行菊(Opisthopappus taihangensis)、长裂太行菊(O.longilobus),为太行山特有多年生崖壁草本植物,菊科(Compositae)重要野生资源,具有较高的经济与生态价值。为确定适合两物种的全基因组测序策略,该研究利用流式细胞法和高通量测序技术,分析两物种基因组大小、杂合率、重复序列及GC含量等信息。结果表明:(1)流式细胞法估算太行菊基因组大小约为2.1 Gb,长裂太行菊基因组大小约为2.4 Gb。(2)高通量测序修正后太行菊基因组大小为3.13 Gb,重复序列比例为84.35%,杂合度为0.99%,GC含量为36.56%;长裂太行菊基因组为3.18 Gb,重复序列比例为83.83%,杂合度为1.17%,GC含量为36.62%。(3)初步组装后GC含量分布及平均深度存在异常,出现分层现象,可能是两物种基因组杂合率较高所致。综上结果表明,太行菊、长裂太行菊均属于高重复、高杂合、大基因组的复杂基因组,建议使用Illumina+PacBio测序组装策略,进行全基因组测序分析。  相似文献   

20.
Whole genome duplication (WGD) and subsequent evolution of gene pairs have been shown to have shaped the present day genomes of most, if not all, plants and to have played an essential role in the evolution of many eukaryotic genomes. Analysis of the rice (Oryza sativa ssp. japonica) genome sequence suggested an ancestral WGD ~50-70 Ma common to all cereals and a segmental duplication between chromosomes 11 and 12 as recently as 5 Ma. More recent studies based on coding sequences have demonstrated that gene conversion is responsible for the high sequence conservation which suggested such a recent duplication. We previously showed that gene conversion has been a recurrent process throughout the Oryza genus and in closely related species and that orthologous duplicated regions are also highly conserved in other cereal genomes. We have extended these studies to compare megabase regions of genomic (coding and noncoding) sequences between two cultivated (O. sativa, Oryza glaberrima) and one wild (Oryza brachyantha) rice species using a novel approach of topological incongruency. The high levels of intraspecies conservation of both gene and nongene sequences, particularly in O. brachyantha, indicate long-range conversion events less than 4 Ma in all three species. These observations demonstrate megabase-scale conversion initiated within a highly rearranged region located at ~2.1 Mb from the chromosome termini and emphasize the importance of gene conversion in cereal genome evolution.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号