首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Vanishing GC-rich isochores in mammalian genomes   总被引:25,自引:0,他引:25  
Duret L  Semon M  Piganeau G  Mouchiroud D  Galtier N 《Genetics》2002,162(4):1837-1847
To understand the origin and evolution of isochores-the peculiar spatial distribution of GC content within mammalian genomes-we analyzed the synonymous substitution pattern in coding sequences from closely related species in different mammalian orders. In primate and cetartiodactyls, GC-rich genes are undergoing a large excess of GC --> AT substitutions over AT --> GC substitutions: GC-rich isochores are slowly disappearing from the genome of these two mammalian orders. In rodents, our analyses suggest both a decrease in GC content of GC-rich isochores and an increase in GC-poor isochores, but more data will be necessary to assess the significance of this pattern. These observations question the conclusions of previous works that assumed that base composition was at equilibrium. Analysis of allele frequency in human polymorphism data, however, confirmed that in the GC-rich parts of the genome, GC alleles have a higher probability of fixation than AT alleles. This fixation bias appears not strong enough to overcome the large excess of GC --> AT mutations. Thus, whatever the evolutionary force (neutral or selective) at the origin of GC-rich isochores, this force is no longer effective in mammals. We propose a model based on the biased gene conversion hypothesis that accounts for the origin of GC-rich isochores in the ancestral amniote genome and for their decline in present-day mammals.  相似文献   

2.
Vertebrate genomes are mosaics of isochores, defined as long (>100 kb) regions with relatively homogeneous within-region base composition. Birds and mammals have more GC-rich isochores than amphibians and fish, and the GC-rich isochores of birds and mammals have been suggested to be an adaptation to homeothermy. If this hypothesis is correct, all poikilothermic (cold-blooded) vertebrates, including the nonavian reptiles, are expected to lack a GC-rich isochore structure. Previous studies using various methods to examine isochore structure in crocodilians, turtles, and squamates have led to different conclusions. We collected more than 6000 expressed sequence tags (ESTs) from the American alligator to overcome sample size limitations suggested to be the fundamental problem in the previous reptilian studies. The alligator ESTs were assembled and aligned with their human, mouse, chicken, and western clawed frog orthologs, resulting in 366 alignments. Analyses of third-codon-position GC content provided conclusive evidence that the poikilothermic alligator has GC-rich isochores, like homeothermic birds and mammals. We placed these results in a theoretical framework able to unify available models of isochore evolution. The data collected for this study allowed us to reject the models that explain the evolution of GC content using changes in body temperature associated with the transition from poikilothermy to homeothermy. Falsification of these models places fundamental constraints upon the plausible pathways for the evolution of isochores. Electronic supplementary material The online version of this article (doi: ) contains supplementary material, which is available to authorized users. Reviewing Editor: Dr. Nicolas Galtier  相似文献   

3.
A compositional map of human chromosome 21.   总被引:9,自引:0,他引:9       下载免费PDF全文
K Gardiner  B Aissani    G Bernardi 《The EMBO journal》1990,9(6):1853-1858
GC-poor and GC-rich isochores, the long (greater than 300 kb) compositionally homogeneous DNA segments that form the genome of warm-blooded vertebrates, are located in G- and R-bands respectively of metaphase chromosomes. The precise correspondence between GC-rich isochores and R-band structure is still, however, an open problem, because GC-rich isochores are compositionally heterogeneous and only represent one-third of the genome, with the GC-richest family (which is by far the highest in gene concentration) corresponding to less than 5% of the genome. In order to clarify this issue and, more generally, to correlate DNA composition and chromosomal structure in an unequivocal way, we have developed a new approach, compositional mapping. This consists of assessing the base composition over 0.2-0.3 Mb (megabase) regions surrounding landmarks that were previously localized on the physical map. Compositional mapping was applied here to the long arm of human chromosome 21, using 53 probes that had already been used in physical mapping. The results obtained provide a direct demonstration that the DNA stretches of G-bands essentially correspond to GC-poor isochores, and that R-band DNA is characterized by a compositional heterogeneity that is much more striking than expected, in that it comprises isochores covering the full spectrum of GC levels. GC-poor isochores of R-bands may, however, correspond to 'thin' G-bands, as visualized at high resolution, leaving GC-rich and very GC-rich isochores as the real components of (high-resolution) R-band DNA.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

4.
Arabidopsis thaliana is an important model system for the study of plant biology. We have analyzed the complete genome sequences of Arabidopsis by using a newly developed windowless method for the GC content computation, the cumulative GC profile. It is shown that the Arabidopsis genome is organized into a mosaic structure of isochores. All the centromeric regions are located in GC-rich isochores, called centromere-isochores, which are characterized by a high GC content but low gene and T-DNA insertion densities. This characteristic distinguishes centromere-isochores from the other class of GC-rich isochores, called GC-isochores, which have high gene and T-DNA insertion densities. Consequently, 15 isochores have been identified, i.e., 7 AT-isochores, 3 GC-isochores, and 5 centromere-isochores. The genes in centromere-isochores, which have the highest GC content, have much shorter intron lengths and lower intron numbers, compared to those of the other two types. There is also considerable difference in the numbers and lengths of transposable elements (TEs) between AT and GC-isochores, i.e., the TE number (length) of AT-isochores is 6.3 (7.3) times that of GC-isochores. It is generally believed that TEs are accumulated in the regions surrounding the centromeres. However, within these TE-rich regions, there are regions of extremely low TE numbers (TE deserts), which correspond to the positions of centromere-isochores. In addition, a heterochromatic knob is located at the boundary of an AT-isochore. Furthermore, we show that the differences in GC content among isochores are mainly due to the GC content variation of introns, the third codon positions and intergenic regions.[Reviewing Editor: Martin Kreitman]  相似文献   

5.
In a recent paper in these pages, Cohen et al. search for isochores in the human genome, based on a system of attributes that they assign to isochores. The putative isochores that they find and choose for presentation are almost all below 45% GC and cover only about 41% of the genome. Closer inspection reveals that the authors' methodology systematically loses GC-rich isochores because it does not anticipate the considerable fluctuations and corresponding long-range correlations that characterize mammalian DNA and that are highest in GC-rich DNA. Thus, they over-fragment GC-rich isochores (and also many GC-poor isochores) beyond recognition.  相似文献   

6.
Pavlícek A  Jabbari K  Paces J  Paces V  Hejnar JV  Bernardi G 《Gene》2001,276(1-2):39-45
Alus and LINEs (LINE1) are widespread classes of repeats that are very unevenly distributed in the human genome. The majority of GC-poor LINEs reside in the GC-poor isochores whereas GC-rich Alus are mostly present in GC-rich isochores. The discovery that LINES and Alus share similar target site duplication and a common AT-rich insertion site specificity raised the question as to why these two families of repeats show such a different distribution in the genome. This problem was investigated here by studying the isochore distributions of subfamilies of LINES and Alus characterized by different degrees of divergence from the consensus sequences, and of Alus, LINEs and pseudogenes located on chromosomes 21 and 22. Young Alus are more frequent in the GC-poor part of the genome than old Alus. This suggests that the gradual accumulation of Alus in GC-rich isochores has occurred because of their higher stability in compositionally matching chromosomal regions. Densities of Alus and LINEs increase and decrease, respectively, with increasing GC levels, except for the telomeric regions of the analyzed chromosomes. In addition to LINEs, processed pseudogenes are also more frequent in GC-poor isochores. Finally, the present results on Alu and LINE stability/exclusion predict significant losses of Alu DNA from the GC-poor isochores during evolution, a phenomenon apparently due to negative selection against sequences that differ from the isochore composition.  相似文献   

7.
The distribution in the human genome of the largest family of mobile elements, the Alu sequences, has been investigated for the past 30 years, and the vast majority of Alu sequences were shown to have the highest density in GC-rich isochores. Ten years ago, it was discovered, however, that the small "youngest" (most recently transposed) Alu families had a strikingly different distribution compared with the "old" families. This raised the question as to how this change took place in evolution. We solved what was considered to be a "mystery" by 1) revisiting our previous results on the integration and stability of retroviral sequences, and 2) assessing the densities of acceptor sites TTTT/AA in isochore families. We could conclude 1) that the open state of chromatin structure plays a crucial role in allowing not only the initial integration of retroviral sequences but also that of the youngest Alu sequences, and 2) that the distribution of old Alus can be explained as due to Alu sequences being unstable in the GC-poor isochores but stable in the compositionally matching GC-rich isochores, again in line with what happens in the case of retroviral sequences.  相似文献   

8.
Chen LL  Gao F 《The FEBS journal》2005,272(13):3328-3336
Eukaryotic genomes are composed of isochores, i.e. long sequences relatively homogeneous in GC content. In this paper, the isochore structure of Arabidopsis thaliana genome has been studied using a windowless technique based on the Z curve method and intuitive curves are drawn for all the five chromosomes. Using these curves, we can calculate the GC content at any resolution, even at the base level. It is observed that all the five chromosomes are composed of several GC-rich and AT-rich regions alternatively. Usually, these regions, named 'isochore-like regions', have large fluctuations in the GC content. Five isochores with little fluctuations are also observed. Detailed analyses have been performed for these isochores. A GC-rich 'isochore-like region' and a GC-isochore in chromosome II and IV, respectively, are the nucleolar organizer regions (NORs), and genes located in the two regions prefer to use GC-ending codons. Another GC-isochore located in chromosome II is a mitochondrial DNA insertion region, the position and size of this region is precisely predicted by the current method. The amino acid usage and codon preference of genes in this organellar-to-nuclear transfer region show significant difference from other regions. Moreover, the centromeres are located in GC-rich 'isochore-like regions' in all the five chromosomes. The current method can provide a useful tool for analyzing whole genomic sequences of eukaryotes.  相似文献   

9.
Pesole G  Bernardi G  Saccone C 《FEBS letters》1999,464(1-2):60-62
The efficiency of AUG start codon recognition in translation initiation is modulated by its sequence context. Here we investigated a non-redundant set of 5914 human genes and show that this context is different in genes located in different isochores. In particular, of the two main consensus start sequences, RCCaugR is five-fold more represented than AARaugR in genes from the GC-rich H3 isochores compared to genes from the GC-poor L isochores. Furthermore, genes located in GC-rich isochores have shorter 5' UTRs and stronger avoidance of upstream AUG than genes located in GC-poor isochores. This suggests that genes requiring highly efficient translation are located in GC-rich isochores and genes requiring fine modulation of expression are located in GC-poor isochores. This is in agreement with independent data from the literature concerning the location of housekeeping and tissue-specific genes, respectively.  相似文献   

10.
Warm-blooded isochore structure in Nile crocodile and turtle.   总被引:11,自引:0,他引:11  
  相似文献   

11.
We compared the exon/intron organization of vertebrate genes belonging to different isochore classes, as predicted by their GC content at third codon position. Two main features have emerged from the analysis of sequences published in GenBank: (1) genes coding for long proteins (i.e., 500 aa) are almost two times more frequent in GC-poor than in GC-rich isochores; (2) intervening sequences (=sum of introns) are on average three times longer in GC-poor than in GC-rich isochores. These patterns are observed among human, mouse, rat, cow, and even chicken genes and are therefore likely to be common to all warm-blooded vertebrates. Analysis of Xenopus sequences suggests that the same patterns exist in cold-blooded vertebrates. It could be argued that such results do not reflect the reality because sequence databases are not representative of entire genomes. However, analysis of biases in GenBank revealed that the observed discrepancies between GC-rich and GC-poor isochores are not artifactual, and are probably largely underestimated. We investigated the distribution of microsatellites and interspersed repeats in introns of human and mouse genes from different isochores. This analysis confirmed previous studies showing that Ll repeats are almost absent from GC-rich isochores. Microsatellites and SINES (Alu, B1, B2) are found at roughly equal frequencies in introns from all isochore classes. Globally, the presence of repeated sequences does not account for the increased intron length in GC-poor isochores. The relationships between gene structure and global genome organization and evolution are discussed.  相似文献   

12.
The mammalian genome is not a random sequence but shows a specific, evolutionarily conserved structure that becomes manifest in its isochore pattern. Isochores, i.e. stretches of DNA with a distinct sequence composition and thus a specific GC content, cause the chromosomal banding pattern. This fundamental level of genome organization is related to several functional features like the replication timing of a DNA sequence. GC richness of genomic regions generally corresponds to an early replication time during S phase. Recently, we demonstrated this interdependency on a molecular level for an abrupt transition from a GC-poor isochore to a GC-rich one in the NF1 gene region; this isochore boundary also separates late from early replicating chromatin. Now, we analyzed another genomic region containing four isochores separated by three sharp isochore transitions. Again, the GC-rich isochores were found to be replicating early, the GC-poor isochores late in S phase; one of the replication time zones was discovered to consist of one single replicon. At the boundaries between isochores, that all show no special sequence elements, the replication machinery stopped for several hours. Thus, our results emphasize the importance of isochores as functional genomic units, and of isochore transitions as genomic landmarks with a key function for chromosome organization and basic biological properties.  相似文献   

13.
Vertebrate genomes are comprised of isochores that are relatively long (>100 kb) regions with a relatively homogenous (either GC-rich or AT-rich) base composition and with rather sharp boundaries with neighboring isochores. Mammals and living archosaurs (birds and crocodilians) have heterogeneous genomes that include very GC-rich isochores. In sharp contrast, the genomes of amphibians and fishes are more homogeneous and they have a lower overall GC content. Because DNA with higher GC content is more thermostable, the elevated GC content of mammalian and archosaurian DNA has been hypothesized to be an adaptation to higher body temperatures. This hypothesis can be tested by examining structure of isochores across the reptilian clade, which includes the archosaurs, testudines (turtles), and lepidosaurs (lizards and snakes), because reptiles exhibit diverse body sizes, metabolic rates, and patterns of thermoregulation. This study focuses on a comparative analysis of a new set of expressed genes of the red-eared slider turtle and orthologs of the turtle genes in mammalian (human, mouse, dog, and opossum), archosaurian (chicken and alligator), and amphibian (western clawed frog) genomes. EST (expressed sequence tag) data from a turtle cDNA library enriched for genes that have specialized functions (developmental genes) revealed using the GC content of the third-codon-position to examine isochore structure requires careful consideration of the types of genes examined. The more highly expressed genes (e.g., housekeeping genes) are more likely to be GC-rich than are genes with specialized functions. However, the set of highly expressed turtle genes demonstrated that the turtle genome has a GC content that is intermediate between the GC-poor amphibians and the GC-rich mammals and archosaurs. There was a strong correlation between the GC content of all turtle genes and the GC content of other vertebrate genes, with the slope of the line describing this relationship also indicating that the isochore structure of turtles is intermediate between that of amphibians and other amniotes. These data are consistent with some thermal hypotheses of isochore evolution, but we believe that the credible set of models for isochore evolution still includes a variety of models. These data expand the amount of genomic data available from reptiles upon which future studies of reptilian genomics can build.  相似文献   

14.
Expression patterns and gene distribution in the human genome   总被引:5,自引:0,他引:5  
D'Onofrio G 《Gene》2002,300(1-2):155-160
  相似文献   

15.
DNA helix: the importance of being GC-rich   总被引:14,自引:2,他引:12       下载免费PDF全文
  相似文献   

16.
Whether isochores, the large-scale variation of the GC content in mammalian genomes, are being maintained has recently been questioned. It has been suggested that GC-rich isochores originated in the ancestral amniote genome but that whatever force gave rise to them is no longer effective and that isochores are now disappearing from mammalian genomes. Here we investigated the evolution of the GC content of 41 coding genes in 6 to 66 species of mammals by estimating the ancestral GC content using a method which allows for different rates of substitution between sites. We found a highly significant decrease in the GC content during early mammalian evolution, as well as a weaker but still significant decrease in the GC content of GC-rich genes later in at least three groups of mammals: primates, rodents, and carnivores. These results are of interest because they confirm the recently suggested disappearance of GC-rich isochores in some mammalian genomes, and more importantly, they suggest that this disappearance started very early in mammalian evolution.This article contains online supplementary material.  相似文献   

17.
18.
We have hybridized the vertebrate telomeric sequence (TTAGGG)n on DNA compositional fractions from 13 mammalian species and 3 avian species, representing 9 and 3 orders, respectively. Our results indicate that the 50- to 100-kb fragments derived from telomeric regions are composed of GC-rich and GC-richest isochores. Previous works from our laboratory demonstrated that single-copy sequences from the human H3 isochore family (the GC-richest and gene-richest isochore in the human genome) share homology with compositionally correlated compartments of warm-blooded vertebrates. This correlation suggested that the GC-richest isochores are, as in the human genome, the gene-richest regions of warm-blooded vertebrates' genome. Moreover, this evidence suggests that telomeric regions are the most gene-dense region of all warm-blooded vertebrates. The implications of these findings are discussed.  相似文献   

19.
The genomes of homeothermic (warm-blooded) vertebrates are mosaic interspersions of homogeneously GC-rich and GC-poor regions (isochores). Evolution of genome compartmentalization and GC-rich isochores is hypothesized to reflect either selective advantages of an elevated GC content or chromosome location and mutational pressure associated with the timing of DNA replication in germ cells. To address the present controversy regarding the origins and maintenance of isochores in homeothermic vertebrates, newly obtained as well as published nucleotide sequences of the insulin and insulin-like growth factor (IGF) genes, members of a well-characterized gene family believed to have evolved by repeated duplication and divergence, were utilized to examine the evolution of base composition in nonconstrained (flanking) and weakly constrained (introns and fourfold degenerate sites) regions. A phylogeny derived from amino acid sequences supports a common evolutionary history for the insulin/IGF family genes. In cold- blooded vertebrates, insulin and the IGFs were similar in base composition. In contrast, insulin and IGF-II demonstrate dramatic increases in GC richness in mammals, but no such trend occurred in IGF- I. Base composition of the coding portions of the insulin and IGF genes across vertebrates correlated (r = 0.90) with that of the introns and flanking regions. The GC content of homologous introns differed dramatically between insulin/IGF-II and IGF-I genes in mammals but was similar to the GC level of noncoding regions in neighboring genes. Our findings suggest that the base composition of introns and flanking regions is determined by chromosomal location and the mutational pressure of the isochore in which the sequences are embedded. An elevated GC content at codon third positions in the insulin and the IGF genes may reflect selective constraints on the usage of synonymous codons.   相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号