首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The human genome is composed of large sequence segments with fairly homogeneous GC content, namely isochores, which have been linked to many important functions; biological implications of most isochore boundaries, however, remain elusive, partly due to the difficulty in determining these boundaries at high resolution. Using the segmentation algorithm based on the quadratic divergence, we re-determined all 79 boundaries of previously identified human isochores at single-nucleotide resolution, and then compared the boundary coordinates with other genome features. We found that 55.7% of isochore boundaries coincide with termini of repeat elements; 45.6% of isochore boundaries coincide with termini of highly conserved sequences based on alignment of 17 vertebrate genomes, i.e., the highly conserved genome sequence switches to a less or non-conserved one at the isochore boundary; some isochore boundaries coincide with abrupt change of CpG island distribution (note that one boundary can associate with more than one genome feature). In addition, sequences around isochore boundaries are highly conserved. It seems reasonable to deduce that the boundaries of all the isochores studied here would be replication timing sites in the human genome. These results suggest possible key roles of the isochore boundaries and may further our understanding of the human genome organization.  相似文献   

2.
The human genome is a mosaic of isochores, which are long DNA segments (300 kbp) relatively homogeneous in G+C. Human isochores were first identified by density-gradient ultracentrifugation of bulk DNA, and differ in important features, e.g. genes are found predominantly in the GC-richest isochores. Here, we use a reliable segmentation method to partition the longest contigs in the human genome draft sequence into long homogeneous genome regions (LHGRs), thereby revealing the isochore structure of the human genome. The advantages of the isochore maps presented here are: (1) sequence heterogeneities at different scales are shown in the same plot; (2) pair-wise compositional differences between adjacent regions are all statistically significant; (3) isochore boundaries are accurately defined to single base pair resolution; and (4) both gradual and abrupt isochore boundaries are simultaneously revealed. Taking advantage of the wide sample of genome sequence analyzed, we investigate the correspondence between LHGRs and true human isochores revealed through DNA centrifugation. LHGRs show many of the typical isochore features, mainly size distribution, G+C range, and proportions of the isochore classes. The relative density of genes, Alu and long interspersed nuclear element repeats and the different types of single nucleotide polymorphisms on LHGRs also coincide with expectations in true isochores. Potential applications of isochore maps range from the improvement of gene-finding algorithms to the prediction of linkage disequilibrium levels in association studies between marker genes and complex traits. The coordinates for the LHGRs identified in all the contigs longer than 2 Mb in the human genome sequence are available at the online resource on isochore mapping: http://bioinfo2.ugr.es/isochores.  相似文献   

3.
Gao F  Zhang CT 《FEBS letters》2008,582(16):2441-2444
The human genome is structured at multiple levels: it is organized into a series of replication time zones, and meanwhile it is composed of isochores. Accumulating evidence suggests a match between these two genome features. Based on newly developed software GC-Profile, we obtained a complete coverage of the human genome by 3198 isochores with boundaries at single nucleotide resolution. Interestingly, the experimentally confirmed replication timing sites in the regions of 1p36.1, 6p21.32, 17q11.2 and 22q12.1 nearly all coincide with the determined isochore boundaries. The precise boundaries of the 3198 isochores are available via the website: http://tubic.tju.edu.cn/isomap/.  相似文献   

4.
The mammalian genome is not a random sequence but shows a specific, evolutionarily conserved structure that becomes manifest in its isochore pattern. Isochores, i.e. stretches of DNA with a distinct sequence composition and thus a specific GC content, cause the chromosomal banding pattern. This fundamental level of genome organization is related to several functional features like the replication timing of a DNA sequence. GC richness of genomic regions generally corresponds to an early replication time during S phase. Recently, we demonstrated this interdependency on a molecular level for an abrupt transition from a GC-poor isochore to a GC-rich one in the NF1 gene region; this isochore boundary also separates late from early replicating chromatin. Now, we analyzed another genomic region containing four isochores separated by three sharp isochore transitions. Again, the GC-rich isochores were found to be replicating early, the GC-poor isochores late in S phase; one of the replication time zones was discovered to consist of one single replicon. At the boundaries between isochores, that all show no special sequence elements, the replication machinery stopped for several hours. Thus, our results emphasize the importance of isochores as functional genomic units, and of isochore transitions as genomic landmarks with a key function for chromosome organization and basic biological properties.  相似文献   

5.
Incorporated with the Z curve method, the technique of wavelet multiresolution (also known as multiscale) analysis has been proposed to identify the boundaries of isochores in the human genome. The human MHC sequence and the longest contigs of human chromosomes 21 and 22 are used as examples. The boundary between the isochores of Class III and Class II in the MHC sequence has been detected and found to be situated at the position 2,490,368bp. This result is in good agreement with the experimental evidence. An isochore with a length of about 7Mb in chromosome 21 has been identified and found to be gene- and Alu-poor. We have also found that the G+C content of chromosome 21 is more homogeneous than that of chromosome 22. Compared with the window-based methods, the present method has the highest resolution for identifying the boundaries of isochores, even at a scale of single base. Compared with the entropic segmentation method, the present method has the merits of more intuitiveness and less calculations. The important conclusion drawn in this study is that the segmentation points, at which the G+C content undergoes relatively dramatic changes, do exist in the human genome. These 'singularity' points may be considered to be candidates of isochore boundaries in the human genome. The method presented is a general one and can be used to analyze any other genomes.  相似文献   

6.
Analytical DNA ultracentrifugation revealed that eukaryotic genomes are mosaics of isochores: long DNA segments (>300 kb on average) relatively homogeneous in G+C. Important genome features are dependent on this isochore structure, e.g. genes are found predominantly in the GC-richest isochore classes. However, no reliable method is available to rigorously partition the genome sequence into relatively homogeneous regions of different composition, thereby revealing the isochore structure of chromosomes at the sequence level. Homogeneous regions are currently ascertained by plain statistics on moving windows of arbitrary length, or simply by eye on G+C plots. On the contrary, the entropic segmentation method is able to divide a DNA sequence into relatively homogeneous, statistically significant domains. An early version of this algorithm only produced domains having an average length far below the typical isochore size. Here we show that an improved segmentation method, specifically intended to determine the most statistically significant partition of the sequence at each scale, is able to identify the boundaries between long homogeneous genome regions displaying the typical features of isochores. The algorithm precisely locates classes II and III of the human major histocompatibility complex region, two well-characterized isochores at the sequence level, the boundary between them being the first isochore boundary experimentally characterized at the sequence level. The analysis is then extended to a collection of human large contigs. The relatively homogeneous regions we find show many of the features (G+C range, relative proportion of isochore classes, size distribution, and relationship with gene density) of the isochores identified through DNA centrifugation. Isochore chromosome maps, with many potential applications in genomics, are then drawn for all the completely sequenced eukaryotic genomes available.  相似文献   

7.
Isochore structures in the mouse genome   总被引:2,自引:0,他引:2  
Zhang CT  Zhang R 《Genomics》2004,83(3):384-394
The distribution of the G+C content in the mouse genome has been studied using a windowless technique. We have found that: (i). Abrupt variations of the G+C content from a GC-rich region to a GC-poor region, and vice versa, occur frequently at some sites along the sequence of the mouse genome. (ii). Long domains with relatively homogeneous G+C content (isochores) exist, which usually have sharp boundaries. Consequently, 28 isochores longer than 1 Mb have been identified in the mouse genome. A homogeneity index was used to quantify the variations of the G+C content within isochores. The precise boundaries, sizes, and G+C contents of these isochores have been determined. The windowless technique for the G+C content computation was also used to analyze the DNA sequence containing the mouse MHC region, which has a GC-poor isochore. This isochore is located at the central part of the sequence with boundaries at 468459 and 812716 bp, where the sequence is extended from the centromeric end to the telomeric end. In addition, the analysis of a segment of the rat genome shows that the rat genome also has clear isochore structures.  相似文献   

8.
An isochore map of the human genome based on the Z curve method   总被引:4,自引:0,他引:4  
Zhang CT  Zhang R 《Gene》2003,317(1-2):127-135
The distribution of the G+C content in the human genome has been studied by using a windowless technique derived from the Z curve method. The most important findings presented in this paper are twofold. First, abrupt variations of the G+C content along human chromosome sequences are the main variation patterns of G+C content. It is found that at some sites, the G+C content undergoes abrupt changes from a G+C-rich region to a G+C-poor region alternatively and vice versa. Second, it is shown that long domains with relatively homogeneous G+C content along each chromosome do exist. These domains are thought to be isochores, which usually have sharp boundaries. Consequently, 56 isochores longer than 3 Mb have been identified in chromosomes 1-22, X and Y. Boundaries, size and G+C content of each isochore identified are listed in detail. As an example to demonstrate the power of the method, the boundary between the Classes III and II isochores of the MHC sequence has been determined and found to be at 2,477,936, which is in good agreement with the experimental evidence. A homogeneity index is introduced to measure the homogeneity of G+C content in isochores. We emphasize that the homogeneity of G+C content is relative. The isochores in which the G+C content keeps absolutely constant do not exist. Isochore structures appear to be a basic organization of the human genome. Due to the relevance to many important biological functions, the clarification of isochore structures will provide much insight into the understanding of the human genome.  相似文献   

9.
We compared the exon/intron organization of vertebrate genes belonging to different isochore classes, as predicted by their GC content at third codon position. Two main features have emerged from the analysis of sequences published in GenBank: (1) genes coding for long proteins (i.e., 500 aa) are almost two times more frequent in GC-poor than in GC-rich isochores; (2) intervening sequences (=sum of introns) are on average three times longer in GC-poor than in GC-rich isochores. These patterns are observed among human, mouse, rat, cow, and even chicken genes and are therefore likely to be common to all warm-blooded vertebrates. Analysis of Xenopus sequences suggests that the same patterns exist in cold-blooded vertebrates. It could be argued that such results do not reflect the reality because sequence databases are not representative of entire genomes. However, analysis of biases in GenBank revealed that the observed discrepancies between GC-rich and GC-poor isochores are not artifactual, and are probably largely underestimated. We investigated the distribution of microsatellites and interspersed repeats in introns of human and mouse genes from different isochores. This analysis confirmed previous studies showing that Ll repeats are almost absent from GC-rich isochores. Microsatellites and SINES (Alu, B1, B2) are found at roughly equal frequencies in introns from all isochore classes. Globally, the presence of repeated sequences does not account for the increased intron length in GC-poor isochores. The relationships between gene structure and global genome organization and evolution are discussed.  相似文献   

10.
We have hybridized a human DNA fraction corresponding to the GC-richest and gene-richest isochore family, H3, on compositional fractions of DNAs from 12 mammalian species and three avian species, representing eight and three orders, respectively. Under conditions in which repetitive sequences are competed out, the H3 isochore probe only or predominantly hybridized on the GC-richest fractions of main-band DNA from all the species investigated. These results indicate that single-copy sequences from the human H3 isochores share homology with sequences located in the compositionally corresponding compartments of the vertebrate genomes tested. These sequences are likely to be essentially formed by conserved coding sequences. The present results add to other lines of evidence indicating that isochore patterns are highly conserved in warm-blooded vertebrate genomes. Moreover, they refine recent reports (Sabeur et al., 1993; Kadi et al., 1993), and correct them in some details and also in demonstrating that the shrew genome does not exhibit the general mammalian pattern, but a special pattern.Correspondence to: G. Bernardi  相似文献   

11.
Isochore patterns and gene distributions in fish genomes   总被引:2,自引:0,他引:2  
The compositional approach developed in our laboratory many years ago revealed a large-scale compositional heterogeneity in vertebrate genomes, in which GC-rich and GC-poor regions, the isochores, were found to be characterized by high and low gene densities, respectively. Here we mapped isochores on fish chromosomes and assessed gene densities in isochore families. Because of the availability of sequence data, we have concentrated our investigations on four species, zebrafish (Brachydanio rerio), medaka (Oryzias latipes), stickleback (Gasterosteus aculeatus), and pufferfish (Tetraodon nigroviridis), which belong to four distant orders and cover almost the entire GC range of fish genomes. These investigations produced isochore maps that were drastically different not only from those of mammals (in that only two major isochore families were essentially present in each genome vs five in the human genome) but also from each other (in that different isochore families were represented in different genomes). Gene density distributions for these fish genomes were also obtained and shown to follow the expected increase with increasing isochore GC. Finally, we discovered a remarkable conservation of the average size of the isochores (which match replicon clusters in the case of human chromosomes) and of the average GC levels of isochore families in both fish and human genomes. Moreover, in each genome the GC-poorest isochore families comprised a group of "long isochores" (2-20 Mb in size), which were the lowest in GC and varied in size distribution and relative amount from one genome to the other.  相似文献   

12.
We have hybridized the vertebrate telomeric sequence (TTAGGG)n on DNA compositional fractions from 13 mammalian species and 3 avian species, representing 9 and 3 orders, respectively. Our results indicate that the 50- to 100-kb fragments derived from telomeric regions are composed of GC-rich and GC-richest isochores. Previous works from our laboratory demonstrated that single-copy sequences from the human H3 isochore family (the GC-richest and gene-richest isochore in the human genome) share homology with compositionally correlated compartments of warm-blooded vertebrates. This correlation suggested that the GC-richest isochores are, as in the human genome, the gene-richest regions of warm-blooded vertebrates' genome. Moreover, this evidence suggests that telomeric regions are the most gene-dense region of all warm-blooded vertebrates. The implications of these findings are discussed.  相似文献   

13.
We have mapped and sequenced the region immediately centromeric of the human major histocompatibility complex (MHC). A cluster of 13 genes/pseudogenes was identified in a 175 kb PAC linking the TAPASIN locus with the class II region. It includes two novel human genes (BING4 and SACM2L) and a thus far unnoticed human leucocyte antigen (HLA) class II pseudogene, termed HLA-DPA3. Analysis of the G+C content revealed an isochore boundary which, together with the previously reported telomeric boundary, defines the MHC class II region as one of the first completely sequenced isochores in the human genome. Comparison of the sequence with limited sequence from other cell lines shows that the high sequence variation found within the classical class II region extends beyond the identified isochore boundary leading us to propose the concept of an "extended MHC". By comparative analysis, we have precisely identified the mouse/human synteny breakpoint at the centromeric end of the extended MHC class II region between the genes HSET and PHF1.  相似文献   

14.
15.
The isochore concept in human genome sequence was challenged in an analysis by the International Human Genome Sequencing Consortium (IHGSC). We argue here that a statement in IGHSC analysis concerning the existence of isochore is incorrect, because it had applied an inappropriate statistical test. To test the existence of isochores should be equivalent to a test of homogeneity of windowed GC%. The statistical test applied in the IHGSC's analysis, the binomial test, is however a test of a sequence being random on the base level. For testing the existence of isochore, or homogeneity in GC%, we propose to use another statistical test: the analysis of variance (ANOVA). It can be shown that DNA sequences that are rejected by binomial test may not be rejected by the ANOVA test.  相似文献   

16.
The genome of Plasmodium cynomolgi is partitioned into at least 7 distinct genetic domains. Each domain is apparently uniform in DNA density and is separable from the others by CsCl density centrifugation in the presence of Hoechst dye. The protein-encoding genes that were tested are localized in the two heaviest density domains (isochores). The ribosomal genes are in two lighter isochores as well as in one of the isochores that contains protein encoding genes. Telomeric sequences are mainly, if not exclusively, in the lightest isochores, indicating that position with regard to chromosome ends may correlate with density. Blocks of a tandemly-repeating sequence which mark genetically hypervariable chromosome regions in malaria parasites are located in all isochores. However, the rate of change associated with the blocks of sequence is much slower in some isochores than in others. This indicates that the rate of genetic change in these parasites may differ with isochore and chromosomal position. These results may also have more general biological implications since they suggest that the genetic instability often noted for tandem repeat sequences in the eukaryotic genome may be limited to only a distinct subset of the genomic complement of such sequence blocks.  相似文献   

17.

Background

The very recent availability of fully sequenced individual human genomes is a major revolution in biology which is certainly going to provide new insights into genetic diseases and genomic rearrangements.

Results

We mapped the insertions, deletions and SNPs (single nucleotide polymorphisms) that are present in Craig Venter''s genome, more precisely on chromosomes 17 to 22, and compared them with the human reference genome hg17. Our results show that insertions and deletions are almost absent in L1 and generally scarce in L2 isochore families (GC-poor L1+L2 isochores represent slightly over half of the human genome), whereas they increase in GC-rich isochores, largely paralleling the densities of genes, retroviral integrations and Alu sequences. The distributions of insertions/deletions are in striking contrast with those of SNPs which exhibit almost the same density across all isochore families with, however, a trend for lower concentrations in gene-rich regions.

Conclusions

Our study strongly suggests that the distribution of insertions/deletions is due to the structure of chromatin which is mostly open in gene-rich, GC-rich isochores, and largely closed in gene-poor, GC-poor isochores. The different distributions of insertions/deletions and SNPs are clearly related to the two different responsible mechanisms, namely recombination and point mutations.  相似文献   

18.
Vertebrate genomes are mosaics of megabase-size DNA segments with a fairly homogeneous base composition, called isochores. They are divided into five families characterized by different guanine-cytosine (GC) levels and linked to several functional and structural properties. The increased availability of fully sequenced genomes allows the investigation of isochores in several species, assessing their level of conservation across vertebrate genomes. In this work, we characterized the isochores in Bos taurus using the ARS-UCD1.2 genome version. The comparison of our results with the well-studied human isochores and those of other mammals revealed a large conservation in isochore families, in number, average GC levels and gene density. Exceptions to the established increase in gene density with the increase in isochores (GC%) were observed for the following gene biotypes: tRNA, small nuclear RNA, small nucleolar RNA and pseudogenes that have their maximum number in H2 and H1 isochores. Subsequently, we assessed the ontology of all gene biotypes looking for functional classes that are statistically over- or under-represented in each isochore. Receptor activity and sensory perception pathways were significantly over-represented in L1 and L2 (GC-poor) isochores. This was also validated for the horse genome. Our analysis of housekeeping genes confirmed a preferential localization in GC-rich isochores, as reported in other species. Finally, we assessed the SNP distribution of a bovine high-density SNP chip across the isochores, finding a higher density in the GC-rich families, reflecting a potential bias in the chip, widely used for genetic selection and biodiversity studies.  相似文献   

19.
Vertebrate genomes are mosaics of isochores, defined as long (>100 kb) regions with relatively homogeneous within-region base composition. Birds and mammals have more GC-rich isochores than amphibians and fish, and the GC-rich isochores of birds and mammals have been suggested to be an adaptation to homeothermy. If this hypothesis is correct, all poikilothermic (cold-blooded) vertebrates, including the nonavian reptiles, are expected to lack a GC-rich isochore structure. Previous studies using various methods to examine isochore structure in crocodilians, turtles, and squamates have led to different conclusions. We collected more than 6000 expressed sequence tags (ESTs) from the American alligator to overcome sample size limitations suggested to be the fundamental problem in the previous reptilian studies. The alligator ESTs were assembled and aligned with their human, mouse, chicken, and western clawed frog orthologs, resulting in 366 alignments. Analyses of third-codon-position GC content provided conclusive evidence that the poikilothermic alligator has GC-rich isochores, like homeothermic birds and mammals. We placed these results in a theoretical framework able to unify available models of isochore evolution. The data collected for this study allowed us to reject the models that explain the evolution of GC content using changes in body temperature associated with the transition from poikilothermy to homeothermy. Falsification of these models places fundamental constraints upon the plausible pathways for the evolution of isochores. Electronic supplementary material The online version of this article (doi: ) contains supplementary material, which is available to authorized users. Reviewing Editor: Dr. Nicolas Galtier  相似文献   

20.
In meiotic prophase I, chromatin fibrils attached to the lateral elements of the synaptonemal complexes form loops. Synaptonemal complex associated regions of DNA (SCARs DNA) are a family of genomic DNA sequences tightly associated with the synaptonemal complex; they are located at the chromatin loop basements. Isochore compositional fractions of the human and chicken genomes were used as 32P labeled probes for hybridization with SCAR DNA isolated previously from the spermatocyte nuclei of the golden hamster Mesocricetus auratus. Nucleotide sequences similar to the golden hamster’s SCAR DNA were found in human and chicken genome isochores. The localization of SCAR DNA in isochore compartments of the examined genomes was established to be evolutionary conservative.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号