共查询到20条相似文献,搜索用时 15 毫秒
1.
The availability of the complete chicken genome sequence provides an unprecedented opportunity to study the global genome organization at the sequence level. Delineating compositionally homogeneous G + C domains in DNA sequences can provide much insight into the understanding of the organization and biological functions of the chicken genome. A new segmentation algorithm, which is simple and fast, has been proposed to partition a given genome or DNA sequence into compositionally distinct domains. By applying the new segmentation algorithm to the draft chicken genome sequence, the mosaic organization of the chicken genome can be confirmed at the sequence level. It is shown herein that the chicken genome is also characterized by a mosaic structure of isochores, long DNA segments that are fairly homogeneous in the G + C content. Consequently, 25 isochores longer than 2 Mb (megabases) have been identified in the chicken genome. These isochores have a fairly homogeneous G + C content and often correspond to meaningful biological units. With the aid of the technique of cumulative GC profile, we proposed an intuitive picture to display the distribution of segmentation points. The relationships between G + C content and the distributions of genes (CpG islands, and other genomic elements) were analyzed in a perceivable manner. The cumulative GC profile, equipped with the new segmentation algorithm, would be an appropriate starting point for analyzing the isochore structures of higher eukaryotic genomes. 相似文献
2.
Oliver JL Carpena P Román-Roldán R Mata-Balaguer T Mejías-Romero A Hackenberg M Bernaola-Galván P 《Gene》2002,300(1-2):117-127
The human genome is a mosaic of isochores, which are long DNA segments (300 kbp) relatively homogeneous in G+C. Human isochores were first identified by density-gradient ultracentrifugation of bulk DNA, and differ in important features, e.g. genes are found predominantly in the GC-richest isochores. Here, we use a reliable segmentation method to partition the longest contigs in the human genome draft sequence into long homogeneous genome regions (LHGRs), thereby revealing the isochore structure of the human genome. The advantages of the isochore maps presented here are: (1) sequence heterogeneities at different scales are shown in the same plot; (2) pair-wise compositional differences between adjacent regions are all statistically significant; (3) isochore boundaries are accurately defined to single base pair resolution; and (4) both gradual and abrupt isochore boundaries are simultaneously revealed. Taking advantage of the wide sample of genome sequence analyzed, we investigate the correspondence between LHGRs and true human isochores revealed through DNA centrifugation. LHGRs show many of the typical isochore features, mainly size distribution, G+C range, and proportions of the isochore classes. The relative density of genes, Alu and long interspersed nuclear element repeats and the different types of single nucleotide polymorphisms on LHGRs also coincide with expectations in true isochores. Potential applications of isochore maps range from the improvement of gene-finding algorithms to the prediction of linkage disequilibrium levels in association studies between marker genes and complex traits. The coordinates for the LHGRs identified in all the contigs longer than 2 Mb in the human genome sequence are available at the online resource on isochore mapping: http://bioinfo2.ugr.es/isochores. 相似文献
3.
We report here investigations on the isochore pattern and the distribution of genes in the chromosomes of chicken. In spite of large differences in genome size and karyotype, the compositional properties and the gene distribution of the chicken genome are very similar to those recently published for the human genome, which is a good representative of most mammalian genomes. In fact, this similarity, which extends to the relative amounts and, also, to a large extent at least, to the average base composition of isochore families, is most interesting in view of the very large distance of mammals and birds for a common ancestor, which goes back to 310–340 million years ago. This raises important questions about genome evolution in vertebrates. 相似文献
4.
5.
Botcherby M 《Comparative and Functional Genomics》2002,3(4):319-324
The sequencing of the black 6 mouse (strain C57Bl/6) has reached an important juncture. The BAC fingerprint map is almost complete, the BACs have been endsequenced and a seven-fold coverage whole-genome shotgun has been assembled. Now the BAC-by-BAC sequencing phase is under way and in-depth comparative analysis can be carried out on regions that have been the subject of targeted sequencing. This paper reviews the progress so far and looks forward to the promises of finished sequence. 相似文献
6.
7.
Unusual DNA structures in the adenovirus genome 总被引:16,自引:0,他引:16
M W Kilpatrick A Torri D S Kang J A Engler R D Wells 《The Journal of biological chemistry》1986,261(24):11350-11354
More than 80% (approximately 29 kilobase pairs) of the adenovirus serotype 2 genome was surveyed for the presence of unusual DNA conformations. Seven recombinant DNAs containing the largest HindIII fragments of AD2 DNA were analyzed for the presence of negative supercoil-dependent S1 nuclease-sensitive sites. Four plasmids each contained a specific site of S1 nuclease sensitivity whereas the other three showed no reaction. Further investigation was focused on a plasmid containing one of the positively reacting fragments (fragment C) which contained the major late promoter at coordinate 16.4 on the genome; three serotypes (Ad2, Ad7, Ad12) were studied. Fine mapping studies revealed the S1-sensitive sites to be a small region (approximately 6 base pairs) located at the TATA box of the major late promoter in all three cases. Other determinations (supercoil relaxation, T7 gene 3 product sensitivity, bromoacetaldehyde reactivity, anomalous gel mobility, the influence of negative superhelical density on nuclease sensitivity) led to the conclusion that the B-helix deformation was not due to a previously recognized DNA conformation (left-handed Z-DNA, cruciform, bent DNA), but may be accounted for by the homopurine X homopyrimidine nature of this region. 相似文献
8.
9.
10.
Abstract The human genome is composed of large sequence segments with fairly homogeneous GC content, namely isochores, which have been linked to many important functions; biological implications of most isochore boundaries, however, remain elusive, partly due to the difficulty in determining these boundaries at high resolution. Using the segmentation algorithm based on the quadratic divergence, we re-determined all 79 boundaries of previously identified human isochores at single-nucleotide resolution, and then compared the boundary coordinates with other genome features. We found that 55.7% of isochore boundaries coincide with termini of repeat elements; 45.6% of isochore boundaries coincide with termini of highly conserved sequences based on alignment of 17 vertebrate genomes, i.e., the highly conserved genome sequence switches to a less or non-conserved one at the isochore boundary; some isochore boundaries coincide with abrupt change of CpG island distribution (note that one boundary can associate with more than one genome feature). In addition, sequences around isochore boundaries are highly conserved. It seems reasonable to deduce that the boundaries of all the isochores studied here would be replication timing sites in the human genome. These results suggest possible key roles of the isochore boundaries and may further our understanding of the human genome organization. 相似文献
11.
Mitochondrial DNA of Kinetoplastea is composed of different chromosomes, the maxicircle (bearing 'regular' genes) and numerous minicircles (specifying guide RNAs involved in RNA editing). In trypanosomes [Kinetoplastea], DNA circles are compacted into a single dense body, the kinetoplast. This report addresses the question whether multi-chromosome mitochondrial genomes and compacted chromosome organization are restricted to Kinetoplastea or rather occur throughout Euglenozoa, i.e., Kinetoplastea, Euglenida and Diplonemea. To this end, we investigated the diplonemid Rhynchopus euleeides and the euglenids Petalomonas cantuscygni, Peranema trichophorum and Entosiphon sulcatum, using light and electron microscopy and molecular techniques. Our findings together with previously published data show that multi-chromosome mitochondrial genomes prevail across Euglenozoa, while kinetoplast-like mtDNA packaging is confined to trypanosomes. 相似文献
12.
Zinc finger protein genes in the mouse genome 总被引:1,自引:0,他引:1
13.
Eiberger J Degen J Romualdi A Deutsch U Willecke K Söhl G 《Cell communication & adhesion》2001,8(4-6):163-165
Gap junctions serve for direct intercellular communication by docking of two hemichannels in adjacent cells thereby forming conduits between the cytoplasmic compartments of adjacent cells. Connexin genes code for subunit proteins of gap junction channels and are members of large gene families in mammals. So far, 17 connexin (Cx) genes have been described and characterized in the murine genome. For most of them, orthologues in the human genome have been found (see White and Paul 1999; Manthey et al. 1999; Teubner et al. 2001; S?hl et al. 2001). We have recently performed searches for connexin genes in murine and human gene libraries available at EMBL/Heidelberg, NCBI and the Celera company that have increased the number of identified connexins to 19 in mouse and 20 in humans. For one mouse connexin gene and two human connexin genes we did not find orthologues in the other genome. Here we present a short overview on distinct connexin genes which we found in the mouse and human genome and which may include all members of this gene family, if no further connexin gene will be discovered in the remaining non-sequenced parts (about 1-5%) of the genomes. 相似文献
14.
The natural populations of salmon-like catadromous fishes usually include several percents of residual (dwarf) forms, which are 10-15 times smaller than normal forms. A comparative investigation of normal and residual forms in two species: Oncorhynchus nerka and Salvelinus malma (Salmoniformes order) was made by means of DNA molecular hybridization technique. The essential differences in reassociation kinetics was detected in DNA from normal and residual forms of both species. The genome sizes (kinetic complexity) of normal and residual forms are approximately the same. But some families of repetitive nucleotide sequences are represented by considerably different amount of copies. Intrapopulational differences of genome structures of normal and residual forms in both species seem to be more rough in respect to reassociation kinetic than corresponding interspecies differences between malma and nerka. Comparative analyses and reassociation kinetics for long and shrt DNA fragments imply "xenopus type" of nucleotide sequence organization in both species which is common for the majority of animal and plant genomes investigated. 相似文献
15.
Chemical synthesis of the mouse mitochondrial genome 总被引:1,自引:0,他引:1
We describe a one-step, isothermal assembly method for synthesizing DNA molecules from overlapping oligonucleotides. The method cycles between in vitro recombination and amplification until the desired length is reached. As a demonstration of its simplicity and robustness, we synthesized the entire 16.3-kilobase mouse mitochondrial genome from 600 overlapping 60-mers. 相似文献
16.
the Joint NIH/DOE Mouse Working Group the National Center for Human Genome Research the 《Mammalian genome》1993,4(6):293-300
The National Center for Human Genome Research and the Department of Energy convened a committee of geneticists and biologists who use the laboratory mouse in their research programs. Their responsibility was to identify goals and guidelines for completing the genetic and physical maps of the mouse genome. The motivation for convening this group was to make certain that existing and anticipated research projects together represent a comprehensive program for addressing the Five Year Goals of the Human Genome Project. Three meetings were held: the first addressed the contributions that the mouse can make to the Human Genome Project; the second meeting reviewed the status of the genetic map, gene mapping research, and genome informatics; and the final meeting evaluated the status of the physical map and physical mapping research. The committee then prepared a report that reviewed the status of the mouse genome project and made recommendations concerning areas of research emphasis. The resulting Request For Applications entitled Mapping the Mouse Genome with Emphasis on Technology Development (RFA: HG92-002) is an important mechanism for coordinating mouse genome research and accomplishing the goals of the mouse genome project. Progress towards complete genetic and physical maps has been impressive. The genetic map should be completed on schedule, and ongoing physical mapping projects are promising. Given rapid progress on these maps, the Working Group proposed expanding the focus of the mouse genome effort to begin planning comprehensive approaches for characterizing the function of the large number of genes that will soon be mapped and eventually sequenced. Partly as a consequence of the Working Group's efforts, discussions have begun among members of the scientific community and National Institutes of Health (NIH) staff to plan comprehensive, efficient, and innovative approaches for studying gene function. The Working Group prepared a report summarizing the status of mouse genome research and recommending areas where effort and funding should be placed. Our report was submitted to and accepted by the NIH and Department of Energy (DOE) and is published here in its entirety.Verne M. Champman, Chair, Neal G. Copeland, Franklin D. Costantini, William F. Dove, Joseph H. Nadeau, Roger H. Reeves, Janet Rossant, Oliver Smithies, and Richard P. Woychik. 相似文献
17.
Analytical DNA ultracentrifugation revealed that eukaryotic genomes are mosaics of isochores: long DNA segments (>300 kb on average) relatively homogeneous in G+C. Important genome features are dependent on this isochore structure, e.g. genes are found predominantly in the GC-richest isochore classes. However, no reliable method is available to rigorously partition the genome sequence into relatively homogeneous regions of different composition, thereby revealing the isochore structure of chromosomes at the sequence level. Homogeneous regions are currently ascertained by plain statistics on moving windows of arbitrary length, or simply by eye on G+C plots. On the contrary, the entropic segmentation method is able to divide a DNA sequence into relatively homogeneous, statistically significant domains. An early version of this algorithm only produced domains having an average length far below the typical isochore size. Here we show that an improved segmentation method, specifically intended to determine the most statistically significant partition of the sequence at each scale, is able to identify the boundaries between long homogeneous genome regions displaying the typical features of isochores. The algorithm precisely locates classes II and III of the human major histocompatibility complex region, two well-characterized isochores at the sequence level, the boundary between them being the first isochore boundary experimentally characterized at the sequence level. The analysis is then extended to a collection of human large contigs. The relatively homogeneous regions we find show many of the features (G+C range, relative proportion of isochore classes, size distribution, and relationship with gene density) of the isochores identified through DNA centrifugation. Isochore chromosome maps, with many potential applications in genomics, are then drawn for all the completely sequenced eukaryotic genomes available. 相似文献
18.
Isochore patterns and gene distributions in fish genomes 总被引:2,自引:0,他引:2
The compositional approach developed in our laboratory many years ago revealed a large-scale compositional heterogeneity in vertebrate genomes, in which GC-rich and GC-poor regions, the isochores, were found to be characterized by high and low gene densities, respectively. Here we mapped isochores on fish chromosomes and assessed gene densities in isochore families. Because of the availability of sequence data, we have concentrated our investigations on four species, zebrafish (Brachydanio rerio), medaka (Oryzias latipes), stickleback (Gasterosteus aculeatus), and pufferfish (Tetraodon nigroviridis), which belong to four distant orders and cover almost the entire GC range of fish genomes. These investigations produced isochore maps that were drastically different not only from those of mammals (in that only two major isochore families were essentially present in each genome vs five in the human genome) but also from each other (in that different isochore families were represented in different genomes). Gene density distributions for these fish genomes were also obtained and shown to follow the expected increase with increasing isochore GC. Finally, we discovered a remarkable conservation of the average size of the isochores (which match replicon clusters in the case of human chromosomes) and of the average GC levels of isochore families in both fish and human genomes. Moreover, in each genome the GC-poorest isochore families comprised a group of "long isochores" (2-20 Mb in size), which were the lowest in GC and varied in size distribution and relative amount from one genome to the other. 相似文献
19.
Cheung J Wilson MD Zhang J Khaja R MacDonald JR Heng HH Koop BF Scherer SW 《Genome biology》2003,4(8):R47
Background
The high quality of the mouse genome draft sequence and its associated annotations are an invaluable biological resource. Identifying recent duplications in the mouse genome, especially in regions containing genes, may highlight important events in recent murine evolution. In addition, detecting recent sequence duplications can reveal potentially problematic regions of the genome assembly. We use BLAST-based computational heuristics to identify large (≥ 5 kb) and recent (≥ 90% sequence identity) segmental duplications in the mouse genome sequence. Here we present a database of recently duplicated regions of the mouse genome found in the mouse genome sequencing consortium (MGSC) February 2002 and February 2003 assemblies.Results
We determined that 33.6 Mb of 2,695 Mb (1.2%) of sequence from the February 2003 mouse genome sequence assembly is involved in recent segmental duplications, which is less than that observed in the human genome (around 3.5-5%). From this dataset, 8.9 Mb (26%) of the duplication content consisted of 'unmapped' chromosome sequence. Moreover, we suspect that an additional 18.5 Mb of sequence is involved in duplication artifacts arising from sequence misassignment errors in this genome assembly. By searching for genes that are located within these regions, we identified 675 genes that mapped to duplicated regions of the mouse genome. Sixteen of these genes appear to have been duplicated independently in the human genome. From our dataset we further characterized a 42 kb recent segmental duplication of Mater, a maternal-effect gene essential for embryogenesis in mice.Conclusion
Our results provide an initial analysis of the recently duplicated sequence and gene content of the mouse genome. Many of these duplicated loci, as well as regions identified to be involved in potential sequence misassignment errors, will require further mapping and sequencing to achieve accuracy. A Genome Browser database was set up to display the identified duplication content presented in this work. This data will also be relevant to the growing number of investigators who use the draft genome sequence for experimental design and analysis.20.
Mobilization of giant piggyBac transposons in the mouse genome 总被引:1,自引:0,他引:1
Li MA Turner DJ Ning Z Yusa K Liang Q Eckert S Rad L Fitzgerald TW Craig NL Bradley A 《Nucleic acids research》2011,39(22):e148
The development of technologies that allow the stable delivery of large genomic DNA fragments in mammalian systems is important for genetic studies as well as for applications in gene therapy. DNA transposons have emerged as flexible and efficient molecular vehicles to mediate stable cargo transfer. However, the ability to carry DNA fragments >10 kb is limited in most DNA transposons. Here, we show that the DNA transposon piggyBac can mobilize 100-kb DNA fragments in mouse embryonic stem (ES) cells, making it the only known transposon with such a large cargo capacity. The integrity of the cargo is maintained during transposition, the copy number can be controlled and the inserted giant transposons express the genomic cargo. Furthermore, these 100-kb transposons can also be excised from the genome without leaving a footprint. The development of piggyBac as a large cargo vector will facilitate a wider range of genetic and genomic applications. 相似文献