共查询到20条相似文献,搜索用时 78 毫秒
1.
2.
We present analysis of intronic sequences in the human DMD and UTRN genes. In both genes accumulation of repeated elements could account for intron expansion. Out-of-frame rod-domain exons have stronger splice sites and are separated by significantly longer introns as compared to in-frame exons. These features are unique for the two homologs and not shared by other spectrin superfamily genes. 相似文献
3.
We conducted a multi-genome analysis correlating protein domain organization with the exon-intron structure of genes in nine eukaryotic genomes. We observed a significant correlation between the borders of exons and domains on a genomic scale for both invertebrates and vertebrates. In addition, we found that the more complex organisms displayed consistently stronger exon-domain correlation, with substantially more significant correlations detected in vertebrates compared with invertebrates. Our observations concur with the principles of exon shuffling theory, including the prediction of predominantly symmetric phase of introns flanking the borders of correlating exons. These results suggest that extensive exon shuffling events during evolution significantly contributed to the shaping of eukaryotic proteomes. 相似文献
4.
5.
CpG islands, genes and isochores in the genomes of vertebrates 总被引:6,自引:0,他引:6
We have shown that human genes associated with CpG islands increase in number as they increase in % of guanine + cytosine (GC) levels, and that most genes associated with CpG islands are located in the GC-richest compartment of the human genome. This is an independent confirmation of the concentration gradient of CpG islands (detected as HpaII tiny fragments, or HTF) which was demonstrated in the genome of warm-blooded vertebrates [A?ssani and Bernardi, Gene 106 (1991) 173-183]. We then reassessed the location of CpG islands using the data currently available and confirmed that CpG islands are most frequently located in the 5'-flanking sequences of genes and that they overlap genes to variable extents. We have shown that such extents increase with the increasing GC levels of genes, the GC-richest genes being completely included in CpG islands. Under such circumstances, we have investigated the properties of the 'extragenic' CpG islands located in the 5'-flanking segments of homologous genes from both warm- and cold-blooded vertebrates. We have confirmed that, in cold-blooded vertebrates, CpG islands are often absent; when present, they have lower GC and CpG levels; the latter attain, however, statistically expected values. Finally, we have shown that CpG doublets increase with the increasing GC of exons, introns and intergenic sequences (including 'extragenic' CpG islands) in the genomes from both warm- and cold-blooded vertebrates. The correlations found are the same for both classes of vertebrates, and are similar for exons, introns and intergenic sequences (including 'extragenic' CpG islands). The findings just outlined indicate that the origin and evolution of CpG islands in the vertebrate genome are associated with compositional transitions (GC increases) in genes and isochores. 相似文献
6.
How the structure and base composition of genes changed with the evolution of vertebrates remains a puzzling question. Here we analyzed 895 orthologous protein-coding genes in six multicellular animals: human, chicken, zebrafish, sea squirt, fruit fly, and worm. Our analyses reveal that many gene regions, particularly intron and 3' UTR, gradually expanded throughout the evolution of vertebrates from their invertebrate ancestors, and that the number of exons per gene increased. Studies based on all protein-coding genes in each genome provide consistent results.We also find that GC-content increased in many gene regions (especially 5' UTR) in the evolution of endotherms, except in coding-exons.Analysis of individual genomes shows that 3′ UTR demonstrated stronger length and CC-content correlation with intron than 5' UTR, and gene with large intron in all six species demonstrated relatively similar GC-content. Our data indicates a great increase in complexity in vertebrate genes and we propose that the requirement for morphological and functional changes is probably the driving force behind the evolution of structure and base composition complexity in multicellular animal genes. 相似文献
7.
Vinogradov AE 《Nucleic acids research》2003,31(7):1838-1844
8.
L Kotula L D Laury-Kleintop L Showe K Sahr A J Linnenbach B Forget P J Curtis 《Genomics》1991,9(1):131-140
The human erythrocyte alpha-spectrin gene which spans 80 kbp has been cloned from human genomic DNA as overlapping lambda recombinants. The exon-intron junctions were identified and the exons mapped. The gene is encoded by 52 exons whose sizes range from 684 bp to the smallest of 18 bp. The donor and acceptor splice site sequences match the splice site consensus sequences, with the exception of one splice site where a donor sequence begins with -GC. The size and location of exons do not correlate with the 106-amino-acid repeat, except in three locations where the surrounding codons are conserved as well. The lack of correspondence between exons and 106-amino-acid repeat is interpreted to reflect the appearance of a spectrin-like gene from a minigene early in the evolution of eukaryotes. Since current evidence indicates that introns were present in genes before the divergence of prokaryotes and eukaryotes, it is possible that the original distribution of introns within the minigene has been lost by the random deletion of introns from the spectrin gene. 相似文献
9.
10.
11.
Ribosomal protein S3 (RPS3) is a multifunctional ribosomal protein: it is a structural and functional component of the ribosome, and also a DNA repair enzyme involved in the DNA base excision repair pathway. Here we cloned and characterized the genomic organization of the ribosomal protein S3 gene (RpS3) homolog in Drosophila virilis. We then compared gene structure and protein sequences of RpS3 from vertebrates, invertebrates, and plants. These comparisons revealed that RpS3 genes from plants to mammals have highly conserved coding and amino acid sequences, and also protein size. Further comparisons of the protein sequences show that important domains are well conserved in both localization and sequence. In contrast, comparison of gene size and organization reveals differing patterns and levels of conservation. Whereas invertebrate RpS3 genes are small in size and gene organization is variable (from zero to four introns), vertebrates have a considerably larger (but variable) gene size and a uniform gene organization. The larger gene size in vertebrates is due to increased number and expansion of introns. Although the plant RpS3 genes are relatively small ( approximately 1.8 kb), their organization resembles that seen in vertebrates. The high conservation through different phyla may suggest that RPS3 might be under great functional constraints, both in its capacity as a component of the ribosome and as a component of a DNA repair system. Finally, electrophoretic mobility shift assays indicate that an upstream element binds a nuclear protein(s). 相似文献
12.
We have used Fragmentation Sequencing logic to analyse the repetition structure of several large human genomic genes. The method, based on a proposed laboratory scheme for DNA sequencing, detects short sequences which are repeated near, but not necessarily adjacent, to each other (cryptically simple DNA). We find a low frequency of such repeats. There is a slight excess of such repeats in introns over exons, and a slight but significant excess in genomic DNA over random DNA, confirming that cryptically simple sequences are over-represented in the genome. The analysis suggests that Fragmentation Sequencing will be a suitable method for sequencing large mammalian genes. 相似文献
13.
DNA sequences of 56 human genes for which information on both exons and
introns was available were examined. The variance in G+C content among
genes is estimated and shown to be substantial. There is a high correlation
in G+C content between exons and introns within the same gene. The
dinucleotide frequencies of introns are similar to those of intergenic
spacer regions and are in reasonable agreement with predictions from
substitution rates estimated from pseudogenes, except that the observed
deficiency of TA doublets is not predicted. Duplicated bases also show a
frequency greater than the expectation under independence. There is marked
variability among genes in the frequency of the doublet CG relative to its
expectation under independence. This variation is evolutionarily conserved
and is correlated with the G+C content. Pseudogenes behave as if they are
in a low -G+C, CG-deficient part of the genome, although the genes from
which they arose are variable in these respects.
相似文献
14.
Functional and structural units in the chromomere 总被引:1,自引:0,他引:1
N. A. Reznik G. P. Yampol E. V. Kiseleva N. B. Khristolyubova A. D. Gruzdev 《Genetica》1991,83(3):293-299
Electron microscopic observations demonstrate the existence of several DNA packing levels in the chromomere. A linear DNA molecule forms a big (chromomere) loop anchored to the chromosomal scaffold. The loop forms a set of smaller loops in the rosette pattern. Packing of the DNA by the histone octamer particles results in nucleosomes and nucleomeres. To establish the possible correspondence between the structural units of a chromomere and the genetical units (genes, exons, introns) in it, we compared the lengths of the units. Statistical analysis of the 315 sequenced genes indicate that the average gene size corresponds to the average length of a rosette loop. It means that a chromomere contains one or more genes. Assuming that exon-intron boundaries cannot bind nucleosomes we constructed DNA-packing models of the 88 genes. They demonstrate that the first (in 77.8 per cent of the genes) and the last (in 52.7 per cent) exons of the genes are too short to bind nucleosomes. Many genes contain long (nucleosome binding) pieces of DNA. Long packed pieces are introns in vertebrates; they are exons in invertebrates and plants. The average size gene contains two nucleomeres. 相似文献
15.
Vinogradov AE 《Nucleic acids research》2005,33(2):559-563
The nucleosome formation potential of introns, intergenic spacers and exons of human genes is shown here to negatively correlate with among-tissues breadth of gene expression. The nucleosome formation potential is also found to negatively correlate with the GC content of genomic sequences; the slope of regression line is steeper in exons compared with noncoding DNA (introns and intergenic spacers). The correlation with GC content is independent of sequence length; in turn, the nucleosome formation potential of introns and intergenic spacers positively (albeit weakly) correlates with sequence length independently of GC content. These findings help explain the functional significance of the isochores (regions differing in GC content) in the human genome as a result of optimization of genomic structure for epigenetic complexity and support the notion that noncoding DNA is important for orderly chromatin condensation and chromatin-mediated suppression of tissue-specific genes. 相似文献
16.
17.
Vertebrate genomes are mosaics of isochores. On the assumption that marked differences exist in the isochore structure between warm-blooded and cold-blooded animals, variations among vertebrates were previously attributed to adaptation to homeothermy. However, based on the data of coding regions from representatives of extant vertebrates, including a turtle, a crocodile (Archosauromorpha) and a few kinds of snakes (Lepidosauromorpha), it was recently hypothesized that the common ancestors of mammals, birds and extant reptiles already had the "warm-blooded" isochore structure. To test this hypothesis, the nucleotide sequences of alpha-globin genes including non-coding regions (introns) from two snakes, N. kaouthia and E. climacophora, were determined (accession number: AB104824, AB104825). The correlation between the GC contents in the introns and exons of alpha-globin genes from snakes and those from other vertebrates supports the above hypothesis. Similar analysis using data for exons and introns of other genes obtained from the GenBank (Release 131) also support the above hypothesis. 相似文献
18.
Directed evolution of proteins by exon shuffling 总被引:18,自引:0,他引:18
Evolution of eukaryotes is mediated by sexual recombination of parental genomes. Crossovers occur in random, but homologous, positions at a frequency that depends on DNA length. As exons occupy only 1% of the human genome and introns about 24%, by far most of the crossovers occur between exons, rather than inside. The natural process of creating new combinations of exons by intronic recombination is called exon shuffling. Our group is developing in vitro formats for exon shuffling and applying these to the directed evolution of proteins. Based on the splice frame junctions, nine classes of exons and three classes of introns can be distinguished. Splice frame diagrams of natural genes show how the splice frame rules govern exon shuffling. Here, we review various approaches to constructing libraries of exon-shuffled genes. For example, exon shuffling of human pharmaceutical proteins can generate libraries in which all of the sequences are fully human, without the point mutations that raise concerns about immunogenicity. 相似文献
19.
Since base composition of translational stop codons (TAG, TAA, and TGA) is biased toward a low G+C content, a differential
density for these termination signals is expected in random DNA sequences of different base compositions. The expected length
of reading frames (DNA segments of sense codons flanked by in-phase stop codons) in random sequences is thus a function of
GC content. The analysis of DNA sequences from several genome databases stratified according to GC content reveals that the
longest coding sequences—exons in vertebrates and genes in prokaryotes—are GC-rich, while the shortest ones are GC-poor. Exon
lengthening in GC-rich vertebrate regions does not result, however, in longer vertebrate proteins, perhaps because of the
lower number of exons in the genes located in these regions. The effects on coding-sequence lengths constitute a new evolutionary
meaning for compositional variations in DNA GC content.
Correspondence to: J. L. Oliver 相似文献
20.
To review the general consideration about the different compositional structure of warm and cold-blooded vertebrates genomes,
we used of the increasing number of genetic sequences, including coding (exons) and non-coding (introns) regions, that have
been deposited on the databases throughout last years. The nucleotide distributions of the third codon positions (GC3) have
been analyzed in 1510 coding sequences (CDS) of fish, 1414 CDS of amphibians and 320 CDS of reptiles. Also, the relationship
between GC content of 74, 56 and 25 CDS of fish, amphibians and reptiles, respectively and that of their corresponding introns
(GCI) have been considerated. In accordance with recent data, sequence analysis showed the presence of very GC3-rich CDS in
these poikilotherm vertebrates. However, very high diversity in compositional patterns among different orders of fish, amphibians
and reptiles was found. Significant positive correlations between GC3 and GCI was also confirmed for the genes analyzed. Nevertheless,
introns resulted to be poorer in GC than their corresponding CDS, this difference being larger than in human genome. Because
the limited number of available sequences including exons and introns we must be cautious about the results derived from them.
However, the indicious of higher GC richness of coding sequences than of their corresponding introns could aid to understand
the discrepancy of sequence analysis with the ultracentrifugation studies in cold-blooded vertebrates that did not predict
the existence of GC-rich isochores. 相似文献