首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 562 毫秒
1.
A database called eukaryotic intron database (EID) was developed based on the data from GenBank.Studies on the statistical characteristics of EID show that there were 103,848 genes,478,484 introns,and 582,332 exons,with an average of 4.61 introns and 5.61 exons per gene.Introns of 40-120 nt in length were abundant in the database.Results of the statistical analysis on the data from nine model species showed that in eukaryotes,higher species do not necessarily have more introns or exons in a gene than lower species.Furthermore,characteristics of EID,such as intron phase,distribution of different splice sites,and the relationship between genome size and intron proportion or intron density,have been studied.  相似文献   

2.
MOTIVATION: Intron sliding is the relocation of intron-exon boundaries over short distances and is often also referred to as intron slippage or intron migration or intron drift. We have generated a database containing discordant intron positions in homologous genes (MIDB--Mismatched Intron DataBase). Discordant intron positions are those that are either closely located in homologous genes (within a window of 10 nucleotides) or an intron position that is present in one gene but not in any of its homologs. The MIDB database aims at systematically collecting information about mismatched introns in the genes from GenBank and organizing it into a form useful for understanding the genomics and dynamics of introns thereby helping understand the evolution of genes. RESULTS: Intron displacement or sliding is critically important for explaining the present distribution of introns among orthologous and paralogous genes. MIDB allows examining of intron movements and allows mapping of intron positions from homologous proteins onto a single sequence. The database is of potential use for molecular biologists in general and for researchers who are interested in gene evolution and eukaryotic gene structure. Partial analysis of this database allowed us to identify a few putative cases of intron sliding. AVAILABILITY: http://intron.bic.nus.edu.sg/midb/midb.html  相似文献   

3.
The origin and evolution of intron-exon structures continue to be controversial topics. Two alternative theories, the ‘exon theory of genes’ and the ‘insertional theory of introns’, debate the presence or absence of introns in primordial genes. Both sides of the argument have focused on the positions of introns with respect to protein and gene structures. A new approach has emerged in the study of the evolution of intron-exon structures: a population analysis of genes. One example is the statistical analysis of intron phases — the position of introns within or between codons. This analysis detected a significant signal of exon shuffling in the DNA sequence database containing both ancient and modern exon sequences: intron phase correlations, that is, the association together within genes of introns of the same phase. The results of this analysis suggest that exon shuffling played an important role in the origin of both ancient and modern genes.  相似文献   

4.
A database called eukaryotic intron database (EID) was developed based on the data from GenBank. Studies on the statistical characteristics of EID show that there were 103, 848 genes, 478,484 introns, and 582,332 exons, with an average of 4.61 introns and 5.61 exons per gene. Introns of 40–120 nt in length were abundant in the database. Results of the statistical analysis on the data from nine model species showed that in eukaryotes, higher species do not necessarily have more introns or exons in a gene than lower species. Furthermore, characteristics of EID, such as intron phase, distribution of different splice sites, and the relationship between genome size and intron proportion or intron density, have been studied. __________ Translated from Acta Scientiarum Naturalium Universitatis Sunyatseni, 2005, 44(6): 79–82 [译自: 中山大学学报, 2005, 44(6): 79–82]  相似文献   

5.
How exon-intron structures of eukaryotic genes evolved under various evolutionary forces remains unknown. The phases of spliceosomal introns (the placement of introns with respect to reading frame) provide an opportunity to approach this question. When a large number of nuclear introns in protein-coding genes were analyzed, it was found that most introns were of phase 0, which keeps codons intact. We found that the phase distribution of spliceosomal introns is strongly correlated with the sequence conservation of splice signals in exons; the relatively underrepresented phase 2 introns are associated with the lowest conservation, the relatively overrepresented phase 0 introns display the highest conservation, and phase 1 introns are intermediate. Given the detrimental effect of mutations in exon sequences near splice sites as found in molecular experiments, the underrepresentation of phase 2 introns may be the result of deleterious-mutation-driven intron loss, suggesting a possible genetic mechanism for the evolution of intron-exon structures.  相似文献   

6.
7.
The Exon/Intron (ExInt) database incorporates information on the exon/intron structure of eukaryotic genes. Features in the database include: intron nucleotide sequence, amino acid sequence of the corresponding protein, position of the introns at the amino acid level and intron phase. From ExInt, we have also generated four additional databases each with ExInt entries containing predicted introns, introns experimentally defined, organelle introns or nuclear introns. ExInt is accessible through a retrieval system with pointers to GenBank. The database can be searched by keywords, locus name, NID, accession number or length of the protein. ExInt is freely accessible at http://intron.bic.nus.edu.sg/exint/exint.html  相似文献   

8.
Protein size is an important biochemical feature since longer proteins can harbor more domains and therefore can display more biological functionalities than shorter proteins. We found remarkable differences in protein length, exon structure, and domain count among different phylo-genetic lineages. While eukaryotic proteins have an average size of 472 amino acid residues (aa), average protein sizes in plant genomes are smaller than those of animals and fungi. Proteins unique to plants are ?81 aa shorter than plant proteins conserved among other eukaryotic lineages. The smaller average size of plant proteins could neither be explained by endosymbiosis nor subcellular compartmentation nor exon size, but rather due to exon number. Metazoan proteins are encoded on average by ?10 exons of small size [?176 nucleotides (nt)]. Streptophyta have on average only ?5.7 exons of medium size (?230 nt). Multicellular species code for large proteins by increasing the exon number, while most unicellular organisms employ rather larger exons (>400 nt). Among sub-cellular compartments, membrane proteins are the largest (?520 aa), whereas the smallest proteins correspond to the gene ontology group of ribosome (?240 aa). Plant genes are encoded by half the number of exons and also contain fewer domains than animal proteins on average. Interestingly, endosymbiotic proteins that migrated to the plant nucleus became larger than their cyanobacterial orthologs. We thus conclude that plants have proteins larger than bacteria but smaller than animals or fungi. Compared to the average of eukaryotic species, plants have ?34%more but ?20%smal-ler proteins. This suggests that photosynthetic organisms are unique and deserve therefore special attention with regard to the evolutionary forces acting on their genomes and proteomes.  相似文献   

9.
10.
The Caenorhabditis elegans and Artemia T4 globin sequences are highly homologous with other invertebrate globins. The intron/exon patterns of their genes display a single intron in the E and G helices respectively. Precoding introns in multirepeat globins are inserted in homologous positions. Comparison of the intron/exon patterns in the known globin gene sequences demonstrates that they are more diverse than first expected but nevertheless can be derived from an ancestral pattern having 3 introns and 4 exons.  相似文献   

11.
The human genome is revisited using exon and intron distribution profiles. The 26,564 annotated genes in the human genome (build October, 2003) contain 233,785 exons and 207,344 introns. On average, there are 8.8 exons and 7.8 introns per gene. About 80% of the exons on each chromosome are < 200 bp in length. < 0.01% of the introns are < 20 bp in length and < 10% of introns are more than 11,000 bp in length. These results suggest constraints on the splicing machinery to splice out very long or very short introns and provide insight to optimal intron length selection. Interestingly, the total length in introns and intergenic DNA on each chromosome is significantly correlated to the determined chromosome size with a coefficient of correlation r = 0.95 and r = 0.97, respectively. These results suggest their implication in genome design.  相似文献   

12.
The RPL10A gene encodes the RPL10 protein, required for joining 40S and 60S subunits into a functional 80S ribosome. This highly conserved gene, ubiquitous across all eukaryotic super-groups, is characterized by a variable number of spliceosomal introns, present in most organisms. These properties facilitate the recognition of orthologs among distant taxa and thus comparative studies of sequences as well as the distribution and properties of introns in taxonomically distant groups of eukaryotes. The present study examined the multiple ways in which RPL10A conservation vs. sequence changes in the gene over the course of evolution, including in exons, introns, and the encoded proteins, can be exploited for evolutionary analysis at different taxonomic levels. At least 25 different positions harboring introns within the RPL10A gene were determined in different taxa, including animals, plants, fungi, and alveolates. Generally, intron positions were found to be well conserved even across different kingdoms. However, certain introns seemed to be restricted to specific groups of organisms. Analyses of several properties of introns, including insertion site, phase, and length, along with exon and intron GC content and exon–intron boundaries, suggested biases within different groups of organisms. The use of a standard primer pair to analyze a portion of the intron-containing RPL10A gene in 12 genera of green algae within Chlorophyta is presented as a case study for evolutionary analyses of introns at intermediate and low taxonomic levels. Our study shows that phylogenetic reconstructions at different depths can be achieved using RPL10A nucleotide sequences from both exons and introns as well as the amino acid sequences of the encoded protein.  相似文献   

13.
14.
Although mobility of the phylogenetically widespread group I introns appears to be mechanistically similar, the phage T4 intron-encoded endonucleases that promote mobility of the td and sunY introns are different from their eukaryotic counterparts. Most notably, they cleave at a distance from the intron insertion sites. The td enzyme was shown to cleave 23-26 nt 5' and the sunY endonuclease 13-15 nt 3' to the intron insertion site to generate 3-nt or 2-nt 3'-OH extensions, respectively. The absolute coconversion of exon markers between the distant cleavage and insertion sites is consistent with the double-strand-break repair model for intron mobility. As a further critical test of the model we have demonstrated that the mobility event is independent of DNA sequences that encode the catalytic intron core structure. Thus, in derivatives in which the lacZ or kanR coding sequences replace the intron, these marker genes are efficiently inserted into intron-minus alleles when the cognate endonuclease is provided in trans. The process is therefore endonuclease-dependent, rather than dependent on the intron per se. These findings, which imply that the endonucleases rather than the introns themselves were the primordial mobile elements, are incorporated into a model for the evolution of mobile introns.  相似文献   

15.
16.
17.
Selective constraints on intron evolution in Drosophila   总被引:5,自引:0,他引:5  
Parsch J 《Genetics》2003,165(4):1843-1851
Intron sizes show an asymmetrical distribution in a number of organisms, with a large number of "short" introns clustered around a minimal intron length and a much broader distribution of longer introns. In Drosophila melanogaster, the short intron class is centered around 61 bp. The narrow length distribution suggests that natural selection may play a role in maintaining intron size. A comparison of 15 orthologous introns among species of the D. melanogaster subgroup indicates that, in general, short introns are not under greater DNA sequence or length constraints than long introns. There is a bias toward deletions in all introns (deletion/insertion ratio is 1.66), and the vast majority of indels are of short length (<10 bp). Indels occurring on the internal branches of the phylogenetic tree are significantly longer than those occurring on the terminal branches. These results are consistent with a compensatory model of intron length evolution in which slightly deleterious short deletions are frequently fixed within species by genetic drift, and relatively rare larger insertions that restore intron length are fixed by positive selection. A comparison of paralogous introns shared among duplicated genes suggests that length constraints differ between introns within the same gene. The janusA, janusB, and ocnus genes share two short introns derived from a common ancestor. The first of these introns shows significantly fewer indels than the second intron, although the two introns show a comparable number of substitutions. This indicates that intron-specific selective constraints have been maintained following gene duplication, which preceded the divergence of the D. melanogaster species subgroup.  相似文献   

18.
Advances in the Exon-Intron Database (EID)   总被引:3,自引:0,他引:3  
  相似文献   

19.
A comparison of the nucleotide sequences around the splice junctions that flank old (shared by two or more major lineages of eukaryotes) and new (lineage-specific) introns in eukaryotic genes reveals substantial differences in the distribution of information between introns and exons. Old introns have a lower information content in the exon regions adjacent to the splice sites than new introns but have a corresponding higher information content in the intron itself. This suggests that introns insert into nonrandom (proto-splice) sites but, during the evolution of an intron after insertion, the splice signal shifts from the flanking exon regions to the ends of the intron itself. Accumulation of information inside the intron during evolution suggests that new introns largely emerge de novo rather than through propagation and migration of old introns.  相似文献   

20.
IE-Kb: intron exon knowledge base   总被引:2,自引:0,他引:2  
SUMMARY: IE-Kb (Intron Exon-Knowledge base) illustrates the intron-exon dynamics in eukaryotic genes. We have developed three different knowledge sets, namely 'Non-redundant ExInt', 'Non-redundant Pfam-ExInt complement' and 'Non-redundant GenBank eukaryotic subdivisional sets' to understand this phenomenon. Statistical analysis is performed on each knowledge set and the results are made available online. The entries in knowledge sets are ranked based on their intron length, exon length and protein length with relational hyper-links to the corresponding intron phase, intron position, intron sequence, gene definition and parent GenBank entry.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号