Similar literature
 20 similar documents found (search time: 15 ms)
1.
Prediction of gene sequences and their exon-intron structure in large eukaryotic genomic sequences is one of the central problems of mathematical biology. Solving this problem involves, in particular, high-accuracy splice site recognition. Using statistical analysis of a database of human gene fragments containing splice sites, some characteristic features were described for nucleotide sequences in the splice site neighborhood, the frequencies of all nucleotides and dinucleotides were determined, and those with frequencies increased or decreased in comparison to a random sequence were identified. The results can be used in sequence annotation, splice site prediction, and the recognition of the gene exon-intron structure.
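
The comparison of dinucleotide frequencies against a random sequence described in entry 1 can be illustrated with a minimal sketch. It is not the authors' method: the function name `dinucleotide_enrichment` is hypothetical, and the "random" expectation is assumed here to be the product of the two mononucleotide frequencies.

```python
from collections import Counter
from itertools import product

BASES = "ACGT"

def dinucleotide_enrichment(sequences):
    """Return {dinucleotide: observed / expected} over all input sequences.

    Assumption: the expected frequency of a dinucleotide XY in a random
    sequence is freq(X) * freq(Y). Ratios above 1 mean over-representation.
    """
    mono = Counter()
    di = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Count only unambiguous bases; skip N and other symbols.
        mono.update(c for c in seq if c in BASES)
        for i in range(len(seq) - 1):
            pair = seq[i:i + 2]
            if pair[0] in BASES and pair[1] in BASES:
                di[pair] += 1

    total_mono = sum(mono.values())
    total_di = sum(di.values())
    if total_mono == 0 or total_di == 0:
        return {}

    ratios = {}
    for a, b in product(BASES, repeat=2):
        expected = (mono[a] / total_mono) * (mono[b] / total_mono)
        observed = di[a + b] / total_di
        # A base that never occurs gives an undefined ratio.
        ratios[a + b] = observed / expected if expected > 0 else float("nan")
    return ratios
```

In practice the input would be fixed-length windows around annotated splice sites, and the ratios would be computed per position rather than pooled as they are here.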

2.
Xing Y, Wang Q, Lee C. Genetics, 2006, 173(3):1787-1791
The intronic sequences flanking exon-intron junctions (i.e., exon flanks) are important for splice site recognition and pre-mRNA splicing. Recent studies show a higher degree of sequence conservation at flanks of alternative exons, compared to flanks of constitutive exons. In this article we performed a detailed analysis on the evolutionary divergence of exon flanks between human and chimpanzee, aiming to dissect the impact of mutability and selection on their evolution. Inside exon flanks, sites that might reside in ancestral CpG dinucleotides evolved significantly faster than sites outside of ancestral CpG dinucleotides. This result reflects a systematic variation of mutation rates (mutability) at exon flanks, depending on the local CpG contexts. Remarkably, we observed a significant reduction of the nucleotide substitution rate in flanks of alternatively spliced exons, independent of the site-by-site variation in mutability due to different CpG contexts. Our data provide concrete evidence for increased purifying selection at exon flanks associated with regulation of alternative splicing.

3.
A Deirdre J Scadden, C W Smith. The EMBO Journal, 1995, 14(13):3236-3246
Nuclear pre-mRNA splicing has a fundamentally similar two-step mechanism to that employed by group II self-splicing introns. It is believed that nuclear pre-mRNA splicing involves a network of RNA-RNA interactions which form the catalytic core of the active spliceosome. We show here a non-Watson-Crick interaction between the first and last guanosine residues of a mammalian intron. As in Saccharomyces cerevisiae, substitution of the conserved guanosines at the 5' and 3' splice sites by A and C respectively, specifically suppresses step 2 splicing defects resulting from the individual mutations. No other combination of terminal nucleotides was able to restore splicing. We additionally provide independent evidence for an indirect interaction between other nucleotides of the consensus splice sites during step 2 of splicing. Substitution of the nucleotide in the +3 position of the 5' splice site affects competition between closely spaced AG dinucleotides at the 3' splice site, although the interaction is not via direct differential base pairing. Finally, we show that complete substitution of guanosine residues by inosine in a pre-mRNA has only a modest effect upon step 2 of splicing, although earlier spliceosome assembly steps are impaired. Predictions can thus be made about the precise configuration of the non-Watson-Crick interaction between the terminal residues.

4.
A conserved 3' splice site YAG is essential for the second step of pre-mRNA splicing but no trans-acting factor recognizing this sequence has been found. A direct, non-Watson-Crick interaction between the intron terminal nucleotides was suggested to affect YAG selection. The mechanism of YAG recognition was proposed to involve 5' to 3' scanning originating from the branchpoint or the polypyrimidine tract. We have constructed a yeast intron harbouring two closely spaced 3' splice sites. Preferential selection of a wild-type site over mutant ones indicated that the two sites are competing. For two identical sequences, the proximal site is selected. As previously observed, an A at the first intron nucleotide spliced most efficiently with a 3' splice site UAC. In this context, UAA or UAU were also more efficient 3' splice sites than UAG and competed more efficiently than the wild-type sequence with a 3' splice site UAC. We observed that a U at the first intron nucleotide is used for splicing in combination with 3' splice sites UAG, UAA or UAU. Our data indicate that the 3' splice site is not primarily selected through an interaction with the first intron nucleotide. Selection of the 3' splice site depends critically on its distance from the branchpoint but does not occur by a simple leaky scanning mechanism.

5.
Piva F, Principato G. Gene, 2007, 393(1-2):81-86
There is ample evidence that prediction of human splice sites can be refined by analyzing the nucleotides surrounding splice sites. This could mean that exon nucleotides near splice sites harbour information for the splicing process in addition to the coding information that specifies amino acids. We analyzed the correlations among the nucleotides lying at the end and at the beginning of all the consecutive human exons to seek relationships among the nucleotides. We divided the sequences according to the phase of interruption. Even though exon sequences are involved in the coding function, we found phase-dependent, specific correlations in the area of exon junctions. These regularities do not give rise to specific motifs, but rather to a phase-specific nucleotide context that could contribute to defining the splice site or help the splicing machinery join the exon ends. Results provide further evidence that accurate selection of human splice sites likely requires the contribution of exon regulatory sequences.

6.
7.
Single nucleotide changes to the sequence between two alternative 5' splice sites, separated by 25 nucleotides in a beta-globin gene derivative, caused substantial shifts in pre-mRNA splicing preferences, both in vivo and in vitro. An activating sequence for splicing was located. Models for the recognition by U1 small nuclear ribonucleoproteins (snRNPs) of competing 5' splice sites were tested by altering the distance separating the two sites. Use of the upstream splice site declined sharply when it was separated from the downstream (natural) site by distances of 40 nucleotides or more. This effect was reversed in vivo, but not in vitro, by altering the upstream sequence to that of a consensus 5' splice site sequence. When the sites were close, dilution of an extract used for splicing in vitro shifted preferences towards the downstream site. We conclude that the mechanism of selection depends on the distance between the potential splice sites and that, with close sites, steric interference between factors bound to both sites may impede splicing and affect splicing preferences.

8.

Background  

Alternative splicing (AS) is a key molecular process that endows biological functions with diversity and complexity. Generally, functional redundancy leads to the generation of new functions through relaxation of selective pressure in evolution, as exemplified by duplicated genes. It is also known that alternatively spliced exons (ASEs) are subject to relaxed selective pressure. Within consensus sequences at the splice junctions, the most conserved sites are dinucleotides at both ends of introns (splice dinucleotides). However, a small number of single nucleotide polymorphisms (SNPs) occur at splice dinucleotides. An intriguing question relating to the evolution of AS diversity is whether mutations at splice dinucleotides are maintained as polymorphisms and produce diversity in splice patterns within the human population. We therefore surveyed validated SNPs in the database dbSNP located at splice dinucleotides of all human genes that are defined by the H-Invitational Database.

9.
The accurate prediction of plant pre-mRNA splicing sites has been studied extensively, yet the rules for plant pre-mRNA splicing remain unknown. This study, based on confirmed sequence data, systematically analyzed all expressed genes on Arabidopsis thaliana chromosome IV to quantitatively explore the natural splicing rules. The results indicated that defining Arabidopsis thaliana pre-mRNA splicing sites required a combination of multiple factors, including (1) a relatively conserved consensus sequence at the splicing site; (2) the distribution pattern of individual nucleotides in the 50 nucleotides upstream and downstream of the splicing site; and (3) quantitative analysis of individual nucleotide distribution using the formulations derived in this study. Combining all these factors can bring the accuracy of Arabidopsis thaliana splicing site recognition above 99%. The results provide additional information for future research on plant pre-mRNA splicing.

10.
11.
SRSF2 (SC35) is a key player in the regulation of alternative splicing events and binds degenerate RNA sequences with similar affinity in the nanomolar range. We have determined the solution structure of the SRSF2 RRM bound to the 5'-UCCAGU-3' and 5'-UGGAGU-3' RNA, both identified as SRSF2 binding sites in the HIV-1 tat exon 2. RNA recognition is achieved through a novel sandwich-like structure with both termini forming a positively charged cavity to accommodate the four central nucleotides. To bind both RNA sequences equally well, SRSF2 forms a nearly identical network of intermolecular interactions by simply flipping the bases of the two consecutive C or G nucleotides into either anti or syn conformation. We validate this unusual mode of RNA recognition functionally by in vitro and in vivo splicing assays and propose a 5'-SSNG-3' (S=C/G) high-affinity binding consensus sequence for SRSF2. In conclusion, in addition to describing for the first time the RNA recognition mode of SRSF2, we provide the precise consensus sequence to identify new putative binding sites for this splicing factor.
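
The 5'-SSNG-3' (S=C/G) consensus proposed in entry 11 lends itself to a simple sequence scan. The sketch below is illustrative only: the function name `find_ssng_sites` is hypothetical, N is assumed to mean any of A, C, G, U, and a match marks only a putative site, not a confirmed SRSF2 binding site.

```python
import re

# S = C or G, N = any base. The lookahead lets overlapping matches be reported.
SSNG = re.compile(r"(?=([CG][CG][ACGU]G))")

def find_ssng_sites(rna):
    """Return (0-based start, matched tetramer) for every SSNG occurrence."""
    # Accept DNA-style input by mapping T to U (an assumption for convenience).
    rna = rna.upper().replace("T", "U")
    return [(m.start(), m.group(1)) for m in SSNG.finditer(rna)]

if __name__ == "__main__":
    for site in ("UCCAGU", "UGGAGU"):
        print(site, find_ssng_sites(site))
```

Both hexamers named in the abstract contain exactly one match, at the four central nucleotides (CCAG and GGAG).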

12.
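This paper describes a computer program system for nucleic acid sequence analysis implemented on a microcomputer (IBM PC). The system consists of three levels and 18 functional modules; its menus and interactive dialogue let users learn and use it quickly. The implementation uses data-structure techniques such as tree structures, last-in-first-out stacks, and sparse matrices, statistical methods such as the Bayes method, and a series of graph-theoretic methods including Kruskal's algorithm and Floyd's algorithm. The release of this software system is of some benefit to molecular biology research.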

13.
D Poncet, G Verdier, V M Nigon. Biochimie, 1983, 65(7):417-425
Available restriction endonucleases including CG dinucleotides in their target sequences (most of them being unable to cut the DNA when the cytosine of the CG sequence is methylated) have been used to map cloned DNA covering the human gamma-delta-beta globin gene cluster. Since the human DNA fragments were cloned in Escherichia coli, only the internal cytosine in the sequence CC(A/T)GG could be methylated. Thus, any recognized "CG enzyme" site can be detected, since these sites are unmethylated. Results show that frequencies of "CG enzyme" sites regularly decrease from the gamma-globin region to the beta-globin region, the latter being very poor in "CG enzyme" sites. The array of enzymes used here detects 4 times more CG sites than the classical MspI/HpaII system. Examination of previously sequenced parts of the gamma-delta-beta globin gene cluster shows that CG dinucleotides correspond to an average frequency of 1 out of 104 nucleotides in the gamma-globin region and 1 out of 138 nucleotides in the beta-globin region. In the gamma-globin region, 1 CG out of 4 or 5 may be detected by the enzymes used; the detected frequency is less than 1 out of 10 CG in the beta-region. Analysis of nucleotide environment around CG dinucleotides shows occurrence of local differences, the main sequences being CGG in the 5' side flanking the gamma genes and ACG in the corresponding area of the beta gene. The results presented introduce some new considerations about analysis of cytosine methylation which has been previously proposed as playing a role in the control of the activity of gamma, delta and beta genes respectively.

14.
The conformation of RNA sequences spanning five 3' splice sites and two 5' splice sites in adenovirus mRNA was probed by partial digestion with single-strand specific nucleases. Although cleavage of nucleotides near both 3' and 5' splice sites was observed, most striking was the preferential digestion of sequences near the 3' splice site. At each 3' splice site a region of very strong cleavage is observed at low concentrations of enzyme near the splice site consensus sequence or the upstream branch point consensus sequence. Additional sites of moderately strong cutting near the branch point consensus sequence were observed in those sequences where the splice site was the preferred target. Since recognition of the 3' splice site and branch site appears to be an early event in mRNA splicing, these observations may indicate that the local conformation of the splice site sequences plays a direct or indirect role in enhancing the accessibility of sequences important for splicing.

15.
Removal of introns by pre-mRNA splicing is fundamental to gene function in eukaryotes. However, understanding the mechanism by which exon-intron boundaries are defined remains a challenging endeavor. Published reports indicate that the recruitment of U1 snRNP at the 5′ss marked by GU dinucleotides defines the 5′ss and also facilitates 3′ss recognition through cross-exon interactions. However, exceptions to this rule exist, as U1 snRNP recruited away from the 5′ss retains the capability to define the splice site where the cleavage takes place. Independent reports employing exon 7 of Survival Motor Neuron (SMN) genes suggest a long-distance effect of U1 snRNP on splice site selection upon U1 snRNP recruitment at target sequences with or without GU dinucleotides. These findings underscore that sequences distinct from the 5′ss may also impact exon definition if U1 snRNP is recruited to them through partial complementarity with the U1 snRNA. In this review we discuss the expanded role of U1 snRNP in splice-site selection, which stems from the ability of U1 to be recruited at more sites than predicted solely on the basis of GU dinucleotides.

16.
Certain thalassemic human beta-globin pre-mRNAs carry mutations that generate aberrant splice sites and/or activate cryptic splice sites, providing a convenient and clinically relevant system to study splice site selection. Antisense 2'-O-methyl oligoribonucleotides were used to block a number of sequences in these pre-mRNAs and were tested for their ability to inhibit splicing in vitro or to affect the ratio between aberrantly and correctly spliced products. By this approach, it was found that (i) up to 19 nucleotides upstream from the branch point adenosine are involved in proper recognition and functioning of the branch point sequence; (ii) whereas at least 25 nucleotides of exon sequences at both 3' and 5' ends are required for splicing, this requirement does not extend past the 5' splice site sequence of the intron; and (iii) improving the 5' splice site of the internal exon to match the consensus sequence strongly decreases the accessibility of the upstream 3' splice site to antisense 2'-O-methyl oligoribonucleotides. This result most likely reflects changes in the strength of interactions near the 3' splice site in response to improvement of the 5' splice site and further supports the existence of communication between these sites across the exon.

17.
Nuclear pre-mRNA splicing necessitates specific recognition of the pre-mRNA splice sites. It is known that 5' splice site selection requires base pairing of U6 snRNA with intron positions 4-6. However, no factor recognizing the highly conserved 5' splice site GU has yet been identified. We have tested if the known U6 snRNA-pre-mRNA interaction could be extended to include the first intron nucleotides and the conserved 50GAG52 sequence of U6 snRNA. We observe that some combinations of 5' splice site and U6 snRNA mutations produce a specific synthetic block to the first splicing step. In addition, the U6-G52U allele can switch between two competing 5' splice sites harboring different nucleotides following the cleavage site. These results indicate that U6 snRNA position 52 interacts with the first nucleotide of the intron before 5' splice site cleavage. Some combinations of U6 snRNA and pre-mRNA mutations also blocked the second splicing step, suggesting a role for the corresponding nucleotides in a proofreading step before exon ligation. From studies in diverse organisms, various functions have been ascribed to the conserved U6 snRNA 47ACAGAG52 sequence. Our results suggest that these discrepancies might reflect variations between different experimental systems and point to an important conserved role of this sequence in the splicing reaction.

18.
Regulation of splicing in eukaryotes occurs through the coordinated action of multiple splicing factors. Exons and introns contain numerous putative binding sites for splicing regulatory proteins. Regulation of splicing is presumably achieved by the combinatorial output of the binding of splicing factors to the corresponding binding sites. Although putative regulatory sites often overlap, no extensive study has examined whether overlapping regulatory sequences provide yet another dimension to splicing regulation. Here we analyzed experimentally identified splicing regulatory sequences using a computational method based on the natural distribution of nucleotides and splicing regulatory sequences. We uncovered positive and negative interplay between overlapping regulatory sequences. Examination of these overlapping motifs revealed a unique spatial distribution, especially near splice donor sites of exons with weak splice donor sites. The positively selected overlapping splicing regulatory motifs were highly conserved among different species, implying functionality. Overall, these results suggest that overlap of two splicing regulatory binding sites is an evolutionarily conserved, widespread mechanism of splicing regulation. Finally, over-abundant motif overlaps were experimentally tested in a reporter minigene, revealing that an overlap may facilitate a mode of splicing that did not occur in the presence of only one of the two regulatory sequences that comprise it.

19.
The T-->G mutation at nucleotide 705 in the second intron of the beta-globin gene creates an aberrant 5' splice site and activates a 3' cryptic splice site upstream from the mutation. As a result, the IVS2-705 pre-mRNA is spliced via the aberrant splice sites leading to a deficiency of beta-globin mRNA and protein and to the genetic blood disorder thalassemia. We have shown previously that in cell culture models of thalassemia, aberrant splicing of beta-thalassemic IVS2-705 pre-mRNA was permanently corrected by a modified murine U7 snRNA that incorporated sequences antisense to the splice sites activated by the mutation. To explore the possibility of using other snRNAs as vectors for antisense sequences, U1 snRNA was modified in a similar manner. Replacement of the U1 9-nucleotide 5' splice site recognition sequence with nucleotides complementary to the aberrant 5' splice site failed to correct splicing of IVS2-705 pre-mRNA. In contrast, U1 snRNA targeted to the cryptic 3' splice site was effective. A hybrid with a modified U7 snRNA gene under the control of the U1 promoter and terminator sequences resulted in the highest levels of correction (up to 70%) in transiently and stably transfected target cells.

20.