首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
人类基因组盒式外显子和内含子保留的可变剪接位点预测   总被引:2,自引:0,他引:2  
信使RNA的可变剪接是真核生物有别于原核生物的基本特征之一,信使RNA前体的可变剪接极大地丰富了高等真核生物蛋白质的多样性,并与生物体的组织特异性密切相关。文章对人类盒式外显子和内含子保留的一些基本特征进行了统计;根据剪接位点附近的单碱基、碱基二联体和三联体的保守性等特征,利用基于多样性指标的二次判别法,对盒式外显子和内含子保留的供体端和受体端可变剪接位点进行了预测。交叉检验结果表明,盒式外显子供体端和受体端的识别精度分别达到93%、84%以上的水平;内含子保留供体端和受体端的识别精度分别达到89%、81%以上的水平。  相似文献   

2.
The mechanism of cellular src (c-src) transduction by a transformation-defective deletion mutant, td109, of Rous sarcoma virus was studied by sequence analysis of the recombinational junctions in three td109-derived recovered sarcoma viruses (rASVs). Our results show that two rASVs have been generated by recombination between td109 and c-src at the region between exons 1 and 2 defined previously. Significant homology between td109 and c-src sequences was present at the sites of recombination. The viral and c-src sequence junction of the third rASV was formed by splicing a cryptic donor site at the 5' region of env of td109 to exon 1 of c-src. Various lengths of c-src internal intron 1 sequences were incorporated into all three rASV genomes, which resulted from activation of potential splice donor and acceptor sites. The incorporated intron 1 sequences were absent in the c-src mRNA, excluding its being the precursor for recombination with td109 and implying that initial recombinations most likely took place at the DNA level. A potential splice acceptor site within the incorporated intron 1 sequences in two rASVs was activated and was used for the src mRNA synthesis in infected cells. The normal env mRNA splice acceptor site was used for src mRNA synthesis for the third rASV.  相似文献   

3.
Application of learning techniques to splicing site recognition   总被引:2,自引:0,他引:2  
J Quinqueton  J Moreau 《Biochimie》1985,67(5):541-547
Most genes of eukaryotic genomes are disrupted by introns. The application of a learning technique which uses both statistic and syntactic analysis lead to the establishment of logical rules enabling the recognition of intron/exon junctions between uncoding and coding sequences. The rules were tested on rat actin gene sequences containing some or all of the introns and 50 exon nucleotides on either side of the intron. The results show good recognition of the excision site. This recognition is more ambiguous when the sequence is short; for the acceptor sequence it presents a good selection. The learning achieved with both the donor and acceptor sequence does not lead to recognition. This result indicates that it is not the relationship between donor and acceptor sites in the same intron which determines sequence selection or the splicing mechanism.  相似文献   

4.
5.
6.
《Mutation Research Letters》1994,323(4):159-165
The molecular analysis of mutations affecting mRNA processing may contribute to a better understanding of the splicing mechanism through the identification of genomic sequences necessary for the recognition of splice sites. In this paper we report the sequence analysis of 14 splice mutants induced by 4-nitroquinoline 1-oxide (4NQO) at the hamster hypoxanthine-guanine-phosphoribosyltransferase (hprt) locus. We show that mutations at the 3′ acceptor splice site or at the first or fifth base of the 5′ donor splice site are responsible for exon skipping. In addition, mutations in exon sequences also determine the skipping of one or more exons. Our data indicate that point mutations in intron regions at either side of an internal exon may induce the skipping of the same exon, supporting a model where the exon is the unit of early spliceosome assembly. Furthermore, they suggest that the splicing of hprt mRNA precursors may proceed through a clustering of exons 2, 3 and 4 which are then spliced in a concerted way.  相似文献   

7.
Mammalian dolichol-phosphate-mannose (DPM) synthase has three subunits, DPM1, DPM2, and DPM3. In this report, an analysis of the gene and cDNAs of hamster DPM2 is presented. The CHO DPM2 gene has two special features. First, the initiation codon ATG is separated from the remainder of the coding region by intron sequences. Second, within these intron sequences the DPM2 gene contains an adjacent 3' splice site (acceptor) and a 5' splice site (donor), suggestive of a deleted exon between the first and second codons. In fact, these sites overlap by four nucleotides (nt) of AGGT. Splicing intermediates using both of these alternative splice sites were observed. This latter feature appears unique and is particularly unusual considering the relatively small size of the gene (2.7 kb) and of introns a (123 bp) and b (152 bp).  相似文献   

8.
The plasmid vector pLIV11 is used commonly to achieve liver-specific expression of genes of interest in transgenic mice and rabbits. Expression is driven by the human apolipoprotein (apo)E 5′ proximal promoter, which includes 5 kb of upstream sequence, exon 1, intron 1, and 5 bp of exon 2. A 3.8 kb 3′ hepatic control region, derived from a region ∼18 kb downstream of the apoE gene, enhances liver-specific expression. Here, we report that cDNA sequences inserted into the multiple cloning site (MCS) of pLIV11, which is positioned just downstream of truncated exon 2, can cause exon 2 skipping. Hence, splicing is displaced to downstream cryptic 3′ splice acceptor sites causing deletion of cloned 5′ untranslated mRNA sequences and, in some cases, deletion of the 5′ end of an open reading frame. To prevent use of cryptic splice sites, the pLIV11 vector was modified with an engineered 3′ splice acceptor site inserted immediately downstream of truncated apoE exon 2. Presence of this sequence fully shifted splicing of exon 1 from the native intron 1–exon 2 splice acceptor site to the engineered site. This finding confirmed that sequences inserted into the MCS of the vector pLIV11 can affect exon 2 recognition and provides a strategy to protect cloned sequences from alternative splicing and possible attenuation of transgenic expression.  相似文献   

9.
Prediction of human mRNA donor and acceptor sites from the DNA sequence   总被引:40,自引:0,他引:40  
Artificial neural networks have been applied to the prediction of splice site location in human pre-mRNA. A joint prediction scheme where prediction of transition regions between introns and exons regulates a cutoff level for splice site assignment was able to predict splice site locations with confidence levels far better than previously reported in the literature. The problem of predicting donor and acceptor sites in human genes is hampered by the presence of numerous amounts of false positives: here, the distribution of these false splice sites is examined and linked to a possible scenario for the splicing mechanism in vivo. When the presented method detects 95% of the true donor and acceptor sites, it makes less than 0.1% false donor site assignments and less than 0.4% false acceptor site assignments. For the large data set used in this study, this means that on average there are one and a half false donor sites per true donor site and six false acceptor sites per true acceptor site. With the joint assignment method, more than a fifth of the true donor sites and around one fourth of the true acceptor sites could be detected without accompaniment of any false positive predictions. Highly confident splice sites could not be isolated with a widely used weight matrix method or by separate splice site networks. A complementary relation between the confidence levels of the coding/non-coding and the separate splice site networks was observed, with many weak splice sites having sharp transitions in the coding/non-coding signal and many stronger splice sites having more ill-defined transitions between coding and non-coding.  相似文献   

10.
Prediction of splice junctions in mRNA sequences.   总被引:8,自引:6,他引:2       下载免费PDF全文
K Nakata  M Kanehisa    C DeLisi 《Nucleic acids research》1985,13(14):5327-5340
A general method based on the statistical technique of discriminant analysis is developed to distinguish boundaries of coding and non-coding regions in nucleic acid sequences. In particular, the method is applied to the prediction of splicing sites in messenger RNA precursors. Information used for discrimination includes consensus sequence patterns around splice junctions, free energy of snRNA and mRNA base pairing, and statistical differences between coding and non-coding regions such as periodic appearance of specific bases in coding regions reflecting the non-random usage of degenerate codons. Given the reading frame of an exon (but not the exon/intron boundaries), the method will predict the following exon, namely, the intron to be excised out. When applied to human sequences in the GenBank database, the method correctly identified 80% of true splice junctions.  相似文献   

11.
It has been previously observed that the intrinsically weak variant GC donor sites, in order to be recognized by the U2-type spliceosome, possess strong consensus sequences maximized for base pair formation with U1 and U5/U6 snRNAs. However, variability in signal strength is a fundamental mechanism for splice site selection in alternative splicing. Here we report human alternative GC-AG introns (for the first time from any species), and show that while constitutive GC-AG introns do possess strong signals at their donor sites, a large subset of alternative GC-AG introns possess weak consensus sequences at their donor sites. Surprisingly, this subset of alternative isoforms shows strong consensus at acceptor exon positions 1 and 2. The improved consensus at the acceptor exon can facilitate a strong interaction with U5 snRNA, which tethers the two exons for ligation during the second step of splicing. Further, these isoforms nearly always possess alternative acceptor sites and exhibit particularly weak polypyrimidine tracts characteristic of AG-dependent introns. The acceptor exon nucleotides are part of the consensus required for the U2AF35-mediated recognition of AG in such introns. Such improved consensus at acceptor exons is not found in either normal or alternative GT-AG introns having weak donor sites or weak polypyrimidine tracts. The changes probably reflect mechanisms that allow GC-AG alternative intron isoforms to cope with two conflicting requirements, namely an apparent need for differential splice strength to direct the choice of alternative sites and a need for improved donor signals to compensate for the central mismatch base pair (C-A) in the RNA duplex of U1 snRNA and the pre-mRNA. The other important findings include (i) one in every twenty alternative introns is a GC-AG intron, and (ii) three of every five observed GC-AG introns are alternative isoforms.  相似文献   

12.
We have previously shown that the calcitonin (CT)-encoding exon 4 of the human calcitonin/calcitonin gene-related peptide I (CGRP-I) gene (CALC-I gene) is surrounded by suboptimal processing sites. At the 5' end of exon 4 a weak 3' splice site is present because of an unusual branch acceptor nucleotide (U) and a weak poly(A) site is present at the 3' end of exon 4. For CT-specific RNA processing two different exon enhancer elements, A and B, located within exon 4 are required. In this study we have investigated the cooperation of these elements in CT exon recognition and inclusion by transient transfection into 293 cells of CALC-I minigene constructs. Improvement of the strength of the 3' splice site in front of exon 4 by the branchpoint mutation U-->A reduces the requirement for the presence of exon enhancer elements within exon 4 for CT-specific RNA processing, irrespective of the length of exon 4. Replacement of the exon 4 poly(A) site with a 5' splice site does not result in CT exon recognition, unless also one or more exon enhancer elements and/or the branchpoint mutation U-->A in front of exon 4 are present. This indicates that terminal and internal exons are recognised in a similar fashion. The number of additional enhancing elements that are required for CT exon recognition depends on the strength of the 5' splice site. Deletion of a large part of intron 4 also leads to partial exon 4 skipping. All these different elements contribute to CT exon recognition and inclusion. The CT exon is recognised as a whole entity and the sum of the strengths of the different elements determines recognition as an exon. Curiously, in one of our constructs a 5' splice site at the end of exon 4 is either ignored by the splicing machinery of the cell or recognised as a splice donor or as a splice acceptor site.  相似文献   

13.
14.
完整基因结构的预测是当前生命科学研究的一个重要基础课题,其中一个关键环节是剪接位点和各种可变剪接事件的精确识别.基于转录组测序(RNA-seq)数据,识别剪接位点和可变剪接事件是近几年随着新一代测序技术发展起来的新技术策略和方法.本工作基于黑腹果蝇睾丸RNA-seq数据,使用TopHat软件成功识别出39718个果蝇剪接位点,其中有10584个新剪接位点.同时,基于剪接位点的不同组合,针对各类型可变剪接特征开发出计算识别算法,成功识别了8477个可变剪接事件(其中新识别的可变剪接事件3922个),包括可变供体位点、可变受体位点、内含子保留和外显子缺失4种类型.RT-PCR实验验证了2个果蝇基因上新识别的可变剪接事件,发现了全新的剪接异构体.进一步表明,RNA-seq数据可有效应用于识别剪接位点和可变剪接事件,为深入揭示剪接机制及可变剪接生物学功能提供新思路和新手段.  相似文献   

15.
We have generated several deletions within the intron of a yeast actin gene construct which have lead to different splicing efficiencies as measured by Northern blot (RNA blot) and primer extension analyses. Our data especially demonstrate that a minimum distance from the 5' splice site to the internal branch acceptor site is required for accurate and efficient splicing. In a construct in which splicing was completely abolished, splicing could be restored by expanding the distance from the 5' splice site to the internal branch acceptor site with heterologous sequences. Alternative splicing, i.e., exon skipping and the use of a cryptic 5' splice site, was observed when the mRNA precursor was derived from a tandem repeat of a truncated intron with flanking exon sequences.  相似文献   

16.
Structure and sequence of the human homeobox gene HOX7.   总被引:13,自引:0,他引:13  
A cosmid containing the human sequence HOX7, homologous to the murine Hox-7 gene, was isolated from a genomic library, and the positions of the coding sequences were determined by hybridization. DNA sequence analysis demonstrated two exons that code for a homeodomain-containing protein of 297 amino acids. The open reading frame is interrupted by a single intron of approximately 1.6 kb, the splice donor and acceptor sites of which conform to known consensus sequences. The human HOX7 coding sequence has a very high degree of identity with the murine Hox-7 cDNA. Within the homeobox, the two sequences share 94% identity at the DNA level, all substitutions being silent. This high level of sequence similarity is not confined to the homeodomain; overall the human and murine HOX7 gene products show 80% identity at the amino acid level. Both the 5' and 3' untranslated regions also show significant similarity to the murine gene, with 79 and 70% sequence identity, respectively. The sequence upstream of the coding sequence of exon 1 contains a GC-rich putative promoter region. There is no TATA box, but a CCAAT and numerous GC boxes are present. The region encompassing the promoter region, exon 1, and the 5' region of exon 2 have a higher than expected frequency of CpG dinucleotides; numerous sites for rare-cutter restriction enzymes are present, a characteristic of HTF islands.  相似文献   

17.
18.
Zhang L  Luo L 《Nucleic acids research》2003,31(21):6214-6220
Based on the conservation of nucleotides at splicing sites and the features of base composition and base correlation around these sites we use the method of increment of diversity combined with quadratic discriminant analysis (IDQD) to study the dependence structure of splicing sites and predict the exons/introns and their boundaries for four model genomes: Caenorhabditis elegans, Arabidopsis thaliana, Drosophila melanogaster and human. The comparison of compositional features between two sequences and the comparison of base dependencies at adjacent or non-adjacent positions of two sequences can be integrated automatically in the increment of diversity (ID). Eight feature variables around a potential splice site are defined in terms of ID. They are integrated in a single formal framework given by IDQD. In our calculations 7 (8) base region around the donor (acceptor) sites have been considered in studying the conservation of nucleotides and sequences of 48 bp on either side of splice sites have been used in studying the compositional and base-correlating features. The windows are enlarged to 16 (donor), 29 (acceptor) and 80 bp (either side) to improve the prediction for human splice sites. The prediction capability of the present method is comparable with the leading splice site detector—GeneSplicer.  相似文献   

19.
20.
人类基因组中可变和组成性剪接位点的预测   总被引:2,自引:0,他引:2  
根据剪接位点的核酸序列保守特征,以及邻近位点的碱基组成和关联特性,结合一对可变剪接位点之间的距离参数和受体端剪接位点前30位碱基的GC和TC含量,利用结合多样性指标的二次判别方法(IDQD),预测了人类基因组中可变和组成性内含子的供体端和受体端的剪接位点,对可变的供体端和受体端剪接位点,阈值ξ选择-2时,总的预测精度分别为87.9%和89.9%,对组成性的供体端和受体端剪接位点,阈值ξ选择-1,总的预测精度分别为92.8%和94.3%.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号