Similar literature
 Found 19 similar articles; search time 125 ms
1.
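Selection mechanism of intron 3' splice sites in Drosophila   Total citations: 5, self-citations: 1, citations by others: 4
From 722 Drosophila genes, 324 introns were selected to study how 3' splice sites are chosen. The results indicate that Smith's scanning selection mechanism is more reasonably stated as "the 3' splice site is the dominant AG downstream of the branch point and the polypyrimidine tract in a consistent reading frame". The implications of the ORF constraint for biological operation are also discussed.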

2.
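Predicting complete gene structure is a fundamental problem in current life science research, and a key step is the accurate identification of splice sites and alternative splicing events. Identifying splice sites and alternative splicing events from transcriptome sequencing (RNA-seq) data is a strategy that has emerged in recent years with the development of next-generation sequencing. Using RNA-seq data from Drosophila melanogaster testis, this work identified 39,718 Drosophila splice sites with TopHat, of which 10,584 were novel. In addition, based on different combinations of splice sites, computational algorithms were developed for each type of alternative splicing, identifying 8,477 alternative splicing events (3,922 of them newly identified) of four types: alternative donor sites, alternative acceptor sites, intron retention, and exon skipping. RT-PCR experiments validated newly identified alternative splicing events in 2 Drosophila genes and revealed entirely new splice isoforms. The results further show that RNA-seq data can be used effectively to identify splice sites and alternative splicing events, providing new ideas and tools for elucidating splicing mechanisms and the biological functions of alternative splicing.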
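The abstract above does not give its algorithms, so the following is only a minimal sketch of how event types might be derived from combinations of splice sites. It assumes plus-strand junctions given as hypothetical `(donor, acceptor)` intron coordinates; the function name `classify_junctions` and the event labels are illustrative and are not taken from the paper or from TopHat. Intron retention is left out because it needs read coverage, not just junctions.

```python
from collections import defaultdict


def classify_junctions(junctions):
    """Group plus-strand junctions (donor, acceptor) into candidate event types.

    Illustrative only: a skipped exon also shows up in the shared-donor and
    shared-acceptor groups, so a real pipeline would need exon annotation
    to disambiguate overlapping calls.
    """
    junctions = sorted(set(junctions))
    by_donor = defaultdict(set)      # donor position -> acceptors joined to it
    by_acceptor = defaultdict(set)   # acceptor position -> donors joined to it
    for donor, acceptor in junctions:
        by_donor[donor].add(acceptor)
        by_acceptor[acceptor].add(donor)

    events = {"alt_acceptor": [], "alt_donor": [], "exon_skipping": []}

    # One donor paired with several acceptors: candidate alternative acceptor.
    for donor, acceptors in sorted(by_donor.items()):
        if len(acceptors) > 1:
            events["alt_acceptor"].append((donor, sorted(acceptors)))

    # One acceptor paired with several donors: candidate alternative donor.
    for acceptor, donors in sorted(by_acceptor.items()):
        if len(donors) > 1:
            events["alt_donor"].append((acceptor, sorted(donors)))

    # A long junction spanning two shorter ones that flank an internal exon.
    for donor, acceptor in junctions:
        for inner_acceptor in sorted(by_donor[donor]):
            for inner_donor in sorted(by_acceptor[acceptor]):
                if donor < inner_acceptor < inner_donor < acceptor:
                    events["exon_skipping"].append(
                        (donor, inner_acceptor, inner_donor, acceptor)
                    )
    return events


example = [(100, 200), (100, 220), (300, 400), (320, 400),
           (500, 600), (700, 800), (500, 800)]
events = classify_junctions(example)
```

Keying the junctions by donor and by acceptor keeps each check a simple lookup; the cost is that the three categories are not mutually exclusive in this sketch.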

3.
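Objective: To computationally identify new non-canonical splice sites in Drosophila in order to explore unknown splicing mechanisms. Methods: Gene structures were reconstructed from alignments of Drosophila melanogaster expressed sequence tags (ESTs) to the genome sequence, non-canonical splice sites were identified from these structures, and the sequences upstream and downstream of the non-canonical sites were analyzed with Weblogo to look for splicing-related specific elements. Results: A total of 265 non-canonical splice sites were obtained, located in 195 protein-coding genes. Conclusion: Bioinformatics methods identified more than a hundred non-canonical splice sites in Drosophila, laying a foundation for the study of non-canonical splicing mechanisms.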
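As a rough illustration of the screening step described above, the sketch below checks the dinucleotides at the two ends of each intron and reports those that do not match a chosen canonical set. It assumes plus-strand introns given as 0-based, half-open `(start, end)` coordinates, and it treats only GT-AG as canonical by default. The abstract does not state its own definition, and `find_noncanonical` is a hypothetical name.

```python
def intron_boundaries(genome, start, end):
    """Return the first and last two bases of the intron genome[start:end]."""
    return genome[start:start + 2].upper(), genome[end - 2:end].upper()


def find_noncanonical(genome, introns, canonical=frozenset({("GT", "AG")})):
    """List introns whose boundary dinucleotides are not in `canonical`.

    `canonical` is an assumption: pass a larger set (for example one that
    also contains ("GC", "AG")) to change what counts as non-canonical.
    """
    hits = []
    for start, end in introns:
        five_prime, three_prime = intron_boundaries(genome, start, end)
        if (five_prime, three_prime) not in canonical:
            hits.append((start, end, five_prime, three_prime))
    return hits


# Toy sequence: exon, GT...AG intron, exon, GC...AG intron, exon.
genome = "AAA" + "GTCCCCAG" + "TTT" + "GCAAAAAG" + "CC"
hits = find_noncanonical(genome, [(3, 11), (14, 22)])
```

In this toy input only the second intron is reported, because its 5' end is GC and not GT.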

4.
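A Bayesian network modeling approach was used to predict splice sites in eukaryotic DNA sequences. Separate models were built for donor sites and acceptor sites, and the network topology and the choice of upstream and downstream nodes were optimized according to the biological characteristics of the two site types. After the model parameters were estimated with the maximum-likelihood learning algorithm for Bayesian networks, splice sites in the test data were predicted using 10-fold cross-validation. The average prediction accuracy was 92.5% for acceptor sites, 94.0% for pseudo-acceptor sites, 92.3% for donor sites, and 93.5% for pseudo-donor sites, which is better overall than prediction methods based on independent and conditional probability matrices or on hidden Markov models. This shows that Bayesian network modeling is an effective means of predicting splice sites.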

5.
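Molecular mechanisms of alternative mRNA splicing   Total citations: 5, self-citations: 0, citations by others: 5
Zhang Guowei, Song Huaidong, Chen Zhu. Acta Genetica Sinica (遗传学报), 2004, 31(1): 102-107
In eukaryotic cells, pre-mRNA is spliced into mature mRNA, and alternative splicing of pre-mRNA greatly increases protein diversity and the complexity of gene expression. Splice sites can be recognized by a mechanism that spans the intron (intron definition) or by one that spans the exon (exon definition). Alternative splicing takes several forms: selection of different splice sites, selection of different splicing ends, different combinations of exons, and retention or removal of introns. The process is regulated by many cis-elements and trans-acting factors and is closely linked to the basic splicing process; some splicing factors in the spliceosome also take part in regulating alternative splicing. Alternative splicing is also a co-transcriptional process, and different promoters can direct the production of different splicing products. The mechanisms of alternative mRNA splicing are diverse, and RNA editing and trans-splicing have also been found to participate in alternative splicing.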

6.
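To improve the accuracy of splice site recognition in untranslated regions, a method combining statistical probability with a support vector machine is proposed. The method has two stages. In the first stage, statistical methods are used to describe untranslated region (UTR) sequences: features such as correlations between bases, position specificity, and conservation are expressed as probabilities, and these probability parameters serve as the input vectors for the second stage. In the second stage, a support vector machine (SVM) with a polynomial kernel is used to recognize splice sites. Tests on a human 5′UTR splice site data set show that the method performs well in recognizing splice sites in untranslated regions.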

7.
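Splice site recognition based on support vector machines (SVM)   Total citations: 14, self-citations: 1, citations by others: 13
As an important step in gene identification, splice site recognition has long attracted researchers' attention. Because sequences around splice sites are conserved, several methods based on statistical properties have been applied to splice site recognition, but their performance still needs improvement. Support vector machines (Support Vector Machines), a new kind of learning machine based on statistical learning theory, have developed considerably in recent years and have been applied to many pattern recognition problems. This paper applies them to splice site recognition. Because false splice sites far outnumber true sites among sequence samples that satisfy the GT-AG rule, it proposes an SVM-based balanced minimum-selection method (平衡取小法) to obtain better recognition. Experimental results show that SVM-based splice site recognition extracts the statistical features of conserved sequences near the sites more effectively, generalizes better to the test set, and is simpler to use. This result offers a new method for splice site recognition and new leads for recognizing structures and sites in biological macromolecule research.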

8.
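Predicting mRNA splice sites with neural networks   Total citations: 3, self-citations: 2, citations by others: 3
Neural networks were used to predict mRNA splice sites. Learning and prediction were compared under various conditions, and the effective prediction success rate, which reflects how well true splice sites are predicted, was discussed: it can reach 64%, while the overall prediction success rate can reach 98%. The correlation coefficient of the prediction was 0.66.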

9.
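Recognition of splice sites in human 5' untranslated regions based on support vector machines   Total citations: 5, self-citations: 0, citations by others: 5
Recognizing splice sites in non-coding regions of genes is a very challenging problem in gene identification, especially in the 5' untranslated region. Unlike ordinary splice sites, splice sites in the 5' untranslated region are not flanked by a transition from coding to non-coding state, so conventional splice site recognition algorithms perform poorly in untranslated regions. This paper uses a support vector machine approach to recognize splice sites in the 5' untranslated region. To improve accuracy, kernel parameters are selected with a method based on a matrix similarity measure, which determines suitable kernel parameters simply and quickly and thereby improves recognition performance. Experiments show that the support vector machine with selected parameters recognizes splice sites in the 5' untranslated region well.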

10.
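Analysis of the secondary structures of 68 exon-intron-exon sequence fragments and the corresponding exon-exon fragments showed that about 90% of the G bases at the 5' and 3' ends of introns (the splice sites) lie in loop regions of the secondary structure or at the ends of stem regions close to a loop, and that G bases located in loops also tend to lie near the base of the loop. 92% of exon joining sites have similar properties, and about 82% of branch-point A residues lie in loop regions or at loop-stem junctions. Formation of the folded structure brings the splice sites and the branch point close to each other in space.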

11.
F Wang, L Petti, D Braun, S Seung, E Kieff. Journal of Virology, 1987, 61(4): 945-954
EBNA2 is a nuclear protein expressed in all cells latently infected with and growth transformed by Epstein-Barr virus (EBV) infection (K. Hennessy and E. Kieff, Science 227:1230-1240, 1985). The nucleotide sequence of the EBNA2 mRNA (J. Sample, M. Hummel, D. Braun, M. Birkenbach, and E. Kieff, Proc. Natl. Acad. Sci. USA 83:5096-5100, 1986) revealed that it begins with a 924-base open reading frame that has an unusual potential translational initiation site (CAAATGG). This open reading frame is followed by 138 nucleotides with only one highly unlikely translational initiation site (TACATGC), which would translate a pentapeptide before the next stop codon. The last part of the mRNA is the open reading frame which encodes EBNA2. In this paper, we demonstrate that the 924-base open reading frame translates a 40-kilodalton protein in vitro or in murine cells transfected with the EBNA2 cDNA under control of the murine leukemia virus long terminal repeat. A protein of identical size was detected in EBV-transformed, latently infected human lymphocyte nuclei by using antibody specific for the leader open reading frame expressed in bacteria. Therefore, this is a rare example of an mRNA which translates two proteins from nonoverlapping open reading frames. Since the protein encoded by the leader of the EBNA mRNA is expressed in all nuclei of a latently infected cell line, it was designated EBNA-LP. EBNA-LP localizes to small intranuclear particles and differs in this respect from EBNA1, EBNA2, or EBNA3. EBNA-LP is not expressed in an EBV-transformed marmoset lymphocyte cell (B95-8) or in one EBV-infected Burkitt tumor cell line (Raji) but is expressed in three other Burkitt tumor cell lines (Namalwa, P3HR-1, and Daudi).

12.
13.
14.
A small open reading frame, comS of the srf operon, is the site of mutations that impair competence development in Bacillus subtilis. comS open reading frame translation was required for competence, as was confirmed by the suppression of a comS amber mutation [comS(Am)] by the nonsense suppressor sup-3. comS(Am), when introduced into the srf operon, eliminated late competence gene expression but had no significant effect on surfactin production.

15.
The citrate utilization determinant from a large 200-kilobase (kb) naturally occurring plasmid was previously cloned into the PstI site of plasmid vector pBR325 creating the Cit+ tetracycline resistance plasmid pWR61 (15 kb). Tn5 insertion mutagenesis analysis of plasmid pWR61 limited the segment responsible for citrate utilization to a 4.8-kb region bordered by EcoRI and PstI restriction nuclease sites. The 4.8-kb fragment was cloned into phage M13, and the DNA sequence was determined by the dideoxyribonucleotide method. Within this sequence was a 1,296-base-pair open reading frame with a preceding ribosomal binding site. The 431-amino-acid polypeptide that could be translated from this open reading frame would be highly hydrophobic. A second long open reading frame with the potential of encoding a 379-amino-acid polypeptide preceded the larger open reading frame. Portions of the 4.8-kb fragment were further subcloned with restriction endonucleases BglII and BamHI, reducing the minimum size needed for a citrate-positive phenotype to a 1.9-kb BamHI-BglII fragment (which includes the coding region for the 431-amino-acid polypeptide, but only the distal 2/3 of the reading frame for the 379-amino-acid polypeptide). Citrate utilization results from a citrate transport activity encoded by the plasmid. With the 4.8-kb fragment (as with larger fragments) the citrate transport activity was inducible by growth on citrate. On transfer from glucose, succinate, malate, or glycerol medium to citrate medium, the Cit+ Escherichia coli strains showed a delay of 36 to 48 h before growth.
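The relation between the 1,296-base-pair open reading frame and the 431-amino-acid polypeptide in the abstract above can be checked with a short calculation. The sketch assumes that the quoted length includes the stop codon, which the abstract does not say explicitly; `residues_from_orf` is a hypothetical helper name.

```python
def residues_from_orf(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of `orf_bp` base pairs.

    Assumption: when `includes_stop` is True, the final codon is a stop
    codon and contributes no residue.
    """
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


# 1,296 bp is 432 codons; dropping the stop codon leaves 431 residues.
citrate_orf_residues = residues_from_orf(1296)
```

Under this assumption the two figures in the abstract agree: 432 codons minus one stop codon gives 431 residues.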

16.
17.
Acetylcholinesterase exists predominantly as a secreted enzyme which remains cell-associated at specific extracellular locations. Its extensive structural diversity appears responsible for the unique cellular disposition of the enzyme. To examine the molecular basis of the structural divergence of acetylcholinesterase species, we hybridized total RNA from Torpedo californica electric organ with restriction fragments from a cDNA encoding the catalytic subunits of asymmetric species of acetylcholinesterase. Multiple RNA species up to 14 kilobases in length can be detected on Northern blots using a full-length cDNA for hybridization. Each of these RNA species also hybridizes with smaller restriction fragments within the open reading frame and 3'-untranslated region of the cDNA. This indicates that the entire open reading frame plus the 3'-untranslated region is contained in the large RNA species. RNase protection experiments revealed at least three points of divergence for the message species. One occurs within the COOH-terminal portion of the open reading frame at a position just 5' to the TGA stop codon. This divergence accounts for the two classes of acetylcholinesterase found in abundance in Torpedo. The site of splicing has been further defined by isolating a genomic clone containing the exon serving as the potential splice donor. We find a divergence between the cDNA and genomic DNA at the position estimated by the protection experiments. A less abundant divergence in mRNA can also be detected in the 3'-untranslated region. Another divergence occurs as a deleted sequence within the 5'-noncoding region and may be important for controlling translation efficiency. Since it is hypothesized that a single gene encodes acetylcholinesterase, the divergences in the very 3' region of the open reading frame and the 5'-noncoding region correspond to presumed splice junction boundaries where alternative RNA splicing occurs.

18.
19.