首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
基于支持向量机识别真核生物DNA中的翻译起始位点   总被引:2,自引:1,他引:1  
翻译起始位点(TIS)的识别是真核生物基因预测的关键步骤之一,近年来一直得到研究人员的高度重视。基于TIS附近序列的统计特性,出现了一些辨识TIS的判别方法,但识别精度还有待进一步提高。针对传统支持向量机(SVM)方法中存在的不足,提出了基于数据优化法的SVM,它通过其它统计学模型优化训练数据集,进而提高分类器的辨识精度。实验结果表明基于数据优化法的SVM分类器在翻译起始位点的辨识上可获得比其他判别方法更好的效果。  相似文献   

3.
4.
Liu H  Han H  Li J  Wong L 《In silico biology》2004,4(3):255-269
The translation initiation site (TIS) prediction problem is about how to correctly identify TIS in mRNA, cDNA, or other types of genomic sequences. High prediction accuracy can be helpful in a better understanding of protein coding from nucleotide sequences. This is an important step in genomic analysis to determine protein coding from nucleotide sequences. In this paper, we present an in silico method to predict translation initiation sites in vertebrate cDNA or mRNA sequences. This method consists of three sequential steps as follows. In the first step, candidate features are generated using k-gram amino acid patterns. In the second step, a small number of top-ranked features are selected by an entropy-based algorithm. In the third step, a classification model is built to recognize true TISs by applying support vector machines or ensembles of decision trees to the selected features. We have tested our method on several independent data sets, including two public ones and our own extracted sequences. The experimental results achieved are better than those reported previously using the same data sets. Our high accuracy not only demonstrates the feasibility of our method, but also indicates that there might be "amino acid" patterns around TIS in cDNA and mRNA sequences.  相似文献   

5.
6.
Translation initiation start prediction in human cDNAs with high accuracy   总被引:3,自引:0,他引:3  
MOTIVATION: Correct identification of the Translation Initiation Start (TIS) in cDNA sequences is an important issue for genome annotation. The aim of this work is to improve upon current methods and provide a performance guaranteed prediction. METHODS: This is achieved by using two modules, one sensitive to the conserved motif and the other sensitive to the coding/non-coding potential around the start codon. Both modules are based on Artificial Neural Networks (ANNs). By applying the simplified method of the ribosome scanning model, the algorithm starts a linear search at the beginning of the coding ORF and stops once the combination of the two modules predicts a positive score. RESULTS: According to the results of the test group, 94% of the TIS were correctly predicted. A confident decision is obtained through the use of the Las Vegas algorithm idea. The incorporation of this algorithm leads to a highly accurate recognition of the TIS in human cDNAs for 60% of the cases. Availability: The program is available upon request from the author.  相似文献   

7.
We isolated a group of genes that are rapidly and transiently induced in 3T3 cells by tetradecanoyl phorbol acetate (TPA). These genes are called TIS genes (for TPA-inducible sequences). Epidermal growth factor (EGF), fibroblast growth factor (FGF), and TPA activated TIS gene expression with similar induction kinetics. TPA pretreatment to deplete protein kinase C activity did not abolish the subsequent induction of TIS gene expression by epidermal growth factor or fibroblast growth factor; both peptide mitogens can activate TIS genes through a protein kinase C-independent pathway(s). We also analyzed TIS gene expression in three TPA-nonproliferative variants (3T3-TNR2, 3T3-TNR9, and A31T6E12A). The results indicate that (i) modulation of a TPA-responsive sodium-potassium-chloride transport system is not necessary for TIS gene induction either by TPA or by other mitogens and (ii) TIS gene induction is not sufficient to guarantee a proliferative response to mitogenic stimulation.  相似文献   

8.
TIS11d is a member of the CCCH-type family of tandem zinc finger (TZF) proteins; the TZF domain of TIS11d (residues 151–220) is sufficient to bind and destabilize its target mRNAs with high specificity. In this study, the TZF domain of TIS11d is simulated in an aqueous environment in both the free and RNA-bound states. Multiple nanosecond timescale molecular dynamics trajectories of TIS11d wild-type and E157R/E195K mutant with different RNA sequences were performed to investigate the molecular basis for RNA binding specificities of this TZF domain. A variety of measures of the protein structure, fluctuations, and dynamics were used to analyze the trajectories. The results of this study support the following conclusions: (1) the structure of the two fingers is maintained in the free state but a global reorientation occurs to yield a more compact structure; (2) mutation of the glutamate residues at positions 157 and 195 to arginine and lysine, respectively, affects the RNA recognition by this TIS11d mutant in agreement with the findings of Pagano et al. (J Biol Chem 2007; 282:8883–8894); and (3) we predict that the E157R/E195K mutant will present a more relaxed RNA binding specificity relative to wild-type TIS11d based on the more favorable nonsequence-specific Coulomb interaction of the two positively charged residues at positions 157 and 195 with the RNA backbone, which compensates for a partial loss of the stacking interaction of aromatic side chains with the RNA bases.  相似文献   

9.
SUMMARY: DNAFSMiner (DNA Functional Sites Miner) is a web-based software toolbox to recognize functional sites in nucleic acid sequences. Currently in this toolbox, we provide two software: TIS Miner and Poly(A) Signal Miner. The TIS Miner can be used to predict translation initiation sites in vertebrate DNA/mRNA/cDNA sequences, and the Poly(A) Signal Miner can be used to predict polyadenylation [poly(A)] signals in human DNA sequences. The prediction results are better than those by literature methods on two benchmark applications. This good performance is mainly attributable to our unique learning method. DNAFSMiner is available free of charge for academic and non-profit organizations. AVAILABILITY: http://research.i2r.a-star.edu.sg/DNAFSMiner/ CONTACT: huiqing@i2r.a-star.edu.sg.  相似文献   

10.
11.
12.
翻译起始位点(TIS,即基因5’端)的精确定位是原核生物基因预测的一个关键问题,而基因组GC含量和翻译起始机制的多样性是影响当前TIS预测水平的重要因素.结合基因组结构的复杂信息(包括GC含量、TIS邻近序列及上游调控信号、序列编码潜能、操纵子结构等),发展刻画翻译起始机制的数学统计模型,据此设计TIS预测的新算法MED.StartPlus.并将MED.StartPlus与同类方法RBSfinder、GS.Finder、MED-Start、TiCo和Hon-yaku等进行系统地比较和评价.测试针对两种数据集进行:当前14个已知的TIS被确认的基因数据集,以及300个物种中功能已知的基因数据集.测试结果表明,MED-StartPlus的预测精度在总体上超过同类方法.尤其是对高GC含量基因组以及具有复杂翻译起始机制的基因组,MED-StartPlus具有明显的优势.  相似文献   

13.
14.
ACTH is the major trophic factor regulating and maintaining adrenocortical function, affecting such diverse processes as steroidogenesis, cell proliferation, cell migration, and cell survival. We used differential display RT-PCR to identify genes that are rapidly induced by ACTH in the bovine adrenal cortex. Of 42 PCR products differentially amplified from primary cultures of bovine adrenocortical cells treated with 10 nM ACTH, six identified mRNAs that were confirmed by Northern blot analysis to be induced by ACTH. Four of these amplicons encoded noninformative repetitive sequences. Of the other two sequenced amplicons, one encoded a partial sequence for mitochondrial manganese-dependent superoxide dismutase (SOD2), an enzyme that is likely to protect adrenocortical cells from the cytotoxic effects of radical oxygen species generated during steroid biosynthesis. The second was identified as TIS11b (phorbol-12-myristate-13-acetate-inducible sequence 11b)/ERF-1/cMG, a member of the CCCH double-zinc finger protein family. SOD2 induction by ACTH was independent of extracellular steroid concentration or oxidative stress. SOD2 and TIS11b mRNA expressions were rapidly induced by ACTH, reaching a maximal level after 8 h and 3 h of treatment, respectively. These ACTH effects were mimicked by forskolin but appeared independent of cortisol secretion. Upon ACTH treatment, induction of TIS11b expression closely followed the previously characterized peak of vascular endothelial growth factor (VEGF) expression. Transfection of a TIS11b expression plasmid into 3T3 fibroblasts induced a decrease in the expression of a reporter gene placed upstream of the VEGF 3'-untranslated region, indicating that TIS11b may be an important regulator of VEGF expression through interaction with its 3'-untranslated region.  相似文献   

15.
16.
Regions flanking the translation initiation site (TIS) are thought to play a crucial role in translation efficiency of mRNAs, but their exact sequence and evolution in eukaryotes are still a matter of debate. We investigated the context sequences in 20 nucleotides around the TIS in multi-cellular eukaryotes, with a focus on two model plants and a comparison to human. We identified consensus sequences aaaaaaa(A/G)(A/C)aAUGGcgaataata and ggcggc(g/c)(A/G)(A/C)(G/C)AUGGCggcggcgg for Arabidopsis thaliana and Oryza sativa, respectively. We observe strongly conserved G at position +4 and A or C at position -2; however, the exact nucleotide frequencies vary between the three organisms even at these conserved positions. The frequency of pyrimidines, which are considered sub optimum at position -3, is higher in both plants than in human. Arabidopsis is GC-depleted (AU-enriched) compared to both rice and human, and the enrichment is slightly stronger upstream than downstream of AUG. While both plants are similar though not identical in their variation of nucleotide frequencies, rice and human are more similar to each other than Arabidopsis and human. All three organisms display clear periodicity in A + G and C + U content when analyzing normalized frequencies. These findings suggest that, besides few highly conserved positions, overall structure of the context sequence plays a larger role in TIS recognition than the actual nucleotide frequencies.  相似文献   

17.
18.
Perspective-taking and emotion recognition are essential for successful social development and have been the focus of developmental research for many years. Although the two abilities often overlap, they are distinct and our understanding of these abilities critically rests upon the efficacy of existing measures. Lessons from the literature differentiating recall versus recognition memory tasks led us to hypothesize that an open-ended emotion recognition measure would be less reliant on compensatory strategies and hence a more specific measure of emotion recognition abilities than a forced-choice task. To this end, we compared an open-ended version of the Reading the Mind in the Eyes Task with the original forced-choice version in two studies: 118 typically-developing 4- to 8-year-olds (Study 1) and 139 5- to 12-year-olds; 85 typically-developing and 54 with learning disorders (Study 2). We found that the open-ended version of the task was a better predictor of empathy and more reliably discriminated typically-developing children from those with learning disorders. As a whole, the results suggest that the open-ended version is a more sensitive measure of emotion recognition specifically.  相似文献   

19.
Experience of two teratology information services in Europe   总被引:2,自引:0,他引:2  
Teratology Information Services (TIS) are started in different countries in Europe in order to gather available data on exogenous agents, to evaluate their pertinence to human subjects, and to apply this knowledge to specific cases. Most European centers can only be consulted by medical professionals. The experience of two such services (Lyon, France, and Bilthoven, The Netherlands) is described. Attention is given to the task of TIS, risk evaluation, operational methods, and functioning and future developments.  相似文献   

20.
Interpolated markov chains for eukaryotic promoter recognition.   总被引:9,自引:0,他引:9  
MOTIVATION: We describe a new content-based approach for the detection of promoter regions of eukaryotic protein encoding genes. Our system is based on three interpolated Markov chains (IMCs) of different order which are trained on coding, non-coding and promoter sequences. It was recently shown that the interpolation of Markov chains leads to stable parameters and improves on the results in microbial gene finding (Salzberg et al., Nucleic Acids Res., 26, 544-548, 1998). Here, we present new methods for an automated estimation of optimal interpolation parameters and show how the IMCs can be applied to detect promoters in contiguous DNA sequences. Our interpolation approach can also be employed to obtain a reliable scoring function for human coding DNA regions, and the trained models can easily be incorporated in the general framework for gene recognition systems. RESULTS: A 5-fold cross-validation evaluation of our IMC approach on a representative sequence set yielded a mean correlation coefficient of 0.84 (promoter versus coding sequences) and 0.53 (promoter versus non-coding sequences). Applied to the task of eukaryotic promoter region identification in genomic DNA sequences, our classifier identifies 50% of the promoter regions in the sequences used in the most recent review and comparison by Fickett and Hatzigeorgiou ( Genome Res., 7, 861-878, 1997), while having a false-positive rate of 1/849 bp.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号