首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
MOTIVATION: Most computational methodologies for miRNA:mRNA target gene prediction use the seed segment of the miRNA and require cross-species sequence conservation in this region of the mRNA target. Methods that do not rely on conservation generate numbers of predictions, which are too large to validate. We describe a target prediction method (NBmiRTar) that does not require sequence conservation, using instead, machine learning by a na?ve Bayes classifier. It generates a model from sequence and miRNA:mRNA duplex information from validated targets and artificially generated negative examples. Both the 'seed' and 'out-seed' segments of the miRNA:mRNA duplex are used for target identification. RESULTS: The application of machine-learning techniques to the features we have used is a useful and general approach for microRNA target gene prediction. Our technique produces fewer false positive predictions and fewer target candidates to be tested. It exhibits higher sensitivity and specificity than algorithms that rely on conserved genomic regions to decrease false positive predictions.  相似文献   

3.
Prediction of both conserved and nonconserved microRNA targets in animals   总被引:2,自引:0,他引:2  
MOTIVATION: MicroRNAs (miRNAs) are involved in many diverse biological processes and they may potentially regulate the functions of thousands of genes. However, one major issue in miRNA studies is the lack of bioinformatics programs to accurately predict miRNA targets. Animal miRNAs have limited sequence complementarity to their gene targets, which makes it challenging to build target prediction models with high specificity. RESULTS: Here we present a new miRNA target prediction program based on support vector machines (SVMs) and a large microarray training dataset. By systematically analyzing public microarray data, we have identified statistically significant features that are important to target downregulation. Heterogeneous prediction features have been non-linearly integrated in an SVM machine learning framework for the training of our target prediction model, MirTarget2. About half of the predicted miRNA target sites in human are not conserved in other organisms. Our prediction algorithm has been validated with independent experimental data for its improved performance on predicting a large number of miRNA down-regulated gene targets. AVAILABILITY: All the predicted targets were imported into an online database miRDB, which is freely accessible at http://mirdb.org.  相似文献   

4.
5.
The prediction of novel pre-microRNA (miRNA) from genomic sequence has received considerable attention recently. However, the majority of studies have focused on the human genome. Previous studies have demonstrated that sensitivity (correctly detecting true miRNA) is sustained when human-trained methods are applied to other species, however they have failed to report the dramatic drop in specificity (the ability to correctly reject non-miRNA sequences) in non-human genomes. Considering the ratio of true miRNA sequences to pseudo-miRNA sequences is on the order of 1:1000, such low specificity prevents the application of most existing tools to non-human genomes, as the number of false positives overwhelms the true predictions. We here introduce a framework (SMIRP) for creating species-specific miRNA prediction systems, leveraging sequence conservation and phylogenetic distance information. Substantial improvements in specificity and precision are obtained for four non-human test species when our framework is applied to three different prediction systems representing two types of classifiers (support vector machine and Random Forest), based on three different feature sets, with both human-specific and taxon-wide training data. The SMIRP framework is potentially applicable to all miRNA prediction systems and we expect substantial improvement in precision and specificity, while sustaining sensitivity, independent of the machine learning technique chosen.  相似文献   

6.
microRNA计算发现方法的研究进展   总被引:5,自引:0,他引:5  
侯妍妍  应晓敏  李伍举 《遗传》2008,30(6):687-696
microRNA (miRNA)是近几年发现的一类长度为~21 nt的内源非编码小RNA, 在植物和动物中发挥着重要而广泛的调控功能。它的发现主要有cDNA克隆测序和计算发现两条途径。由于cDNA克隆测序方法受miRNA表达的时间和组织特异性以及表达水平的影响, 而计算发现可以弥补其不足, 因此miRNA的计算发现方法研究受到了广泛的重视。文章对近几年计算发现miRNA的研究进展进行了综述, 根据计算发现方法的本质, 将计算发现方法归纳为5类, 分别是同源片段搜索方法、基于比较基因组学的预测方法、基于序列和结构特征打分的预测方法、结合作用靶标的预测方法和基于机器学习的预测方法, 并对各类方法的原理、核心思想、优点和局限性进行了分析, 最后探讨了进一步的发展方向。  相似文献   

7.
8.
9.
Rahman ME  Islam R  Islam S  Mondal SI  Amin MR 《Genomics》2012,99(4):189-194
MicroRNA (miRNA) is a special class of short noncoding RNA that serves pivotal function of regulating gene expression. The computational prediction of new miRNA candidates involves various methods such as learning methods and methods using expression data. This article has proposed a reliable model - miRANN which is a supervised machine learning approach. MiRANN used known pre-miRNAs as positive set and a novel negative set from human CDS regions. The number of known miRNAs is now huge and diversified that could cover almost all characteristics of unknown miRNAs which increases the quality of the result (99.9% accuracy, 99.8% sensitivity, 100% specificity) and provides a more reliable prediction. MiRANN performs better than other state-of-the-art approaches and declares to be the most potential tool to predict novel miRNAs. We have also tested our result using a previous negative set. MiRANN, opens new ground using ANN for predicting pre-miRNAs with a promise of better performance.  相似文献   

10.
11.
水稻MicroRNA的预测及实验验证   总被引:1,自引:0,他引:1  
根据已报道水稻pre-miRNA的序列与结构信息,利用支持向量机(support vector machine, SVM)方法在miRNA前体上预测成熟区,产生一个模型——mature-SVM.它预测水稻成熟区的敏感性和特异性分别为86.7% 和100%;然后,用这个模型对从水稻基因组中筛选出的46.501条pre-miRNA进行成熟链预测,此外再根据miRNA的作用原理用blast程序所进一步的筛选,得到了127条pre-miRNA及成熟miRNA;除去其中已知的21条,最后得到106条候选的新的水稻miRNA. 从中随机挑取10条进行Northern验证,结果有4条miRNA得到确认.  相似文献   

12.
13.
MicroRNAs (miRNAs) are small non-coding RNA molecules that regulate mRNAs through a sequence-specific mechanism. By virtue of their structure and mechanism of action, computational methods have been devised to investigate the encoding of miRNA genes and the targets of miRNA action. A variety of assumptions have predicated the implementation of these various computational solutions. Evolutionary sequence conservation, secondary structure, and folding energetics are some of the assumptions that have been used. The success of these different computational solutions has been evaluated for both elucidation of new miRNAs and deducing targets of miRNA action. While the focus is on search techniques for new miRNAs, we have compared the programs miRseeker, miRScan, PalGrade, ProMiR, and miRAlign as examples of implementation of these techniques. For these programs, a benchmark comparison between theoretical estimation and actual identification is possible. We have also compared the target prediction programs TargetScanS, PicTar, DIANA-microT, miRanda, and RNAhybrid. However, it is difficult to rigorously assess the benchmark performance of these programs due to the difficulty in confirming their theoretical predictions.  相似文献   

14.
MicroRNAs (miRNAs) are important regulators of gene expression. The large-scale detection and profiling of miRNAs have been accelerated with the development of high-throughput small RNA sequencing (sRNA-Seq) techniques and bioinformatics tools. However, generating high-quality comprehensive miRNA annotations remains challenging due to the intrinsic complexity of sRNA-Seq data and inherent limitations of existing miRNA prediction tools. Here, we present iwa-miRNA, a Galaxy-based framework that can facilitate miRNA annotation in plant species by combining computational analysis and manual curation. iwa-miRNA is specifically designed to generate a comprehensive list of miRNA candidates, bridging the gap between already annotated miRNAs provided by public miRNA databases and new predictions from sRNA-Seq datasets. It can also assist users in selecting promising miRNA candidates in an interactive mode, contributing to the accessibility and reproducibility of genome-wide miRNA annotation. iwa-miRNA is user-friendly and can be easily deployed as a web application for researchers without programming experience. With flexible, interactive, and easy-to-use features, iwa-miRNA is a valuable tool for the annotation of miRNAs in plant species with reference genomes. We also illustrate the application of iwa-miRNA for miRNA annotation using data from plant species with varying genomic complexity. The source codes and web server of iwa-miRNA are freely accessible at http://iwa-miRNA.omicstudio.cloud/.  相似文献   

15.
Viral evolution remains to be a main obstacle in the effectiveness of antiviral treatments. The ability to predict this evolution will help in the early detection of drug-resistant strains and will potentially facilitate the design of more efficient antiviral treatments. Various tools has been utilized in genome studies to achieve this goal. One of these tools is machine learning, which facilitates the study of structure-activity relationships, secondary and tertiary structure evolution prediction, and sequence error correction. This work proposes a novel machine learning technique for the prediction of the possible point mutations that appear on alignments of primary RNA sequence structure. It predicts the genotype of each nucleotide in the RNA sequence, and proves that a nucleotide in an RNA sequence changes based on the other nucleotides in the sequence. Neural networks technique is utilized in order to predict new strains, then a rough set theory based algorithm is introduced to extract these point mutation patterns. This algorithm is applied on a number of aligned RNA isolates time-series species of the Newcastle virus. Two different data sets from two sources are used in the validation of these techniques. The results show that the accuracy of this technique in predicting the nucleotides in the new generation is as high as 75 %. The mutation rules are visualized for the analysis of the correlation between different nucleotides in the same RNA sequence.  相似文献   

16.
Li Q  Jin X  Zhu YX 《遗传学报》2012,39(7):351-360
The plant genome possesses a large number of microRNAs(miRNAs)mainly 21-24 nucleotides in length.They play a vital role in regulation of target gene expression at various stages throughout the whole plant life cycle.Here we sequenced and analyzed~10 million non-coding RNAs(ncRNAs)derived from fiber tissue of the allotetraploid cotton(Gossypium hirsutum)1 days post-anthesis using ncRNA-seq technology.In terms of distinct reads,24 nt ncRNA is by far the dominant species,followed by 21 nt and 23 nt ncRNAs. Using ab initio prediction,we identified and characterized a total of 562 candidate miRNA gene loci on the recently assembled D5 genome of the diploid cotton G.raimondii.Of all the 562 predicted miRNAs,22 were previously discovered in cotton species and 187 had sequence conservation and homology to homologous miRNAs of other plant species.Nucleotide bias analysis showed that the 9th and 1 st positions were significantly conserved among different types of miRNA genes.Among the 463 putative miRNA target genes,most significant up/down-regulation occurred in 10-20 days post-anthesis,indicating that miRNAs played an important role during the elongation and secondary cell wall synthesis stages of cotton fiber development.The discovery of new miRNA genes will help understand the mechanisms of miRNA generation and regulation in cotton.  相似文献   

17.
18.
19.
20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号