首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Increasing evidence has indicated that long non-coding RNAs (lncRNAs) are implicated in and associated with many complex human diseases. Despite of the accumulation of lncRNA-disease associations, only a few studies had studied the roles of these associations in pathogenesis. In this paper, we investigated lncRNA-disease associations from a network view to understand the contribution of these lncRNAs to complex diseases. Specifically, we studied both the properties of the diseases in which the lncRNAs were implicated, and that of the lncRNAs associated with complex diseases. Regarding the fact that protein coding genes and lncRNAs are involved in human diseases, we constructed a coding-non-coding gene-disease bipartite network based on known associations between diseases and disease-causing genes. We then applied a propagation algorithm to uncover the hidden lncRNA-disease associations in this network. The algorithm was evaluated by leave-one-out cross validation on 103 diseases in which at least two genes were known to be involved, and achieved an AUC of 0.7881. Our algorithm successfully predicted 768 potential lncRNA-disease associations between 66 lncRNAs and 193 diseases. Furthermore, our results for Alzheimer''s disease, pancreatic cancer, and gastric cancer were verified by other independent studies.  相似文献   

2.

Background

Evidences have increasingly indicated that lncRNAs (long non-coding RNAs) are deeply involved in important biological regulation processes leading to various human complex diseases. Experimental investigations of these disease associated lncRNAs are slow with high costs. Computational methods to infer potential associations between lncRNAs and diseases have become an effective prior-pinpointing approach to the experimental verification.

Results

In this study, we develop a novel method for the prediction of lncRNA-disease associations using bi-random walks on a network merging the similarities of lncRNAs and diseases. Particularly, this method applies a Laplacian technique to normalize the lncRNA similarity matrix and the disease similarity matrix before the construction of the lncRNA similarity network and disease similarity network. The two networks are then connected via existing lncRNA-disease associations. After that, bi-random walks are applied on the heterogeneous network to predict the potential associations between the lncRNAs and the diseases. Experimental results demonstrate that the performance of our method is highly comparable to or better than the state-of-the-art methods for predicting lncRNA-disease associations. Our analyses on three cancer data sets (breast cancer, lung cancer, and liver cancer) also indicate the usefulness of our method in practical applications.

Conclusions

Our proposed method, including the construction of the lncRNA similarity network and disease similarity network and the bi-random walks algorithm on the heterogeneous network, could be used for prediction of potential associations between the lncRNAs and the diseases.
  相似文献   

3.

Background

In human genomes, long non-coding RNAs (lncRNAs) have attracted more and more attention because their dysfunctions are involved in many diseases. However, the associations between lncRNAs and diseases (LDA) still remain unknown in most cases. While identifying disease-related lncRNAs in vivo is costly, computational approaches are promising to not only accelerate the possible identification of associations but also provide clues on the underlying mechanism of various lncRNA-caused diseases. Former computational approaches usually only focus on predicting new associations between lncRNAs having known associations with diseases and other lncRNA-associated diseases. They also only work on binary lncRNA-disease associations (whether the pair has an association or not), which cannot reflect and reveal other biological facts, such as the number of proteins involved in LDA or how strong the association is (i.e., the intensity of LDA).

Results

To address abovementioned issues, we propose a graph regression-based unified framework (GRUF). In particular, our method can work on lncRNAs, which have no previously known disease association and diseases that have no known association with any lncRNAs. Also, instead of only a binary answer for the association, our method tries to uncover more biological relationship between a pair of lncRNA and disease, which may provide better clues for researchers. We compared GRUF with three state-of-the-art approaches and demonstrated the superiority of GRUF, which achieves 5%~16% improvement in terms of the area under the receiver operating characteristic curve (AUC). GRUF also provides a predicted confidence score for the predicted LDA, which reveals the significant correlation between the score and the number of RNA-Binding Proteins involved in LDAs. Lastly, three out of top-5 LDA candidates generated by GRUF in novel prediction are verified indirectly by medical literature and known biological facts.

Conclusions

The proposed GRUF has two advantages over existing approaches. Firstly, it can be used to work on lncRNAs that have no known disease association and diseases that have no known association with any lncRNAs. Secondly, instead of providing a binary answer (with or without association), GRUF works for both discrete and continued LDA, which help revealing the pathological implications between lncRNAs and diseases.
  相似文献   

4.
Recently, many long non-coding RNAs (lncRNAs) have been identified and their biological function has been characterized; however, our understanding of their underlying molecular mechanisms related to disease is still limited. To overcome the limitation in experimentally identifying disease–lncRNA associations, computational methods have been proposed as a powerful tool to predict such associations. These methods are usually based on the similarities between diseases or lncRNAs since it was reported that similar diseases are associated with functionally similar lncRNAs. Therefore, prediction performance is highly dependent on how well the similarities can be captured. Previous studies have calculated the similarity between two diseases by mapping exactly each disease to a single Disease Ontology (DO) term, and then use a semantic similarity measure to calculate the similarity between them. However, the problem of this approach is that a disease can be described by more than one DO terms. Until now, there is no annotation database of DO terms for diseases except for genes. In contrast, Human Phenotype Ontology (HPO) is designed to fully annotate human disease phenotypes. Therefore, in this study, we constructed disease similarity networks/matrices using HPO instead of DO. Then, we used these networks/matrices as inputs of two representative machine learning-based and network-based ranking algorithms, that is, regularized least square and heterogeneous graph-based inference, respectively. The results showed that the prediction performance of the two algorithms on HPO-based is better than that on DO-based networks/matrices. In addition, our method can predict 11 novel cancer-associated lncRNAs, which are supported by literature evidence.  相似文献   

5.
6.
哺乳动物中,只有小部分基因转录成为编码蛋白质的RNA,大量的基因则转录为不能编码蛋白质的RNA,即ncRNA。长非 编码RNA(lncRNAs)是分子长度在200-100000 nt 之间的一类ncRNA。lncRNAs 的数量超过蛋白质编码基因的数量。目前,对长非 编码RNA(lncRNAs)的生物学特性,转录调控以及其在肿瘤发生发展中的作用机制的研究任然是RNA研究的热点。lncRNAs 通 过控制染色质重塑,转录调控和录转录后调控而在基因的转录调节中发挥了重要作用。lncRNAs 与多种肿瘤相关,并且在抑制因 素和促进因素中都具有重要的作用。众多文献报道的结果表明lncRNAs 参与调控基因表达,在正常细胞与肿瘤细胞的转换中起 到至关重要的作用。  相似文献   

7.
MiRNAs are a class of small non‐coding RNAs that are involved in the development and progression of various complex diseases. Great efforts have been made to discover potential associations between miRNAs and diseases recently. As experimental methods are in general expensive and time‐consuming, a large number of computational models have been developed to effectively predict reliable disease‐related miRNAs. However, the inherent noise and incompleteness in the existing biological datasets have inevitably limited the prediction accuracy of current computational models. To solve this issue, in this paper, we propose a novel method for miRNA‐disease association prediction based on matrix completion and label propagation. Specifically, our method first reconstructs a new miRNA/disease similarity matrix by matrix completion algorithm based on known experimentally verified miRNA‐disease associations and then utilizes the label propagation algorithm to reliably predict disease‐related miRNAs. As a result, MCLPMDA achieved comparable performance under different evaluation metrics and was capable of discovering greater number of true miRNA‐disease associations. Moreover, case study conducted on Breast Neoplasms further confirmed the prediction reliability of the proposed method. Taken together, the experimental results clearly demonstrated that MCLPMDA can serve as an effective and reliable tool for miRNA‐disease association prediction.  相似文献   

8.
As a major class of noncoding RNAs, long noncoding RNAs (lncRNAs) have been implicated in various critical biological processes. Accumulating researches have linked dysregulations and mutations of lncRNAs to a variety of human disorders and diseases. However, to date, only a few human lncRNAs have been associated with diseases. Therefore, it is very important to develop a computational method to globally predict potential associated diseases for human lncRNAs. In this paper, we developed a computational framework to accomplish this by combining human lncRNA expression profiles, gene expression profiles, and human disease-associated gene data. Applying this framework to available human long intergenic noncoding RNAs (lincRNAs) expression data, we showed that the framework has reliable accuracy. As a result, for non-tissue-specific lincRNAs, the AUC of our algorithm is 0.7645, and the prediction accuracy is about 89%. This study will be helpful for identifying novel lncRNAs for human diseases, which will help in understanding the roles of lncRNAs in human diseases and facilitate treatment. The corresponding codes for our method and the predicted results are all available at http://asdcd.amss.ac.cn/MingXiLiu/lncRNA-disease.html.  相似文献   

9.
10.

Background

With the development of sequencing technology, more and more long non-coding RNAs (lncRNAs) have been identified. Some lncRNAs have been confirmed that they play an important role in the process of development through the dosage compensation effect, epigenetic regulation, cell differentiation regulation and other aspects. However, the majority of the lncRNAs have not been functionally characterized. Explore the function of lncRNAs and the regulatory network has become a hot research topic currently.

Methods

In the work, a network-based model named BiRWLGO is developed. The ultimate goal is to predict the probable functions for lncRNAs at large scale. The new model starts with building a global network composed of three networks: lncRNA similarity network, lncRNA-protein association network and protein-protein interaction (PPI) network. After that, it utilizes bi-random walk algorithm to explore the similarities between lncRNAs and proteins. Finally, we can annotate an lncRNA with the Gene Ontology (GO) terms according to its neighboring proteins.

Results

We compare the performance of BiRWLGO with the state-of-the-art models on a manually annotated lncRNA benchmark with known GO terms. The experimental results assert that BiRWLGO outperforms other methods in terms of both maximum F-measure (Fmax) and coverage.

Conclusions

BiRWLGO is a relatively efficient method to predict the functions of lncRNA. When protein interaction data is integrated, the predictive performance of BiRWLGO gains a great improvement.
  相似文献   

11.
《Genomics》2020,112(2):1879-1888
Porcine reproductive and respiratory syndrome (PRRS), which is caused by PRRS virus (PRRSV), is one of the most globally devastating swine diseases. It is essential to develop new strategy to control PRRS via an understanding of mechanisms that PRRSV utilizes to interfere with the host's innate immunity. In this study, we deeply sequenced and analyzed long noncoding RNA (lncRNA) and mRNA expression profiles of the porcine alveolar macrophages (PAMs) after PRRSV infection. 126 lncRNAs and 753 mRNAs were differentially expressed between PRRSV-infected and control PAMs. The co-expressed genes of down-regulated lncRNAs were significantly enriched within NF-kappa B and toll-like receptor signaling pathways. Co-expression network analysis indicated that part of the dysregulated lncRNAs associated with the interferon-induced genes. These dysregulated lncRNAs may play an important role in the host's innate immune responses to PRRSV infection. However, further research is required to characterize the function of these lncRNAs.  相似文献   

12.
Non-coding RNAs in Alzheimer's Disease   总被引:1,自引:0,他引:1  
  相似文献   

13.
Many experimental and computational studies have identified key protein coding genes in initiation and progression of esophageal squamous cell carcinoma (ESCC). However, the number of researches that tried to reveal the role of long non-coding RNAs (lncRNAs) in ESCC has been limited. LncRNAs are one of the important regulators of cancers which are transcribed dominantly in the genome and in various conditions. The main goal of this study was to use a systems biology approach to predict novel lncRNAs as well as protein coding genes associated with ESCC and assess their prognostic values. By using microarray expression data for mRNAs and lncRNAs from a large number of ESCC patients, we utilized “Weighted Gene Co-expression Network Analysis” (WGCNA) method to make a big coding-non-coding gene co-expression network, and discovered important functional modules. Gene set enrichment and pathway analysis revealed major biological processes and pathways involved in these modules. After selecting some protein coding genes involved in biological processes and pathways related to cancer, we used “LncTar”, a computational tool to predict potential interactions between these genes and lncRNAs. By combining interaction results with Pearson correlations, we introduced some novel lncRNAs with putative key regulatory roles in the network. Survival analysis with Kaplan-Meier estimator and Log-rank test statistic confirmed that most of the introduced genes are associated with poor prognosis in ESCC. Overall, our study reveals novel protein coding genes and lncRNAs associated with ESCC, along with their predicted interactions. Based on the promising results of survival analysis, these genes can be used as good estimators of patients' survival, or even can be analyzed further as new potential signatures or targets for the therapy of ESCC disease.  相似文献   

14.
目的 长非编码RNA(lncRNAs)参与多种重要的生物学过程并与各种人类疾病密切相关,因此,lncRNA-疾病关联预测研究有助于疾病的诊断、治疗和在分子水平理解人类疾病的发生发展机制。目前,大多数lncRNA-疾病关联预测方法倾向于浅层整合lncRNA和疾病的相关信息,忽略网络拓扑结构中的深层嵌入特征;另外通过随机选取lncRNA-疾病非关联对构建负样本训练集合,影响预测方法的鲁棒性。方法 本文提出一种基于网络嵌入的NELDA方法,预测潜在的lncRNA-疾病关联关系。NELDA首先利用lncRNA 表达谱、疾病本体论和已知的lncRNA-疾病关联关系,构建lncRNA相似性网络、疾病相似性网络和lncRNA-疾病关联网络。然后,通过设计4个深度自编码器分别从lncRNA/疾病的相似性网络、lncRNA-疾病关联网络学习lncRNA和疾病的低维网络嵌入特征。串联lncRNA和疾病的相似性网络嵌入特征及lncRNA和疾病的关联网络嵌入特征,分别输入两个支持向量机分类器预测lncRNA-疾病关联。最后,采用加权融合策略融合两个支持向量机分类器的预测结果,给出lncRNA-疾病关联关系的最终预测结果。另外,根据已知的lncRNA-疾病关联对和疾病语义相似性,设计一种负样本选取策略构建可信度相对较高的lncRNA-疾病非关联对样本集,用以改善分类器的鲁棒性,该策略通过设计一种打分函数为每对lncRNA-疾病进行打分,选取得分较低的lncRNA-疾病对作为lncRNA-疾病非关联对样本(即负样本)。结果 十折交叉验证实验结果表明:NELDA能够有效预测lncRNA-疾病关联关系,其AUC达到0.982 7,比现有LDASR和 LDNFSGB方法分别提高了0.062 7和0.020 7。另外,负样本选取策略与决策级加权融合策略能够有效改善NELDA预测性能。胃癌和乳腺癌案例研究中,29/40(72.5%)预测的与胃癌和乳腺癌关联lncRNAs,在近期文献和公共数据库中能够发现相关的支撑证据。结论 这些实验结果表明,NELDA是一种有效的lncRNA-疾病关联关系预测方法,具有挖掘潜在lncRNA-疾病关联关系的能力。  相似文献   

15.
Protein folding and protein binding are similar processes. In both, structural units combinatorially associate with each other. In the case of folding, we mostly handle relatively small units, building blocks or domains, that are covalently linked. In the case of multi-molecular binding, the subunits are relatively large and are associated only by non-covalent bonds. Experimentally, the difficulty in the determination of the structures of such large assemblies increases with the complex size and the number of components it contains. Computationally, the prediction of the structures of multi-molecular complexes has largely not been addressed, probably owing to the magnitude of the combinatorial complexity of the problem. Current docking algorithms mostly target prediction of pairwise interactions. Here our goal is to predict the structures of multi-unit associations, whether these are chain-connected as in protein folding, or separate disjoint molecules in the assemblies. We assume that the structures of the single units are known, either through experimental determination or modeling. Our aim is to combinatorially assemble these units to predict their structure. To address this problem we have developed CombDock. CombDock is a combinatorial docking algorithm for the structural units assembly problem. Below, we briefly describe the algorithm and present examples of its various applications to folding and to multi-molecular assemblies. To test the robustness of the algorithm, we use inaccurate models of the structural units, derived either from crystal structures of unbound molecules or from modeling of the target sequences. The algorithm has been able to predict near-native arrangements of the input structural units in almost all of the cases, suggesting that a combinatorial approach can overcome the imperfect shape complementarity caused by the inaccuracy of the models. In addition, we further show that through a combinatorial docking strategy it is possible to enhance the predictions of pairwise interactions involved in a multi-molecular assembly.  相似文献   

16.
为实现高通量识别新的药物-长链非编码RNA(Long non-coding RNA, lncRNA)关联,本文提出了一种基于图卷积网络模型来识别潜在药物-lncRNA关联的方法DLGCN(Drug-LncRNA graph convolution network)。首先,基于药物的结构信息和lncRNA的序列信息分别构建了药物-药物和lncRNA-lncRNA相似性网络,并整合实验证实的药物-lncRNA关联构建了药物-lncRNA异质性网络。然后,将注意力机制和图卷积运算应用于该网络中,学习药物和lncRNA的低维特征,基于整合的低维特征预测新的药物-lncRNA关联。通过效能评估,DLGCN的受试者工作特性曲线下面积(Area under receiver operating characteristic, AUROC)达到0.843 1,优于经典的机器学习方法和常见的深度学习方法。此外,DLGCN预测到姜黄素能够调控lncRNA MALAT1的表达,已被最近的研究证实。DLGCN能够有效预测药物-lncRNA关联,为肿瘤治疗新靶点的识别和抗癌药物的筛选提供了重要参考。  相似文献   

17.
Neurodegenerative diseases (NDs) are a diversity of neurological disorders characterized by the progressive degeneration of the structure and function of the central nervous system (CNS). The most common NDs are Alzheimer's disease (AD), Parkinson's disease (PD), and Huntington's disease (HD). Recently, many studies have investigated associations between common NDs with noncoding RNAs (ncRNAs) molecules. ncRNAs are regulatory molecules in the normal functioning of the CNS. Two of the most important ncRNAs are microRNAs (miRNAs) and long noncoding RNAs (lncRNAs). These types of ncRNAs are involved in different biological processes including brain development, maturation, differentiation, neuronal cell specification, neurogenesis, and neurotransmission. Increasing data has demonstrated that miRNAs and lncRNAs have strong correlations with the development of NDs, particularly gene expression. Besides, ncRNAs can be introduced as new biomarkers for diagnosis and prognosis of NDs. Hence, in this review, we summarized the involvement of various miRNAs and lncRNAs in most common NDs followed by a correlation of ncRNAs dysregulation with the AD, PD, and HD.  相似文献   

18.
Over the past decade, there has been growing enthusiasm for using electronic medical records (EMRs) for biomedical research. Quantile regression estimates distributional associations, providing unique insights into the intricacies and heterogeneity of the EMR data. However, the widespread nonignorable missing observations in EMR often obscure the true associations and challenge its potential for robust biomedical discoveries. We propose a novel method to estimate the covariate effects in the presence of nonignorable missing responses under quantile regression. This method imposes no parametric specifications on response distributions, which subtly uses implicit distributions induced by the corresponding quantile regression models. We show that the proposed estimator is consistent and asymptotically normal. We also provide an efficient algorithm to obtain the proposed estimate and a randomly weighted bootstrap approach for statistical inferences. Numerical studies, including an empirical analysis of real-world EMR data, are used to assess the proposed method's finite-sample performance compared to existing literature.  相似文献   

19.
《Genomics》2020,112(2):1754-1760
Recently, lncRNAs have attracted accumulating attentions because more and more experimental researches have shown lncRNA can play critical roles in many biological processes. Predicting potential interactions between lncRNAs and proteins are key to understand the lncRNAs biological functions. But traditional biological experiments are expensive and time-consuming, network similarity methods provide a powerful solution to computationally predict lncRNA-protein interactions. In this work, a novel path-based lncRNA-protein interaction (PBLPI) prediction model is proposed by integrating protein semantic similarity, lncRNA functional similarity, known human lncRNA-protein interactions, and Gaussian interaction profile kernel similarity. PBLPI model utilizes three interlinked sub-graphs to construct a heterogeneous graph, and then infers potential lncRNA-protein interactions through depth-first search algorithm. Consequently, PBLPI achieves reliable performance in the frameworks of 5-fold cross validation (average AUC is 0.9244 and AUPR is 0.6478). In the case study, we use “Mus musculus” data to further validate the reliability of PBLPI method. It is anticipated that PBLPI would become a useful tool to identify potential lncRNA-protein interactions.  相似文献   

20.
Mounting evidence shows that organisms have already begun to respond to global climate change. Advances in our knowledge of how climate shapes species distributional patterns has helped us better understand the response of birds to climate change. However, the distribution of birds across the landscape is also driven by biotic and abiotic components, including habitat characteristics. We therefore developed statistical models of 147 bird species distributions in the eastern United States, using climate, elevation, and the distributions of 39 tree species to predict contemporary bird distributions. We used randomForest, a robust regression‐based decision tree ensemble method to predict contemporary bird distributions. These models were then projected onto three models of climate change under high and low emission scenarios for both climate and the projected change in suitable habitat for the 39 tree species. The resulting bird species models indicated that breeding habitat will decrease by at least 10% for 61–79 species (depending on model and emissions scenario) and increase by at least 10% for 38–52 species in the eastern United States. Alternatively, running the species models using only climate/elevation (omitting tree species), we found that the predictive power of these models was significantly reduced (p<0.001). When these climate/elevation‐only models were projected onto the climate change scenarios, the change in suitable habitat was more extreme in 60% of the species. In the end, the strong associations with vegetation tempers a climate/elevation‐only response to climate change and indicates that refugia of suitable habitat may persist for these bird species in the eastern US, even after the redistribution of tree species. These results suggest the importance of interacting biotic processes and that further fine‐scale research exploring how climate change may disrupt species specific requirements is needed.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号