共查询到19条相似文献,搜索用时 203 毫秒
1.
2.
3.
利用有限个实验条件下的基因表达谱数据,只能对与实验条件相关的基因功能类进行有效预测,所以有必要限定可预测的基因功能类范围。据此,首先基于GeneOntology(GO)选择富集差异表达基因与实验条件相关的功能类。再通过支持向量机分类器,深化预测迄今只注释到实验条件相关功能类的父结点的基因是否属于该实验条件相关功能类。应用于一套酵母基因表达谱数据,结果显示,在剔除了高度不平衡的训练集合后,平均真阳性率(precision)与平均覆盖率(recall)都分别达到了71%与47%以上。 相似文献
4.
基于Fiedler向量的基因表达谱数据分类方法 总被引:1,自引:0,他引:1
尝试将一种基于图的Fiedler向量的聚类算法引入到基因表达谱数据的肿瘤分类中来。该方法将分属不同类的所有样本通过高斯权构造Laplace完全图,经SVD分解后获得Fiedler向量,利用各样本所对应的Fiedler向量分量的符号差异来进行基因表达谱数据的分类。通过模拟数据仿真实验和对白血病两个亚型(ALL与AML)及结肠癌真实数据实验,证明了这一方法的有效性。 相似文献
5.
结合基因功能分类体系Gene Ontology筛选聚类特征基因 总被引:3,自引:0,他引:3
使用两套基因表达谱数据,按各基因的表达值方差,选择表达变异基因对样本聚类,发现一般使用方差较大的前10%的基因作为特征基因,就可以较好地对疾病样本聚类。对不同的疾病,包含聚类信息的特征基因有不同的分布特点。在此基础上,结合基因功能分类体系(Gene Ontology,GO),进一步筛选聚类的特征基因。通过检验在Gene Ontology中的每个功能类中的表达变异基因是否非随机地聚集,寻找疾病相关功能类,再根据相关功能类中的表达变异基因进行聚类分析。实验结果显示:结合基因功能体系进一步筛选表达变异基因作为聚类特征基因,可以保持或提高聚类准确性,并使得聚类结果具有明确的生物学意义。另外,发现了一些可能和淋巴瘤和白血病相关的基因。 相似文献
6.
7.
8.
9.
10.
11.
Currently, some efforts have been devoted to the text analysis of disease phenotype data, and their results indicated that similar disease phenotypes arise from functionally related genes. These related genes work together, as a functional module, to perform a desired cellular function. We constructed a text-based human disease phenotype network and detected 82 disease-specific gene functional modules, each corresponding to a different phenotype cluster, by means of graph-based clustering and mapping from disease phenotype to gene. Since genes in such gene functional modules are functionally related and cause clinically similar diseases, they may share common genetic origin of their associated disease phenotypes. We believe the investigation may facilitate the ultimate understanding of the common pathophysiologic basis of associated diseases. 相似文献
12.
活细胞依赖其众多的转录调控模块来实现复杂的生物功能,识别转录调控模块对深入理解细胞的功能及其转录机制有着重要的意义。本文结合酵母基因表达数据和ChIP-chip数据,提出了一种转录调控模块识别算法。该算法通过采用不同的P值阈值分别得到了核心集和粗糙集,然后对核心集和粗糙集进行判别,最后对基因进行扩展之后得到基因转录调控模块。将该算法运用到两个酵母基因表达数据中,得到了一些具有显著生物学意义的基因转录调控模块。与其它算法相比,该算法不仅可以识别含有较多基因的转录调控模块,而且可以识别一些其它算法不能识别的基因转录调控模块。识别得到的基因转录调控模块有着不同的生物学功能,并且有助于进一步理解酵母的转录调控机制。 相似文献
13.
Guangyu Cui Rojan Shrestha 《Computer methods in biomechanics and biomedical engineering》2013,16(7):691-699
Many biological processes are performed by a group of proteins rather than by individual proteins. Proteins involved in the same biological process often form a densely connected sub-graph in a protein–protein interaction network. Therefore, finding a dense sub-graph provides useful information to predict the function or protein complex of uncharacterised proteins in the sub-graph. We developed a heuristic algorithm that finds functional modules in a protein–protein interaction network and visualises the modules. The algorithm has been implemented in a platform-independent, standalone program called ModuleSearch. In an interaction network of yeast proteins, ModuleSearch found 366 overlapping modules. Of the modules, 71% have a function shared by more than half the proteins in the module and 58% have a function shared by all proteins in the module. Comparison of ModuleSearch with other programs shows that ModuleSearch finds more sub-graphs than most other programs, yet a higher proportion of the sub-graphs correspond to known functional modules. ModuleSearch and sample data are freely available to academics at http://bclab.inha.ac.kr/ModuleSearch. 相似文献
14.
Computational analysis is essential for transforming the masses of microarray data into a mechanistic understanding of cancer. Here we present a method for finding gene functional modules of cancer from microarray data and have applied it to colon cancer. First, a colon cancer gene network and a normal colon tissue gene network were constructed using correlations between the genes. Then the modules that tended to have a homogeneous functional composition were identified by splitting up the network. Analysis of both networks revealed that they are scale-free. Comparison of the gene functional modules for colon cancer and normal tissues showed that the modules' functions changed with their structures. 相似文献
15.
16.
榕小蜂的雌雄个体之间存在很大表型差异,以至于很难将雌雄个体彼此联系在一起.以对叶榕传粉榕小蜂作为材料,利用"加权基因共表达网络分析"软件(WGCNA),对榕小蜂的基因组和转录组进行分析,结果发现,5个基因共表达模块,分别用蓝色、蓝绿色、棕色、绿色和黄色标识,其中2个模块偏爱在雌蜂中表达,3个模块偏爱在蛹中表达.基因本体(GO)分析发现在蓝绿色和黄色表达模块中发现3个功能富集的基因集合.在蓝绿色基因表达模块中发现2个基因集合,分别与细胞周期和核苷酸结合活性有关;在黄色基因表达模块中发现1个基因结合,与细胞分化有关,尤其是与神经发育有关. 相似文献
17.
18.
前列腺癌病因及发病机理研究有助于前列腺癌预防和治疗.目前,前列腺癌生化试验研究方法成本高、耗时,而基于网络计算方法容易受基因表达谱数据不完整、噪声高及实验样本数量少等约束.为此,本文提出一种基于节点-模块置信度及局部模块度的双重约束算法(命名为NMCOM),挖掘前列腺癌候选疾病模块.NMCOM算法不依赖基因表达谱数据,采用候选基因与致病表型之间一致性得分,候选基因与致病基因之间语义相似性得分融合排序策略,选取起始节点,并基于节点-模块置信度及局部模块度双重约束挖掘前列腺癌候选疾病模块.通过对挖掘出的模块进行富集分析,最终得到18个有显著意义的候选疾病基因模块.与单一打分排序方法及随机游走重开始方法相比,NMCOM融合排序策略的平均排名比小、AUC值大,且挖掘出结果明显优于其他模块挖掘算法,模块生物学意义显著.NMCOM算法不仅能准确有效地挖掘前列腺癌候选疾病模块,且可扩展挖掘其他疾病候选模块. 相似文献
19.
Farutin V Robison K Lightcap E Dancik V Ruttenberg A Letovsky S Pradines J 《Proteins》2006,62(3):800-818
We present a computational approach based on a local search strategy that discovers sets of proteins that preferentially interact with each other. Such sets are referred to as protein communities and are likely to represent functional modules. Preferential interaction between module members is quantified via an analytical framework based on a network null model known as the random graph with given expected degrees. Based on this framework, the concept of local protein community is generalized to that of community of communities. Protein communities and higher-level structures are extracted from two yeast protein interaction data sets and a network of published interactions between human proteins. The high level structures obtained with the human network correspond to broad biological concepts such as signal transduction, regulation of gene expression, and intercellular communication. Many of the obtained human communities are enriched, in a statistically significant way, for proteins having no clear orthologs in lower organisms. This indicates that the extracted modules are quite coherent in terms of function. 相似文献