期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

肖国华李永进郭政彭春方王栋朱晶杨达姚晨王靖《生物信息学》2008,6(1):9-11

采用基因表达谱可以研究基因功能模块与疾病异质性之间的关系.根据两套白血病基因表达谱数据,将富集高变异基因的Gene Ontology基因功能模块作为特征功能模块,将疾病样本聚为两类.通过对比原始多类标签,采用聚类评估指标来分析两类化聚类结果的效果,并探讨特征功能模块与疾病异质性之间的关系.实验结果显示:在两套不同的白血病基因表达谱数据中得到的特征功能模块类似,它们对白血病亚型有较强的分型能力. 相似文献

2.

基于GO与基因表达谱挖掘特征基因功能类

喻辉郭政李霞《生物信息学》2003,1(1):15-19

我们研制了基于Gene Ontology与基因表达谱挖掘与实验条件相关的特征基因功能类的算法OntoFexed,它的特点是分别采用信息增益方法和Rand Index评价单个基因功能类与一组基因功能类鉴别差异表达基因与不差异表达基因的能力。算法的优点是充分利用了GO的结构信息来搜索特征功能类,并能给出各个抽象层次上的特征功能类。我们将OntoFexed应用于腺癌数据集和NCI60数据集,发现OntoFexed确能发掘与实验条件相关的功能类,且算法对主要的参数有较高的稳健性。相似文献

3.

根据GO分类体系选择与实验条件相关的差异表达功能结点进行基因功能深化预测

刘帅郭政李霞屠康徐建震《生物信息学》2005,3(2):49-52

利用有限个实验条件下的基因表达谱数据,只能对与实验条件相关的基因功能类进行有效预测,所以有必要限定可预测的基因功能类范围。据此,首先基于GeneOntology(GO)选择富集差异表达基因与实验条件相关的功能类。再通过支持向量机分类器,深化预测迄今只注释到实验条件相关功能类的父结点的基因是否属于该实验条件相关功能类。应用于一套酵母基因表达谱数据,结果显示,在剔除了高度不平衡的训练集合后,平均真阳性率(precision)与平均覆盖率(recall)都分别达到了71%与47%以上。相似文献

4.

基于Fiedler向量的基因表达谱数据分类方法 总被引：1，自引：0，他引：1

王年庄振华唐俊苏亮亮《中国生物工程杂志》2010,30(12):82-86

尝试将一种基于图的Fiedler向量的聚类算法引入到基因表达谱数据的肿瘤分类中来。该方法将分属不同类的所有样本通过高斯权构造Laplace完全图,经SVD分解后获得Fiedler向量,利用各样本所对应的Fiedler向量分量的符号差异来进行基因表达谱数据的分类。通过模拟数据仿真实验和对白血病两个亚型(ALL与AML)及结肠癌真实数据实验,证明了这一方法的有效性。相似文献

5.

结合基因功能分类体系Gene Ontology筛选聚类特征基因 总被引：3，自引：0，他引：3

徐建震郭政李霞李永进刘帅屠康《生物物理学报》2005,21(3):187-194

使用两套基因表达谱数据,按各基因的表达值方差,选择表达变异基因对样本聚类,发现一般使用方差较大的前10％的基因作为特征基因,就可以较好地对疾病样本聚类。对不同的疾病,包含聚类信息的特征基因有不同的分布特点。在此基础上,结合基因功能分类体系(Gene Ontology,GO),进一步筛选聚类的特征基因。通过检验在Gene Ontology中的每个功能类中的表达变异基因是否非随机地聚集,寻找疾病相关功能类,再根据相关功能类中的表达变异基因进行聚类分析。实验结果显示：结合基因功能体系进一步筛选表达变异基因作为聚类特征基因,可以保持或提高聚类准确性,并使得聚类结果具有明确的生物学意义。另外,发现了一些可能和淋巴瘤和白血病相关的基因。相似文献

6.

贝叶斯聚类在基因表达谱知识挖掘中的应用 总被引：1，自引：0，他引：1

包雷李泽孙之荣《生物物理学报》2002,18(1):66-70

在大规模基因表达谱的数据分析中引入了一种全新的基于贝叶斯模型的聚类算法,从生物学背景出发,研究了该算法应用在大规模基因表达谱中的理论基础和算法优越性,并应用该算法对两个公共的基因表达数据集进行了知识再挖掘。结果表明,与其他聚类算法相比,该算法在知识发现方面具有显著的优越性。挖掘出的生物学知识对该领域研究人员的实验设计也有一定的启发性。相似文献

7.

一种基于类均值的肿瘤基因芯片数据的标准化方法

王广云邱浪波王正志《生命科学研究》2007,11(3):206-211

：分析了当前常用的标准化方法在肿瘤基因芯片中引起错误分类的原因,提出了一种基于类均值的标准化方法．该方法对基因表达谱进行双向标准化,并将标准化过程与聚类过程相互缠绕,利用聚类结果来修正参照表达水平．选取了5组肿瘤基因芯片数据,用层次聚类和K-均值聚类算法在不同的方差水平上分别对常用的标准化和基于类均值的标准化处理后的基因表达数据进行聚类分析比较．实验结果表明,基于类均值的标准化方法能有效提高肿瘤基因表达谱聚类结果的质量．相似文献

8.

改进的系统发育谱算法在蛋白质功能注释中的应用

马雅楠孙平平魏雅卓陆林英崔颖马志强《生物信息学》2009,7(1):44-46,51

系统发育谱方法是目前研究较多的一种基于非同源性的生物大分子功能注释方法。针对现有算法存在的一些缺陷,从两个方面对该方法做了改进：一是构造基于权重的系统发育谱;二是采用改进的聚类算法对发育谱的相似性进行分析。从NCBI上下载100条Escherichia coli K12蛋白质作为实验数据,分别使用改进的算法和经典的层次聚类算法、K均值聚类算法对相似谱进行分析。结果显示,提出的改进算法在对相似谱聚类的精确度上明显优于后两种聚类算法。相似文献

9.

改进的K—mean聚类算法在基因系统发育谱分析中的应用

马志强孙平平马雅楠魏雅卓陆林英崔颖周春光《生物信息学》2008,6(2):82-84

简要介绍了系统发育谱法的原理,着重阐述了K—mean聚类算法在对基因系统发育谱分析中的改进,并与传统的K—mean聚类算法进行比较。实验结果表明,改进的K—mean聚类算法在运用系统发育谱法进行基因功能注释上是快而有效的,可以快速收敛到近似最优解。相似文献

10.

微阵列数据的多目标免疫优化双聚类

刘军万李舟军陈义明刘飞飞《生物信息学》2009,7(3):234-237

DNA微阵列技术的发展为基因表达研究提供更有效的工具。分析这些大规模基因数据主要应用聚类方法。最近,提出双聚类技术来发现子矩阵以揭示各种生物模式。多目标优化算法可以同时优化多个相互冲突的目标,因而是求解基因表达矩阵的双聚类的一种很好的方法。本文基于克隆选择原理提出了一个新奇的多目标免疫优化双聚类算法,来挖掘微阵列数据的双聚类。在两个真实数据集上的实验结果表明该方法比其他多目标进化双聚娄算法表现出更优越的性能。相似文献

11.

From phenotype to gene: Detecting disease-specific gene functional modules via a text-based human disease phenotype network construction

Shi-Hua Zhang Xia Li Xi Chen Bin-Sheng Gong Yu-Qing Yan 《FEBS letters》2010,584(16):3635-3643

Currently, some efforts have been devoted to the text analysis of disease phenotype data, and their results indicated that similar disease phenotypes arise from functionally related genes. These related genes work together, as a functional module, to perform a desired cellular function. We constructed a text-based human disease phenotype network and detected 82 disease-specific gene functional modules, each corresponding to a different phenotype cluster, by means of graph-based clustering and mapping from disease phenotype to gene. Since genes in such gene functional modules are functionally related and cause clinically similar diseases, they may share common genetic origin of their associated disease phenotypes. We believe the investigation may facilitate the ultimate understanding of the common pathophysiologic basis of associated diseases. 相似文献

12.

结合基因表达数据和ChIP-chip数据的酵母转录调控模块的识别

王晓敏王正志王广云黎刚果《生物物理学报》2011,(3)

活细胞依赖其众多的转录调控模块来实现复杂的生物功能,识别转录调控模块对深入理解细胞的功能及其转录机制有着重要的意义。本文结合酵母基因表达数据和ChIP-chip数据,提出了一种转录调控模块识别算法。该算法通过采用不同的P值阈值分别得到了核心集和粗糙集,然后对核心集和粗糙集进行判别,最后对基因进行扩展之后得到基因转录调控模块。将该算法运用到两个酵母基因表达数据中,得到了一些具有显著生物学意义的基因转录调控模块。与其它算法相比,该算法不仅可以识别含有较多基因的转录调控模块,而且可以识别一些其它算法不能识别的基因转录调控模块。识别得到的基因转录调控模块有着不同的生物学功能,并且有助于进一步理解酵母的转录调控机制。相似文献

13.

ModuleSearch: finding functional modules in a protein–protein interaction network

Guangyu Cui Rojan Shrestha 《Computer methods in biomechanics and biomedical engineering》2013,16(7):691-699

Many biological processes are performed by a group of proteins rather than by individual proteins. Proteins involved in the same biological process often form a densely connected sub-graph in a protein–protein interaction network. Therefore, finding a dense sub-graph provides useful information to predict the function or protein complex of uncharacterised proteins in the sub-graph. We developed a heuristic algorithm that finds functional modules in a protein–protein interaction network and visualises the modules. The algorithm has been implemented in a platform-independent, standalone program called ModuleSearch. In an interaction network of yeast proteins, ModuleSearch found 366 overlapping modules. Of the modules, 71% have a function shared by more than half the proteins in the module and 58% have a function shared by all proteins in the module. Comparison of ModuleSearch with other programs shows that ModuleSearch finds more sub-graphs than most other programs, yet a higher proportion of the sub-graphs correspond to known functional modules. ModuleSearch and sample data are freely available to academics at http://bclab.inha.ac.kr/ModuleSearch. 相似文献

14.

A network partition algorithm for mining gene functional modules of colon cancer from DNA microarray data

Ruan XG Wang JL Li JG 《基因组蛋白质组与生物信息学报(英文版)》2006,4(4):245-252

Computational analysis is essential for transforming the masses of microarray data into a mechanistic understanding of cancer. Here we present a method for finding gene functional modules of cancer from microarray data and have applied it to colon cancer. First, a colon cancer gene network and a normal colon tissue gene network were constructed using correlations between the genes. Then the modules that tended to have a homogeneous functional composition were identified by splitting up the network. Analysis of both networks revealed that they are scale-free. Comparison of the gene functional modules for colon cancer and normal tissues showed that the modules＇ functions changed with their structures. 相似文献

15.

Combining sequence and time series expression data to learn transcriptional modules

Kundaje A Middendorf M Gao F Wiggins C Leslie C 《IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM》2005,2(3):194-202

相似文献

16.

榕小蜂基因共表达研究

崔东亚孙旭斌王佳佳张鹏孙宝发陈小伟 Robert W. Murphy 何顺民黄大卫《生物化学与生物物理进展》2016,43(1):83-89

榕小蜂的雌雄个体之间存在很大表型差异,以至于很难将雌雄个体彼此联系在一起.以对叶榕传粉榕小蜂作为材料,利用"加权基因共表达网络分析"软件(WGCNA),对榕小蜂的基因组和转录组进行分析,结果发现,5个基因共表达模块,分别用蓝色、蓝绿色、棕色、绿色和黄色标识,其中2个模块偏爱在雌蜂中表达,3个模块偏爱在蛹中表达.基因本体(GO)分析发现在蓝绿色和黄色表达模块中发现3个功能富集的基因集合.在蓝绿色基因表达模块中发现2个基因集合,分别与细胞周期和核苷酸结合活性有关;在黄色基因表达模块中发现1个基因结合,与细胞分化有关,尤其是与神经发育有关. 相似文献

17.

Defining transcription modules using large-scale gene expression data 总被引：3，自引：0，他引：3

Ihmels J Bergmann S Barkai N 《Bioinformatics (Oxford, England)》2004,20(13):1993-2003

相似文献

18.

基于节点-模块置信度及局部模块度双重约束挖掘前列腺癌候选疾病模块

王一斌程咏梅张绍武《生物化学与生物物理进展》2015,42(4):375-389

前列腺癌病因及发病机理研究有助于前列腺癌预防和治疗.目前,前列腺癌生化试验研究方法成本高、耗时,而基于网络计算方法容易受基因表达谱数据不完整、噪声高及实验样本数量少等约束.为此,本文提出一种基于节点-模块置信度及局部模块度的双重约束算法(命名为NMCOM),挖掘前列腺癌候选疾病模块.NMCOM算法不依赖基因表达谱数据,采用候选基因与致病表型之间一致性得分,候选基因与致病基因之间语义相似性得分融合排序策略,选取起始节点,并基于节点-模块置信度及局部模块度双重约束挖掘前列腺癌候选疾病模块.通过对挖掘出的模块进行富集分析,最终得到18个有显著意义的候选疾病基因模块.与单一打分排序方法及随机游走重开始方法相比,NMCOM融合排序策略的平均排名比小、AUC值大,且挖掘出结果明显优于其他模块挖掘算法,模块生物学意义显著.NMCOM算法不仅能准确有效地挖掘前列腺癌候选疾病模块,且可扩展挖掘其他疾病候选模块. 相似文献

19.

Edge-count probabilities for the identification of local protein communities and their organization

Farutin V Robison K Lightcap E Dancik V Ruttenberg A Letovsky S Pradines J 《Proteins》2006,62(3):800-818

We present a computational approach based on a local search strategy that discovers sets of proteins that preferentially interact with each other. Such sets are referred to as protein communities and are likely to represent functional modules. Preferential interaction between module members is quantified via an analytical framework based on a network null model known as the random graph with given expected degrees. Based on this framework, the concept of local protein community is generalized to that of community of communities. Protein communities and higher-level structures are extracted from two yeast protein interaction data sets and a network of published interactions between human proteins. The high level structures obtained with the human network correspond to broad biological concepts such as signal transduction, regulation of gene expression, and intercellular communication. Many of the obtained human communities are enriched, in a statistically significant way, for proteins having no clear orthologs in lower organisms. This indicates that the extracted modules are quite coherent in terms of function. 相似文献