首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
应用多样性增量结合二次判别分析 (Increment of Diversity with Quadratic Discriminant analysis,IDQD)方法,对大肠杆菌σ70启动子进行识别.使用受试者操作特性 (receiver operating characteristic,ROC)曲线和精度召回率曲线 (Precision Recall Curves,PRC) 进行性能评估.10-fold交叉检验给出,在正负集之比为1:1时,ROC曲线下面积和PRC曲线下面积均为95%.结果表明,IDQD算法有能力应用于原核启动子的识别.识别精度高于现有算法.  相似文献   

2.
陈伟  罗辽复 《生物信息学》2009,7(2):159-162
应用多样性增量结合二次判别分析(Increment of Diversity with Quadratic Discriminant analysis, IDQD)方法,对酵母基因组中的核小体强/弱偏好序列进行了识别。10交叉检验的预测成功率超过了97%,受试者操作特性(receiver operating characteristic,ROC)曲线下面积达到了0.99,预测成功率高于现有SVM算法。最后利用构建好的分类器对酵母基因组中三类包含TATA盒基因的起始密码子ATC上游400nt下游100nt区域进行了分析。结果表明,IDQD算法有能力应用于基因组中核小体序列的识别。  相似文献   

3.
人类polⅡ启动子的识别   总被引:14,自引:2,他引:12  
依据基因启动子区和非启动子区碱基分布的特征,应用基于多样性增量的二次判别分析 (IDQD),对人类polⅡ启动子进行识别,识别精度达到90%以上的水平,优于其他已发表的 (包括SVM分类器等) 识别算法. 使用IDQD算法也能对转录起始位点 (TSS) 进行较准确的预测,10-fold交叉检验结果的敏感性和特异性分别为86%和91%. 这些结果表明IDQD是一个有效的分类器.  相似文献   

4.
应用多样性增量结合二次判别分析(Increment of Diversity with Quadratic Discriminant analysis,IDQD)方法,对酵母基因组中的核小体强/弱偏好序列进行了识别.10交叉检验的预测成功率超过了97%,受试者操作特性(receiver operating characteristic,ROC)曲线下面积达到了0.99,预测成功率高于现有SVM算法.最后利用构建好的分类器对酵母基因组中三类包含TATA盒基因的起始密码子ATG上游400nt下游100nt区域进行了分析.结果表明,IDQD算法有能力应用于基因组中核小体序列的识别.  相似文献   

5.
基于蛋白质序列组分信息,提出一个离散增量结合二次判别分析法(IDQD)预测蛋白质相互作用的模型,对人类蛋白质相互作用进行预测.自洽检验的识别精度达到75.89%,3-fold交叉检验的敏感性和特异性分别为64.22%和64.68%.结果表明IDQD算法可以用于蛋白质相互作用的预测.  相似文献   

6.
林昊 《生物信息学》2009,7(4):252-254
由于蛋白质亚细胞位置与其一级序列存在很强的相关性,利用多样性增量来描述蛋白质之间氨基酸组分和二肽组分的相似程度,采用修正的马氏判别式(这里称为IDQD方法)对分枝杆菌蛋白质的亚细胞位置进行了预测。利用Jackknife检验对不同序列相似度下的蛋白质数据集进行了预测研究,结果显示,当数据集的序列相似度小于等于70%时,算法的预测精度稳定在75%左右。在对整体852条蛋白质的预测成功率达到87.7%,这一结果优于已有算法的预测精度,说明IDQD是一种有效的分枝杆菌蛋白质亚细胞预测方法。  相似文献   

7.
文献报道采用氨基酸组成分布提取特征值能有效提高预测分类精度, 本文采用该方法提取特征值, 使用一种新的组合分类器——随机森林, 从蛋白质一级结构对嗜热和嗜冷蛋白进行分类。通过10倍交叉验证和独立样本测试两种方法检测, 结果表明:当分段数量为1时, 其精度最优, 分别为92.9%和90.2%, 暗示使用基于氨基酸组成分布提取特征值在该算法中并不能有效提高识别精度, 这与报道结果不符, 而该提取方法在SVM中却能适当提高识别精度; 当引入6个新变量后, 其精度分别提高到93.2%和92.2%, ROC曲线下面积分别为0.9771和0.9696, 优于其它组合分类器。  相似文献   

8.
文献报道采用氨基酸组成分布提取特征值能有效提高预测分类精度, 本文采用该方法提取特征值, 使用一种新的组合分类器——随机森林, 从蛋白质一级结构对嗜热和嗜冷蛋白进行分类。通过10倍交叉验证和独立样本测试两种方法检测, 结果表明:当分段数量为1时, 其精度最优, 分别为92.9%和90.2%, 暗示使用基于氨基酸组成分布提取特征值在该算法中并不能有效提高识别精度, 这与报道结果不符, 而该提取方法在SVM中却能适当提高识别精度; 当引入6个新变量后, 其精度分别提高到93.2%和92.2%, ROC曲线下面积分别为0.9771和0.9696, 优于其它组合分类器。  相似文献   

9.
基于二次判别的果蝇启动子识别   总被引:3,自引:0,他引:3  
通过对果蝇polⅡ启动子和非启动子的序列特征分析,计算了序列每个位点单碱基保守性M1(l)值和六联体保守性M6(l)值。从而分别选取两个区域的六联体频数作为离散源参数,利用离散增量结合二次判别函数(IDQD)对启动子进行了预测。对于从编码区和内含子中选取的非启动子数据集,启动子的预测成功率分别达到93%和89%。比较结果显示IDQD模型能够有效地提高启动子预测成功率。  相似文献   

10.
分析抑郁症和心理亚健康的关联代谢生物标志物,为二者的临床诊断识别以及早期药物防治提供参考。 筛选18例抑郁症患者和23例心理亚健康受试者,空腹采集其静脉血。采用1H-NMR代谢组学技术并结合单变量、多元统计、相关性以及倍数变化(Fold Change,FC)等分析方法,筛选二者内源性差异代谢物,并作为候选的关联代谢生物标志物,再以受试者工作曲线(receiver operating characteristic curve, ROC)对其诊断识别能力进行评估。选择1r1>0.6、FC>1.5为临界指标,对候选的代谢生物标志物进行筛选,候选的差异代谢物在两组受试者之间存在显著差异(P<0.05,0.01)。主要有3-OH-丁酸盐、醋酸盐、丙氨酸、甜菜碱和肉碱等。受试者工作曲线下面积(AUC)结果显示,肉碱、胆碱、组氨酸和脂质(AUC > 0.85)对于关联抑郁症和心理亚健康,具有较高的诊断价值以及预测能力,为提高抑郁症临床诊断准确度和可信度开辟新途径,并为心理亚健康受试者的早期识别和防治,阻止其发展成为精神类疾病提供参考。  相似文献   

11.
12.
13.
14.
It has now been over twenty years since a novel herpesviral genome was identified in Kaposi's sarcoma biopsies. Since then, the cumulative research effort by molecular biologists, virologists, clinicians, and epidemiologists alike has led to the extensive characterization of this tumor virus, Kaposi's sarcoma-associated herpesvirus(KSHV; also known as human herpesvirus 8(HHV-8)), and its associated diseases. Here we review the current knowledge of KSHV biology and pathogenesis, with a particular emphasis on new and exciting advances in the field of epigenetics. We also discuss the development and practicality of various cell culture and animal model systems to study KSHV replication and pathogenesis.  相似文献   

15.
16.
Comprises species occurring mostly in subtidal habitats in tropical, subtropical and warm-temperate areas of the world. An analysis of the type species, V. spiralis (Sonder) Lamouroux ex J. Agardh, a species from Australia, establishes basic characters for distinguishing species in the genus. These characters are (1) branching patterns of thalli, (2) flat blades that may be spiralled on their axis, (3) width of the blade, (4) primary or secondary derivation of sterile and fertile branchlets and (5) position of sterile and fertile branchlets on the thalli. Application of the latter two characters provides an important basic method for separation of species into three major groups. Osmundaria , a genus known only in southern Australia, was studied in relation to Vidalia , and its separation from the Vidalia assemblage is not accepted. Species of Vidalia therefore are transferred to the older genus name, Osmundaria. Two new species, Osmundaria papenfussii and Osmundaria oliveae are described from Natal. Confusion in the usage of the epithet, Vidalia fimbriala Brown ex Turner has been clarified, and Vidalia gregaria Falkenberg, described as an epiphyte on Osmundaria pro/ifera Lamouroux, is revealed to be young branches of the host, Osmundaria prolifera.  相似文献   

17.
Fifteen chromosome counts of six Artemisia taxa and one species of each of the genera Brachanthemum, Hippolytia, Kaschgaria, Lepidolopsis and Turaniphytum are reported from Kazakhstan. Three of them are new reports, two are not consistent with previous counts and the remainder are confirmations of very scarce (one to four) earlier records. All the populations studied have the same basic chromosome number, x = 9, with ploidy levels ranging from 2x to 6x. Some correlations between ploidy level, morphological characters and distribution are noted.  相似文献   

18.
肝癌中HBV和HCV基因和抗原的分布及意义   总被引:1,自引:0,他引:1  
采用原位分子杂交方法检测HCV RNA及HBV X基因;采用免疫组织化学方法研究HCV核心抗原,非结构区C33c抗原及HBxAg在肝细胞肝癌中的定位及分布.结果表明(1)HCV RNA、HBV X基因在肝细胞肝癌组织检出率分别为40%(55/136)和82%(112/136).HCV RNA定位于癌细胞的胞浆内,阳性细胞呈散在、灶状及弥漫分布三种形式;HBV X基因在肝癌细胞中的分布呈胞浆型、核型及核浆型,阳性细胞也呈上述三种分布形式;(2)HCV C33c抗原、核心抗原在肝细胞肝癌中的阳性率为81%(133/164)及86%(141/164).C33c抗原定位于癌细胞及肝细胞的胞浆内;核心抗原既定位于癌细胞核中,又可定位于胞浆中.C33c抗原阳性细胞以灶状分布为主;而核心抗原阳性细  相似文献   

19.
20.
For a plant selection model with frequency-independent viabilities, fertilities and selfing rates, it is shown that apart from global fixation, for certain parameter combinations a protected polymorphism and facultative fixation (either allele may become fixed according to initial frequencies) may both occur. Facultative fixation requires different selling rates for the dominant and recessive type. Protection of the polymorphism requires resource allocation for male and female function. In this connection the problem of purely genetically caused population extinction is discussed.
For general frequency dependence and regular segregation, the chances for establishment of a completely recessive gene are compared to those of a completely dominant gene. It is proven that the process of establishment of the recessive gene, despite a fitness advantage, may be considerably endangered by drift effects if random mating prevails. The recessive gene may reach the same effectivity in establishment as a dominant gene, only if the recessive homozygote mates exclusively with its own type during the period of establishment.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号