Analysis of alcoholism data using support vector machines期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Analysis of alcoholism data using support vector machines

Authors:	Yu Robert Shete Sanjay

Affiliation:	Department of Epidemiology, Unit 1340, The University of Texas M. D. Anderson Cancer Center, Houston, TX 77030, USA. rkyu@mdanderson.org

Abstract:	A supervised learning method, support vector machine, was used to analyze the microsatellite marker dataset of the Collaborative Study on the Genetics of Alcoholism Problem 1 for the Genetic Analysis Workshop 14. Twelve binary-valued phenotype variables were chosen for analyses using the markers from all autosomal chromosomes. Using various polynomial kernel functions of the support vector machine and randomly divided genome regions, we were able to observe the association of some marker sets with the chosen phenotypes and thus reduce the size of the dataset. The successful classifications established with the chosen support vector machine kernel function had high levels of correctness for each prediction, e.g., 96% in the fourfold cross-validations. However, owing to the limited sample data, we were not able to test the predictions of the classifiers in the new sample data.

Keywords:
本文献已被 PubMed 等数据库收录！