Analysis of alcoholism data using support vector machines |
| |
Authors: | Yu Robert Shete Sanjay |
| |
Affiliation: | Department of Epidemiology, Unit 1340, The University of Texas M. D. Anderson Cancer Center, Houston, TX 77030, USA. rkyu@mdanderson.org |
| |
Abstract: | A supervised learning method, support vector machine, was used to analyze the microsatellite marker dataset of the Collaborative Study on the Genetics of Alcoholism Problem 1 for the Genetic Analysis Workshop 14. Twelve binary-valued phenotype variables were chosen for analyses using the markers from all autosomal chromosomes. Using various polynomial kernel functions of the support vector machine and randomly divided genome regions, we were able to observe the association of some marker sets with the chosen phenotypes and thus reduce the size of the dataset. The successful classifications established with the chosen support vector machine kernel function had high levels of correctness for each prediction, e.g., 96% in the fourfold cross-validations. However, owing to the limited sample data, we were not able to test the predictions of the classifiers in the new sample data. |
| |
Keywords: | |
本文献已被 PubMed 等数据库收录! |
|