首页 | 本学科首页   官方微博 | 高级检索  
     


Overlooked pitfalls in multi-class machine learning classification in radiation oncology and how to avoid them
Affiliation:1. Unit of Internal Medicine, Azienda Ospedaliero Universitaria S.Orsola Malpighi di Bologna, Bologna, Italy;2. Laboratory of Bioengineering, Department of Experimental, Diagnostic and Specialty Medicine, Alma Mater Studiorum University of Bologna, Bologna, Italy;1. Department of Experimental and Clinical Biomedical Sciences "M. Serio", University of Florence, Firenze, Italy;2. Radiation Oncology Unit - Oncology Department, Azienda Ospedaliero Universitaria Careggi, Firenze, Italy;3. Department of Oncology, Radiation Oncology, University of Turin, Torino, Italy;4. Health Science Department (DISSAL), University of Genoa, Genova, Italy;5. Radiation Oncology Department, IRCCS Ospedale Policlinico San Martino, Genova, Italy;6. Fondazione Policlinico Universitario "A. Gemelli", IRCCS, Roma, Italy;7. Department of Radiation Oncology, Institut d''Oncologie Thoracique (IOT), Villejuif, France;8. Radiotherapy Unit 1, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy;9. Division of Radiation Oncology, IEO - European Institute of Oncology, IRCCS, Milan, Italy;10. Department of Oncology and Hemato-Oncology, University of Milan, Milan, Italy;11. Radiation Oncology Unit, Santa Maria della Misericordia Hospital, Rovigo, Italy;12. Radiation Oncology Unit - Department of Biomedical, Dental Science, and Morphological and Funcitional Images, University of Messina, Messina, Italy;13. Department of Radiation Oncology, University of Brescia and Spedali Civili Hospital, Brescia, Italy;14. Department of Medical Oncology, U.O.C. Clinica di Oncologia Medica, IRCCS Ospedale Policlinico San Martino, Genova, Italy;15. Department of Internal Medicine and Medical Specialties (DiMI), School of Medicine, University of Genoa, Genova, Italy
Abstract:In radiation oncology, Machine Learning classification publications are typically related to two outcome classes, e.g. the presence or absence of distant metastasis. However, multi-class classification problems also have great clinical relevance, e.g., predicting the grade of a treatment complication following lung irradiation. This work comprised two studies aimed at making work in this domain less prone to statistical blindsides.In multi-class classification, AUC is not defined, whereas correlation coefficients are. It may seem like solely quoting the correlation coefficient value (in lieu of the AUC value) is a suitable choice. In the first study, we illustrated using Monte Carlo (MC) models why this choice is misleading. We also considered the special case where the multiple classes are not ordinal, but nominal, and explained why Pearson or Spearman correlation coefficients are not only providing incomplete information but are actually meaningless.The second study concerned surrogate biomarkers for a clinical endpoint, which have purported benefits including potential for early assessment, being inexpensive, and being non-invasive. Using a MC experiment, we showed how conclusions derived from surrogate markers can be misleading. The simulated endpoint was radiation toxicity (scale of 0–5). The surrogate marker was the true toxicity grade plus a noise term. Five patient cohorts were simulated, including one control. Two of the cohorts were designed to have a statistically significant difference in toxicity. Under 1000 repeated experiments using the biomarker, these two cohorts were often found to be statistically indistinguishable, with the fraction of such occurrences rising with the level of noise.
Keywords:Machine Learning  Multi-class classification  Radiomics  Surrogate marker
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号