首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Data-Mining Discovery of Pattern and Process in Ecological Systems
Authors:WESLEY M HOCHACHKA  RICH CARUANA  DANIEL FINK  ART MUNSON  MIREK RIEDEWALD  DARIA SOROKINA  STEVE KELLING
Institution:1. Laboratory of Ornithology, Cornell University, Ithaca, NY 14850, USA

E-mail: wmh6@cornell.edu;2. Department of Computer Science, Cornell University, Ithaca, NY 14853, USA;3. Laboratory of Ornithology, Cornell University, Ithaca, NY 14850, USA

Abstract:ABSTRACT Most ecologists use statistical methods as their main analytical tools when analyzing data to identify relationships between a response and a set of predictors; thus, they treat all analyses as hypothesis tests or exercises in parameter estimation. However, little or no prior knowledge about a system can lead to creation of a statistical model or models that do not accurately describe major sources of variation in the response variable. We suggest that under such circumstances data mining is more appropriate for analysis. In this paper we 1) present the distinctions between data-mining (usually exploratory) analyses and parametric statistical (confirmatory) analyses, 2) illustrate 3 strengths of data-mining tools for generating hypotheses from data, and 3) suggest useful ways in which data mining and statistical analyses can be integrated into a thorough analysis of data to facilitate rapid creation of accurate models and to guide further research.
Keywords:bagging  data mining  decision trees  exploratory data analysis  hypothesis generation  machine learning  prediction
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号