Book contents
- Frontmatter
- Contents
- Preface
- Acknowledgments
- Part I History and ecological basis of species distribution modeling
- Part II The data needed for modeling species distributions
- Part III An overview of the modeling methods
- 6 Statistical models – modern regression
- 7 Machine learning methods
- 8 Classification, similarity and other methods for presence-only data
- Part IV Model evaluation and implementation
- References
- Index
7 - Machine learning methods
Published online by Cambridge University Press: 05 August 2012
- Frontmatter
- Contents
- Preface
- Acknowledgments
- Part I History and ecological basis of species distribution modeling
- Part II The data needed for modeling species distributions
- Part III An overview of the modeling methods
- 6 Statistical models – modern regression
- 7 Machine learning methods
- 8 Classification, similarity and other methods for presence-only data
- Part IV Model evaluation and implementation
- References
- Index
Summary
Introduction
As discussed in the overview of Part III, species distribution modeling can be treated as a supervised learning problem – observations of a response, such as species presence or absence, and associated environmental predictors, are used to develop rules that can be used to classify new observations where the values of the predictors, but not the response, are known. Statistical or machine learning approaches can be used to solve a supervised learning problem. In Chapter 6 it was noted that the linear (regression) model can be thought of as a model-driven or parametric approach to statistical learning, in which certain assumptions are made about the form of the model, and also a “global” method, meaning that all of the data (observations) are used to estimate the parameters. In other words, the problem in supervised learning is to construct a function that “maps” inputs X to outcome Y. In statistical inference the distributional form is chosen by the analyst and its parameters are estimated from the data. Machine learning methods, in contrast, are various kinds of algorithms that are used to learn the mapping function or classification rules inductively, directly from the training data (Breiman, 2001a; Gahegan, 2003).
- Type
- Chapter
- Information
- Mapping Species DistributionsSpatial Inference and Prediction, pp. 154 - 179Publisher: Cambridge University PressPrint publication year: 2010