Selecting pseudo-absence data for presence-only distribution modeling: How far should you stray from what you know?

Vanderwal, Jeremy, Shoo, Luke P., Graham, Catherine and William, Stephen E. (2009) Selecting pseudo-absence data for presence-only distribution modeling: How far should you stray from what you know?. Ecological Modelling, 220 4: 589-594. doi:10.1016/j.ecolmodel.2008.11.010


Author Vanderwal, Jeremy
Shoo, Luke P.
Graham, Catherine
William, Stephen E.
Title Selecting pseudo-absence data for presence-only distribution modeling: How far should you stray from what you know?
Journal name Ecological Modelling   Check publisher's open access policy
ISSN 0304-3800
1872-7026
Publication date 2009-02
Sub-type Article (original research)
DOI 10.1016/j.ecolmodel.2008.11.010
Volume 220
Issue 4
Start page 589
End page 594
Total pages 6
Place of publication Amsterdam, Netherlands
Publisher Elsevier
Language eng
Formatted abstract
An important decision in presence-only species distribution modeling is how to select background (or pseudo-absence) localities for model parameterization. The selection of such localities may influence model parameterization and thus, can influence the appropriateness and accuracy of the model prediction when extrapolating the species distribution across time and space. We used 12 species from the Australian Wet Tropics (AWT) to evaluate the relationship between the geographic extent from which pseudo-absences are taken and model performance, and shape and importance of predictor variables using the MAXENT modeling method. Model performance is lower when pseudo-absence points are taken from either a restricted or broad region with respect to species occurrence data than from an intermediate region. Furthermore, variable importance (i.e., contribution to the model) changed such that, models became increasingly simplified, dominated by just two variables, as the area from which pseudo-absence points were drawn increased. Our results suggest that it is important to consider the spatial extent from which pseudo-absence data are taken. We suggest species distribution modeling exercises should begin with exploratory analyses evaluating what extent might provide both the most accurate results and biologically meaningful fit between species occurrence and predictor variables. This is especially important when modeling across space or time-a growing application for species distributional modeling.
Keyword Auc
Conservation
Model evaluation
Pseudo-absence data
Species distribution model
Q-Index Code C1
Q-Index Status Provisional Code
Institutional Status Non-UQ

Document type: Journal Article
Sub-type: Article (original research)
Collections: ERA 2012 Admin Only
School of Biological Sciences Publications
 
Versions
Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 238 times in Thomson Reuters Web of Science Article | Citations
Scopus Citation Count Cited 255 times in Scopus Article | Citations
Google Scholar Search Google Scholar
Created: Sun, 11 Sep 2011, 18:43:25 EST by System User on behalf of School of Biological Sciences