ABSTRACT
This paper presents an artificial neural network (ANN) for modeling multicategorical land use changes.
Compared to conventional statistical models and cellular automata models, ANNs have both the architecture
appropriate for addressing complex problems and the power for spatio-temporal prediction. The model consists
of two layers with multiple input and output units. Bayesian regularization was used for network training in
order to select an optimal model that avoids over-fitting problem. When trained and applied to predict changes
in parcel use in a coastal county from 1990 to 2008, the ANN model performed well as measured by high
prediction accuracy (82.0-98.5%) and high Kappa coefficient (81.4-97.5%) with only slight variation across
five different land use categories. ANN also outperformed the benchmark multinomial logistic regression by
average 17.5 percentage points in categorical accuracy and by 9.2 percentage points in overall accuracy.
The authors used the ANN model to predict future parcel use change from 2007 to 2030.
Keywords:
INTRODUCTION
Land use change has been a major cause of many
environmental problems. Changes are often
necessary to accommodate population growth
and economic development. Degraded environments, however, threaten sustainable development. As more states and local governments are
willing to create and implement smart growth
strategies, planners and policy makers need to
DOI: 10.4018/jagr.2011070102
Copyright 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
BACKGROUND
ANNs have been around since the early 1940s,
but it was not until the mid 1990s that they were
introduced into the geosciences for spatial interaction, interregional telecommunication, resource management, suitability assessment, and
image classification (Fischer & Gopal, 1994;
Openshaw, 1993). Openshaw and Openshaw
(1997) identified four major benefits of neural
networks for modeling: better performance,
greater representational flexibility and freedom
Copyright 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
from current model design constrains, the opportunity to handle explicitly noisy data, and
the incorporation of spatial dependency in the
net representation (which is currently ignored).
Researchers in land use modeling have been
a half-decade slower in realizing such benefits.
Pioneering studies mainly focus the use of
ANNs for simulating the change in urbanization
patterns (Li & Yeh, 2001; Pijanowski, Gage, &
Long, 2000; Pijanowski et al., 2002; Yeh & Li,
2002). Pijanowski et al. (2005) later incorporated an ANN as one of the three components
of the Land Transformation Model (LTM), and
applied it in several other areas. Allen and Lu
(2004, 2006) found that ANN models not only
attained fair to high prediction accuracy but also
outperformed the logistic regression in several
applications in coastal regions. They also found
that ANN models were capable of discerning
isolated, sparsely distributed features or events
that other models often fail to predict. Land
use systems modeled in these studies are all
binary. It remains to be tested whether ANNs
can achieve similar success in predicting multicategory land use change.
Figure 1. A two-layer feedforward neural network for modeling multi-categorical land uses
Copyright 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
METHODS
Study Area
Beaufort County, South Carolina, was selected
as the study area. It is a well-known coastal
tourist destination. The county covers 923 mi2 in
area, with 63 percent land and 37 percent water.
Driven by tourism development, new immigration, service sector expansion, and military
demand, Beaufort County has seen tremendous
growth in population, urban area, and impervious surface over the past two decades. From
1990 to 2000, its population increased by 39.93
percent, from 86,425 to 120,935. This growth
rate triples the national average (13 percent)
and leads all counties in South Carolina. From
1990 to 2008, the total number of built parcels
increased by 136.4 percent from 23,446 to
56,929. Impervious surface almost quadrupled
(380 percent) in the fast growing Bluffton area.
The land conversion rate has been high, with an
average of 2,575 parcels (1,500 acres) per year
over the last decade. As one of the top seven
retiree communities in the US, the county is
expected to continue growing at a rapid pace.
These changes impose tremendous pressure on
natural environments, local infrastructure, and
public services. There is a tremendous demand
for predictions and impact assessments needed
for smart growth decisions.
Copyright 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Data Preparation
Parcel use in 2008 was derived from the parcel
shape file (ESRI, Inc., Redlands, California)
and Tax Assessors Table provided by Beaufort
County. We grouped parcel uses into five general
categories: 1) commercial, including all commercial, industrial, institutional, and service/
utility uses; 2) urban residential, including all
developed accommodation use < 5 ac; 3) rural
residential, with parcel size falling between 5-10
ac; 4) recreational (only golf courses); and 5)
undeveloped, including wetlands, open space,
forestlands, croplands, parks and reserves,
and other preserved and protected lands. Of
a total 99,781 parcels, 33,250 (33.32 percent)
were developed or built between 1990 and
2008. We call this new development or net
change for model validation. Because of rich
temporal information, we were able to derive
changes on an annual or decadal basis for different land uses, periods, and regions. We used
the growth rates for three different lengths of
Independent Variables
Physical properties
land area
wetland ratio
slope
distance to water front
forest wetland
Road accessibility
Proximity to utilities
distance to waterline
and distance to sewer line
Neighborhood effects
Demographic features
block population
block neighborhood population
Land value
Land ownership
private ownership
Copyright 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
RESULTS
Model Performance
We performed multinomial regression analysis
in a preliminary study to screen predictive
variables. Eighteen of the variables included in
the preliminary study were found statistically
significant and thus used to construct the ANN
model. Based on the 4:2:2 ratios, the training
sample was split into three subsets for training,
validation, and testing, respectively, during the
training process. The training process stopped
within 100 epochs in several runs. MSE is
0.018 for the training subset and 0.038 for both
testing and validation subsets. Results of BR
suggest that an optimal model with 415 effective
parameters (18 x 17 x 5) would be adequate.
The ANN model performed well against
the training dataset (Table 2). The prediction
accuracy, overall or categorical, is quite impressive as measured in terms of correct percent
(82.0-98.5 percent) and Kappa coefficients
(0.81-0.98), particularly for a change of 33.3
percent over newly two decades. Kappa coefficient is considered a conservative measure of
agreement with chance corrected, and a value
above 0.80 suggests an excellent prediction.
Although both accuracy and Kappa tend to
increase with the total number of parcels within
the category, their variations across different
land uses are relatively small.
The benchmark MLR was also statistically significant based on the likelihood ratio
tests (Chi-square = 11,100, df = 76, p < 0.001).
It was, however, outperformed by the ANN
model in all measurements (Tables 2 and 3).
The ANN model improved prediction accuracy
Copyright 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Table 2. Prediction accuracy of the ANN model against the training sample
Observed
Predicted
I
II
III
IV
Row Total
Accuracy
(%)
Kappa
511
24
24
574
89.0
0.88
II
12
2815
30
14
2877
97.8
0.97
III
10
22
866
38
943
91.8
0.91
IV
218
34
266
82.0
0.81
16
11
18
3105
3152
98.5
0.98
Column total
553
2877
943
242
3197
7812
96.2
0.94b
Notes: I=Commercial, II=Urban residential, III=Rural residential, IV=Golf course, and V=Other use.
a. Overall prediction accuracy, b. Overall kappa coefficient.
Predicted Changes
The predicted long-term trajectories for parcel
development are shown in Figure 2. They reflect
quantitative demands at three different growth
rates. We used 2008 data as the new input of
the trained ANN model to predict land transition probabilities for five parcel uses. These
two types of information were used to map
future change through 2030. Figure 3 shows
the distribution of predicted parcel use changes
Table 3. Prediction accuracy of the MLR model against the training sample
Observed
Predicted
Row Total
Accuracy
(%)
Kappa
35
574
55.4
0.53
II
III
IV
318
197
12
12
II
19
2659
50
146
2877
92.4
0.88
III
74
147
562
156
943
59.6
0.56
IV
13
24
177
47
266
66.5
0.66
52
3083
3152
97.8
0.96
Column total
429
3079
637
200
3467
7812
87.0a
0.80b
Notes: I=Commercial, II=Urban residential, III=Rural residential, IV=Golf course, and V=Other use.
a. Overall prediction accuracy, b. Overall kappa coefficient.
Copyright 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Training Set
Validation Sets
Use-based
Area-based
Net Change a
Full Set
89.0
86.6
83.8
79.2
87.1
II
97.8
96.5
96.5
96.6
96.2
III
91.8
88.5
84.9
80.7
87.4
IV
82.0
79.8
77.6
94.2
80.6
98.5
97.7
97.6
Overall
96.2
96.4
96.1
97.7
94.9
96.2
Notes: I=Commercial, II=Urban residential, III=Rural residential, IV=Golf course, and V=Other use.
a
Category V or Other use is treated as the background category and is thus not applicable in this case.
DISCUSSION AND
CONCLUSION
A neural network was applied to predict parcel
use change in Beaufort County, South Carolina.
The model is not only theoretically appropriate
Copyright 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Figure 3. Predicted parcel use changes in Beaufort County from 2000 to 2030 under different
growth scenarios
world units for land transaction and development (Landis, 1994). Many parcel attributes
are predictive variables with measurements
that can be readily incorporated into a model
to improve model semantics and prediction
accuracy. Also, many property variables are
spatially well-defined by parcel boundaries and
no additional error is induced by aggregation
or disaggregation. For the same reason, it is
less difficult to interpret the modeling results.
Copyright 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Table 5. Potential impact of the growth boundary on regional development in Beaufort County
Growth Boundary
Region
Yes
Yes-No
Area
Area
N%
Area%
5132
4592
-100
-100
S Beaufort Islands
2726
2038
1664
875
64
133
Hilton Head
5239
3968
4884
3035
31
NE Corner
958
1341
-100
-100
10332
8148
9631
5945
37
Bluffton
9404
7318
8687
5677
Entire County
27701
21472
30956
21466
-11
St Henna Island
29
a
<1a
Notes: Prediction is based on the 1990-2008 growth rate. Area units are acres.
a
Discrepancies are caused by variable parcel sizes due to the use of area-based demand.
ACKNOWLEDGMENT
We would like to thank South Carolina Sea
Grant and NOAAs National Centers for Coastal
Ocean Science for funding this project.
REFERENCES
Allen, J. S., & Lu, K. S. (2006). Predicting trajectories
of urban growth in the coastal Southeast. In G. S.
Kleppel, M. R. Devoe & M. V. Rawson (Eds.), Changing land use patterns in the coastal zone: Managing
environmental quality in rapidly developing regions
(pp. 47-67). New York, NY: Springer.
Allen, J. S., & Lu, K. S. (2003). Modeling and prediction of future urban growth in the Charleston region
of South Carolina: A GIS-based integrated approach.
Conservation Ecology, 8(2), 120.
Allen, J. S., & Lu, K. S. (2004). Artificial neural
network vs. binary logistic regression: Two alternative models for predicting urban growth in the Myrtle
Beach region. Paper presented at the NOAAs Land
Use-Coastal Ecosystem Study (LU-CES) Program.
Batty, M., & Torrens, P. M. (2005). Modeling and
prediction in a complex world. Futures, 37(7),
745766. doi:10.1016/j.futures.2004.11.003
Copyright 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Boissau, S., & Castella, J. (2003). Constructing a common representation of local institutions and land use
systems through simulation-gaming and multi-agent
modeling in rural areas of Northern Vietnam: The
SAMBA-Week methodology. Simulation & Gaming,
34(3), 342347. doi:10.1177/1046878103255789
Castella, J., Boissau, S., Trung, T. N., & Quang, D.
D. (2005). Agrarian transition and lowland-upland
interactions in mountain areas in northern Vietnam:
Application of a multi-agent simulation model. Agricultural Systems, 86(3), 312332. doi:10.1016/j.
agsy.2004.11.001
Dietzel, C. K., & Clarke, K. C. (2004). Spatial differences in multi-resolution urban automata modeling.
Transactions in GIS, 8, 479492. doi:10.1111/j.14679671.2004.00197.x
Duan, Z., Verburg, P. H., Zhang, F., & Yu, Z. (2004).
Construction of a land-use change simulation model
and its application in Haidian District, Beijing. Acta
Geographica Sinica, 59(6), 10371046.
Fischer, M. M., & Gopal, S. (1994). Artificial neural
networks: A new approach to modeling interregional
telecommunication flows. Journal of Regional Science, 34(4), 503527. doi:10.1111/j.1467-9787.1994.
tb00880.x
Landis, J. (1994). The California urban futures model:
A new generation of metropolitan simulation models.
Environment and Planning. B, Planning & Design,
21, 399420. doi:10.1068/b210399
Li, X., & Yeh, A. G. O. (2001). Calibration of cellular
automata by using neural networks for the simulation
of complex urban systems. Environment & Planning
A, 33(8), 14451462. doi:10.1068/a33210
Lu, K. S., & Allen, J. S. (in press). Neural networks
for modeling urban growth patterns and potential
ecosystem impact in the coastal zone . In Brouwer,
F., & Goetz, S. J. (Eds.), The dynamics of land use
and ecosystem interactions: A transatlantic, multidisciplinary and comparative approach. New York,
NY: Springer.
MacKay, D. J. C. (1992). Bayesian interpolation.
Neural Computation, 4(3), 415447. doi:10.1162/
neco.1992.4.3.415
Openshaw, S. (1993). Modeling spatial interaction
using a neural net . In Fischer, M. M., & Nijkamp,
P. (Eds.), GIS spatial modeling and policy (pp.
147164). Berlin, Germany: Springer-Verlag.
Copyright 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Veldkamp, T. A., & Fresco, L. (1996). CLUECR: An integrated multi-scale model to simulate
land use change scenarios in Costa Rica. Ecological Modelling, 91, 231248. doi:10.1016/03043800(95)00158-1
Verburg, P. H., Soepboer, W., Veldkamp, T. A., Limpiada, R., Espaldon, V., & Mastura, S. A. S. (2002).
Modeling the spatial dynamics of regional land use:
The CLUE-S model. Environmental Management,
30(3), 391405. doi:10.1007/s00267-002-2630-x
Copyright 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.