Information-Theoretic Modeling of Categorical Spatiotemporal GIS Data

An information-theoretic data mining method is employed to analyze categorical spatiotemporal Geographic Information System land use data. Reconstructability Analysis (RA) is a maximum-entropy-based data modeling methodology that works exclusively with discrete data such as those in the National Lan...

Full description

Saved in:
Bibliographic Details
Main Authors: David Percy, Martin Zwick
Format: Article
Language:English
Published: MDPI AG 2024-09-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/26/9/784
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850261330730156032
author David Percy
Martin Zwick
author_facet David Percy
Martin Zwick
author_sort David Percy
collection DOAJ
description An information-theoretic data mining method is employed to analyze categorical spatiotemporal Geographic Information System land use data. Reconstructability Analysis (RA) is a maximum-entropy-based data modeling methodology that works exclusively with discrete data such as those in the National Land Cover Database (NLCD). The NLCD is organized into a spatial (raster) grid and data are available in a consistent format for every five years from 2001 to 2021. An NLCD tool reports how much change occurred for each category of land use; for the study area examined, the most dynamic class is Evergreen Forest (EFO), so the presence or absence of EFO in 2021 was chosen as the dependent variable that our data modeling attempts to predict. RA predicts the outcome with approximately 80% accuracy using a sparse set of cells from a spacetime data cube consisting of neighboring lagged-time cells. When the predicting cells are all Shrubs and Grasses, there is a high probability for a 2021 state of EFO, while when the predicting cells are all EFO, there is a high probability that the 2021 state will not be EFO. These findings are interpreted as detecting forest clear-cut cycles that show up in the data and explain why this class is so dynamic. This study introduces a new approach to analyzing GIS categorical data and expands the range of applications that this entropy-based methodology can successfully model.
format Article
id doaj-art-9d1c55d1f4a74d2cbdd7319994261d2c
institution OA Journals
issn 1099-4300
language English
publishDate 2024-09-01
publisher MDPI AG
record_format Article
series Entropy
spelling doaj-art-9d1c55d1f4a74d2cbdd7319994261d2c2025-08-20T01:55:27ZengMDPI AGEntropy1099-43002024-09-0126978410.3390/e26090784Information-Theoretic Modeling of Categorical Spatiotemporal GIS DataDavid Percy0Martin Zwick1Geology Department, Portland State University, Portland, OR 97207, USAComplex Systems Department, Portland State University, Portland, OR 97207, USAAn information-theoretic data mining method is employed to analyze categorical spatiotemporal Geographic Information System land use data. Reconstructability Analysis (RA) is a maximum-entropy-based data modeling methodology that works exclusively with discrete data such as those in the National Land Cover Database (NLCD). The NLCD is organized into a spatial (raster) grid and data are available in a consistent format for every five years from 2001 to 2021. An NLCD tool reports how much change occurred for each category of land use; for the study area examined, the most dynamic class is Evergreen Forest (EFO), so the presence or absence of EFO in 2021 was chosen as the dependent variable that our data modeling attempts to predict. RA predicts the outcome with approximately 80% accuracy using a sparse set of cells from a spacetime data cube consisting of neighboring lagged-time cells. When the predicting cells are all Shrubs and Grasses, there is a high probability for a 2021 state of EFO, while when the predicting cells are all EFO, there is a high probability that the 2021 state will not be EFO. These findings are interpreted as detecting forest clear-cut cycles that show up in the data and explain why this class is so dynamic. This study introduces a new approach to analyzing GIS categorical data and expands the range of applications that this entropy-based methodology can successfully model.https://www.mdpi.com/1099-4300/26/9/784GISreconstructability analysisspatiotemporalcategorical dataforest predictionspace–time data cube
spellingShingle David Percy
Martin Zwick
Information-Theoretic Modeling of Categorical Spatiotemporal GIS Data
Entropy
GIS
reconstructability analysis
spatiotemporal
categorical data
forest prediction
space–time data cube
title Information-Theoretic Modeling of Categorical Spatiotemporal GIS Data
title_full Information-Theoretic Modeling of Categorical Spatiotemporal GIS Data
title_fullStr Information-Theoretic Modeling of Categorical Spatiotemporal GIS Data
title_full_unstemmed Information-Theoretic Modeling of Categorical Spatiotemporal GIS Data
title_short Information-Theoretic Modeling of Categorical Spatiotemporal GIS Data
title_sort information theoretic modeling of categorical spatiotemporal gis data
topic GIS
reconstructability analysis
spatiotemporal
categorical data
forest prediction
space–time data cube
url https://www.mdpi.com/1099-4300/26/9/784
work_keys_str_mv AT davidpercy informationtheoreticmodelingofcategoricalspatiotemporalgisdata
AT martinzwick informationtheoreticmodelingofcategoricalspatiotemporalgisdata