Improved order selection method for hidden Markov models: A case study with movement data

Abstract Hidden Markov models (HMMs) are a versatile statistical framework commonly used in ecology to characterize behavioural patterns from animal movement data. In HMMs, the observed data depend on a finite number of underlying hidden states, generally interpreted as the animal's unobserved...

Full description

Saved in:
Bibliographic Details
Main Authors: Fanny Dupont, Marianne Marcoux, Nigel Hussey, Marie Auger‐Méthé
Format: Article
Language:English
Published: Wiley 2025-06-01
Series:Methods in Ecology and Evolution
Subjects:
Online Access:https://doi.org/10.1111/2041-210X.70025
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849320580144168960
author Fanny Dupont
Marianne Marcoux
Nigel Hussey
Marie Auger‐Méthé
author_facet Fanny Dupont
Marianne Marcoux
Nigel Hussey
Marie Auger‐Méthé
author_sort Fanny Dupont
collection DOAJ
description Abstract Hidden Markov models (HMMs) are a versatile statistical framework commonly used in ecology to characterize behavioural patterns from animal movement data. In HMMs, the observed data depend on a finite number of underlying hidden states, generally interpreted as the animal's unobserved behaviour. The number of states is a crucial hyperparameter, controlling the trade‐off between the ecological interpretability of behaviours (fewer states) and the goodness of fit of the model (more states). Selecting the number of states, commonly referred to as order selection, is notoriously challenging. Common model selection metrics, such as Akaike information criterion (AIC) and Bayesian information criterion (BIC), often perform poorly in determining the number of states, particularly when models are misspecified. Building on existing methods for HMMs and mixture models, we propose a double penalised maximum likelihood estimate (DPMLE) for the simultaneous estimation of the number of states and parameters of non‐stationary HMMs. The DPMLE differs from traditional information criteria by using two penalty functions on the stationary probabilities and state‐dependent parameters. For non‐stationary HMMs, forward and backward probabilities are used to approximate stationary probabilities. Using a simulation study that includes scenarios with additional complexity in the data, we compare the performance of our method with that of AIC and BIC. We also illustrate how the DPMLE differs from AIC and BIC using narwhal (Monodon monoceros) movement data. The proposed method outperformed AIC and BIC in identifying the correct number of states under model misspecification. Furthermore, its capacity to handle non‐stationary dynamics allowed for more realistic modelling of complex movement data, offering deeper insights into narwhal behaviour. Our method is a powerful tool for order selection in non‐stationary HMMs, with potential applications extending beyond the field of ecology.
format Article
id doaj-art-810e95e0563e4e52a74048ec1ef2ec64
institution Kabale University
issn 2041-210X
language English
publishDate 2025-06-01
publisher Wiley
record_format Article
series Methods in Ecology and Evolution
spelling doaj-art-810e95e0563e4e52a74048ec1ef2ec642025-08-20T03:50:02ZengWileyMethods in Ecology and Evolution2041-210X2025-06-011661215122710.1111/2041-210X.70025Improved order selection method for hidden Markov models: A case study with movement dataFanny Dupont0Marianne Marcoux1Nigel Hussey2Marie Auger‐Méthé3Department of Statistics University of British Columbia Vancouver British Columbia CanadaFreshwater Institute, Fisheries and Oceans Canada Winnipeg Manitoba CanadaDepartment of Integrative Biology University of Windsor Windsor Ontario CanadaDepartment of Statistics University of British Columbia Vancouver British Columbia CanadaAbstract Hidden Markov models (HMMs) are a versatile statistical framework commonly used in ecology to characterize behavioural patterns from animal movement data. In HMMs, the observed data depend on a finite number of underlying hidden states, generally interpreted as the animal's unobserved behaviour. The number of states is a crucial hyperparameter, controlling the trade‐off between the ecological interpretability of behaviours (fewer states) and the goodness of fit of the model (more states). Selecting the number of states, commonly referred to as order selection, is notoriously challenging. Common model selection metrics, such as Akaike information criterion (AIC) and Bayesian information criterion (BIC), often perform poorly in determining the number of states, particularly when models are misspecified. Building on existing methods for HMMs and mixture models, we propose a double penalised maximum likelihood estimate (DPMLE) for the simultaneous estimation of the number of states and parameters of non‐stationary HMMs. The DPMLE differs from traditional information criteria by using two penalty functions on the stationary probabilities and state‐dependent parameters. For non‐stationary HMMs, forward and backward probabilities are used to approximate stationary probabilities. Using a simulation study that includes scenarios with additional complexity in the data, we compare the performance of our method with that of AIC and BIC. We also illustrate how the DPMLE differs from AIC and BIC using narwhal (Monodon monoceros) movement data. The proposed method outperformed AIC and BIC in identifying the correct number of states under model misspecification. Furthermore, its capacity to handle non‐stationary dynamics allowed for more realistic modelling of complex movement data, offering deeper insights into narwhal behaviour. Our method is a powerful tool for order selection in non‐stationary HMMs, with potential applications extending beyond the field of ecology.https://doi.org/10.1111/2041-210X.70025animal movementdouble penalised maximum likelihood estimate (DPMLE)HMMinformation criterianon‐stationaryorder selection
spellingShingle Fanny Dupont
Marianne Marcoux
Nigel Hussey
Marie Auger‐Méthé
Improved order selection method for hidden Markov models: A case study with movement data
Methods in Ecology and Evolution
animal movement
double penalised maximum likelihood estimate (DPMLE)
HMM
information criteria
non‐stationary
order selection
title Improved order selection method for hidden Markov models: A case study with movement data
title_full Improved order selection method for hidden Markov models: A case study with movement data
title_fullStr Improved order selection method for hidden Markov models: A case study with movement data
title_full_unstemmed Improved order selection method for hidden Markov models: A case study with movement data
title_short Improved order selection method for hidden Markov models: A case study with movement data
title_sort improved order selection method for hidden markov models a case study with movement data
topic animal movement
double penalised maximum likelihood estimate (DPMLE)
HMM
information criteria
non‐stationary
order selection
url https://doi.org/10.1111/2041-210X.70025
work_keys_str_mv AT fannydupont improvedorderselectionmethodforhiddenmarkovmodelsacasestudywithmovementdata
AT mariannemarcoux improvedorderselectionmethodforhiddenmarkovmodelsacasestudywithmovementdata
AT nigelhussey improvedorderselectionmethodforhiddenmarkovmodelsacasestudywithmovementdata
AT marieaugermethe improvedorderselectionmethodforhiddenmarkovmodelsacasestudywithmovementdata