Selective state models are what you need for animal action recognition

Recognizing animal actions provides valuable insights into animal welfare, yielding crucial information for agricultural, ethological, and neuroscientific research. While video-based action recognition models have been applied to this task, current approaches often rely on computationally intensive...

Full description

Saved in:

Bibliographic Details
Main Authors:	Edoardo Fazzari, Donato Romano, Fabrizio Falchi, Cesare Stefanini
Format:	Article
Language:	English
Published:	Elsevier 2025-03-01
Series:	Ecological Informatics
Subjects:	Animal action recognition Deep learning Selective state models Computer vision Mamba Msqnet
Online Access:	http://www.sciencedirect.com/science/article/pii/S1574954124004977
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832595421091856384
author	Edoardo Fazzari Donato Romano Fabrizio Falchi Cesare Stefanini
author_facet	Edoardo Fazzari Donato Romano Fabrizio Falchi Cesare Stefanini
author_sort	Edoardo Fazzari
collection	DOAJ
description	Recognizing animal actions provides valuable insights into animal welfare, yielding crucial information for agricultural, ethological, and neuroscientific research. While video-based action recognition models have been applied to this task, current approaches often rely on computationally intensive Transformer layers, limiting their practical application in field settings such as farms and wildlife reserves. This study introduces Mamba-MSQNet, a novel architecture family for multilabel Animal Action Recognition using Selective Space Models. By transforming the state-of-the-art MSQNet model with Mamba blocks, we achieve significant reductions in computational requirements: up to 90% fewer Floating point OPerations and 78% fewer parameters compared to MSQNet. These optimizations not only make the model more efficient but also enable it to outperform Transformer-based counterparts on the Animal Kingdom dataset, achieving a mean Average Precision of 74.6, marking an improvement over previous architectures. This combination of enhanced efficiency and improved performance represents a significant advancement in the field of animal action recognition. The dramatic reduction in computational demands, coupled with a performance boost, opens new possibilities for real-time animal behavior monitoring in resource-constrained environments. This enhanced efficiency could revolutionize how we observe and analyze animal behavior, potentially leading to breakthroughs in animal welfare assessment, behavioral studies, and conservation efforts.
format	Article
id	doaj-art-bae440bcde364fda86a07361a093930d
institution	Kabale University
issn	1574-9541
language	English
publishDate	2025-03-01
publisher	Elsevier
record_format	Article
series	Ecological Informatics
spelling	doaj-art-bae440bcde364fda86a07361a093930d2025-01-19T06:24:39ZengElsevierEcological Informatics1574-95412025-03-0185102955Selective state models are what you need for animal action recognitionEdoardo Fazzari0Donato Romano1Fabrizio Falchi2Cesare Stefanini3The BioRobotics Institute, Sant’Anna School of Advanced Studies, Viale Rinaldo Piaggio, Pontedera, 56025, Tuscany, Italy; Department of Excellence in Robotics and AI, Sant’Anna School of Advanced Studies, Piazza Martiri della Libertà, Pisa, 56127, Tuscany, Italy; Institute of Information Science and Technologies, National Research Council of Italy, Via G. Moruzzi, Pisa, 56124, Tuscany, Italy; Corresponding author at: The BioRobotics Institute, Sant’Anna School of Advanced Studies, Viale Rinaldo Piaggio, Pontedera, 56025, Tuscany, Italy.The BioRobotics Institute, Sant’Anna School of Advanced Studies, Viale Rinaldo Piaggio, Pontedera, 56025, Tuscany, Italy; Department of Excellence in Robotics and AI, Sant’Anna School of Advanced Studies, Piazza Martiri della Libertà, Pisa, 56127, Tuscany, ItalyThe BioRobotics Institute, Sant’Anna School of Advanced Studies, Viale Rinaldo Piaggio, Pontedera, 56025, Tuscany, Italy; Institute of Information Science and Technologies, National Research Council of Italy, Via G. Moruzzi, Pisa, 56124, Tuscany, ItalyThe BioRobotics Institute, Sant’Anna School of Advanced Studies, Viale Rinaldo Piaggio, Pontedera, 56025, Tuscany, Italy; Department of Excellence in Robotics and AI, Sant’Anna School of Advanced Studies, Piazza Martiri della Libertà, Pisa, 56127, Tuscany, ItalyRecognizing animal actions provides valuable insights into animal welfare, yielding crucial information for agricultural, ethological, and neuroscientific research. While video-based action recognition models have been applied to this task, current approaches often rely on computationally intensive Transformer layers, limiting their practical application in field settings such as farms and wildlife reserves. This study introduces Mamba-MSQNet, a novel architecture family for multilabel Animal Action Recognition using Selective Space Models. By transforming the state-of-the-art MSQNet model with Mamba blocks, we achieve significant reductions in computational requirements: up to 90% fewer Floating point OPerations and 78% fewer parameters compared to MSQNet. These optimizations not only make the model more efficient but also enable it to outperform Transformer-based counterparts on the Animal Kingdom dataset, achieving a mean Average Precision of 74.6, marking an improvement over previous architectures. This combination of enhanced efficiency and improved performance represents a significant advancement in the field of animal action recognition. The dramatic reduction in computational demands, coupled with a performance boost, opens new possibilities for real-time animal behavior monitoring in resource-constrained environments. This enhanced efficiency could revolutionize how we observe and analyze animal behavior, potentially leading to breakthroughs in animal welfare assessment, behavioral studies, and conservation efforts.http://www.sciencedirect.com/science/article/pii/S1574954124004977Animal action recognitionDeep learningSelective state modelsComputer visionMambaMsqnet
spellingShingle	Edoardo Fazzari Donato Romano Fabrizio Falchi Cesare Stefanini Selective state models are what you need for animal action recognition Ecological Informatics Animal action recognition Deep learning Selective state models Computer vision Mamba Msqnet
title	Selective state models are what you need for animal action recognition
title_full	Selective state models are what you need for animal action recognition
title_fullStr	Selective state models are what you need for animal action recognition
title_full_unstemmed	Selective state models are what you need for animal action recognition
title_short	Selective state models are what you need for animal action recognition
title_sort	selective state models are what you need for animal action recognition
topic	Animal action recognition Deep learning Selective state models Computer vision Mamba Msqnet
url	http://www.sciencedirect.com/science/article/pii/S1574954124004977
work_keys_str_mv	AT edoardofazzari selectivestatemodelsarewhatyouneedforanimalactionrecognition AT donatoromano selectivestatemodelsarewhatyouneedforanimalactionrecognition AT fabriziofalchi selectivestatemodelsarewhatyouneedforanimalactionrecognition AT cesarestefanini selectivestatemodelsarewhatyouneedforanimalactionrecognition

Selective state models are what you need for animal action recognition

Similar Items