Predicting species distributions in the open ocean with convolutional neural networks

Bibliographic Details
Main Authors: Morand, Gaétan; Joly, Alexis; Rouyer, Tristan; Lorieul, Titouan; Barde, Julien
Format: Article
Language: English
Published: Peer Community In 2024-09-01
Series: Peer Community Journal
Subjects: deep learning, megafauna, open ocean, pelagic species, species distribution models
Online Access: https://peercommunityjournal.org/articles/10.24072/pcjournal.471/
_version_ 1825206401380646912
author Morand, Gaétan
Joly, Alexis
Rouyer, Tristan
Lorieul, Titouan
Barde, Julien
author_sort Morand, Gaétan
collection DOAJ
description As biodiversity plummets due to anthropogenic disturbances, the conservation of oceanic species is made harder by limited knowledge of their distributions and migrations. Indeed, tracking species distributions in the open ocean is particularly challenging due to the scarcity of observations and the complex and variable nature of the ocean system. In this study, we propose a new method that leverages deep learning, specifically convolutional neural networks (CNNs), to capture spatial features of environmental variables. This approach eliminates the need to predefine these features before modelling and creates opportunities to discover unexpected correlations. Our aim is to present the results of the first trial of this method in the open ocean, discuss limitations and provide feedback for future improvements or adjustments. In this case study, we considered 38 taxa comprising pelagic fishes, elasmobranchs, marine mammals, marine turtles and birds. We trained a model to predict taxon probabilities from the environmental conditions at any specific point in space and time, using species occurrence data from the Global Biodiversity Information Facility (GBIF) and environmental data from various sources. These variables included sea surface temperature, chlorophyll concentration, salinity and fifteen others. During the testing phase, the model was applied to environmental data at locations where species occurrences were recorded. The classifier accurately predicted the observed taxon as the most likely taxon in 69% of cases and included the observed taxon among the top three most likely predictions in 89% of cases. These findings show the adequacy of deep learning for species distribution modelling in the open ocean. This purely correlative model was then analysed with explainability tools to understand which variables influenced the model’s predictions. While variable importance was species-dependent, we identified finite-size Lyapunov exponents (FSLEs), sea surface temperature, pH and salinity as the most influential variables, in that order. These insights can prove valuable for future species-specific ecology studies.
format Article
id doaj-art-6cda5843b9b0473ea1b88640761fedd1
institution Kabale University
issn 2804-3871
language English
publishDate 2024-09-01
publisher Peer Community In
record_format Article
series Peer Community Journal
spelling doaj-art-6cda5843b9b0473ea1b88640761fedd1
2025-02-07T10:17:17Z
eng
Peer Community In
Peer Community Journal
2804-3871
2024-09-01
4
10.24072/pcjournal.471
Predicting species distributions in the open ocean with convolutional neural networks
Morand, Gaétan (https://orcid.org/0000-0002-0826-4487), UMR Marbec, IRD, Univ. Montpellier, CNRS, Ifremer - Montpellier, France
Joly, Alexis (https://orcid.org/0000-0002-2161-9940), INRIA, Montpellier, France
Rouyer, Tristan (https://orcid.org/0000-0002-0172-8031), UMR Marbec, IRD, Univ. Montpellier, CNRS, Ifremer - Montpellier, France
Lorieul, Titouan (https://orcid.org/0000-0001-5228-9238), INRIA, Montpellier, France
Barde, Julien (https://orcid.org/0000-0002-3519-6141), UMR Marbec, IRD, Univ. Montpellier, CNRS, Ifremer - Montpellier, France
https://peercommunityjournal.org/articles/10.24072/pcjournal.471/
deep learning, megafauna, open ocean, pelagic species, species distribution models
title Predicting species distributions in the open ocean with convolutional neural networks
topic deep learning, megafauna, open ocean, pelagic species, species distribution models
url https://peercommunityjournal.org/articles/10.24072/pcjournal.471/