Estimating Chlorophyll-<i>a</i> and Phycocyanin Concentrations in Inland Temperate Lakes across New York State Using Sentinel-2 Images: Application of Google Earth Engine for Efficient Satellite Image Processing

Harmful algae blooms (HABs) have been reported with greater frequency in lakes across New York State (NYS) in recent years. In situ sampling is used to assess water quality, but such observations are time intensive and therefore practically limited in their spatial extent. Previous research has used...

Full description

Saved in:
Bibliographic Details
Main Authors: Sara Akbarnejad Nesheli, Lindi J. Quackenbush, Lewis McCaffrey
Format: Article
Language:English
Published: MDPI AG 2024-09-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/16/18/3504
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850259654579322880
author Sara Akbarnejad Nesheli
Lindi J. Quackenbush
Lewis McCaffrey
author_facet Sara Akbarnejad Nesheli
Lindi J. Quackenbush
Lewis McCaffrey
author_sort Sara Akbarnejad Nesheli
collection DOAJ
description Harmful algae blooms (HABs) have been reported with greater frequency in lakes across New York State (NYS) in recent years. In situ sampling is used to assess water quality, but such observations are time intensive and therefore practically limited in their spatial extent. Previous research has used remote sensing imagery to estimate phytoplankton pigments (typically chlorophyll-<i>a</i> or phycocyanin) as HAB indicators. The primary goal of this study was to validate a remote sensing-based method to estimate cyanobacteria concentrations at high temporal (5 days) and spatial (10–20 m) resolution, to allow identification of lakes across NYS at a significant risk of algal blooms, thereby facilitating targeted field investigations. We used Google Earth Engine (GEE) as a cloud computing platform to develop an efficient methodology to process Sentinel-2 image collections at a large spatial and temporal scale. Our research used linear regression to model the correlation between in situ observations of chlorophyll-<i>a</i> (Chl-a) and phycocyanin and indices derived from Sentinel-2 data to evaluate the potential of remote sensing-derived inputs for estimating cyanobacteria concentrations. We tested the performance of empirical models based on seven remote-sensing-derived indices, two in situ measurements, two cloud mitigation approaches, and three temporal sampling windows across NYS lakes for 2019 and 2020. Our best base model (R<sup>2</sup> of 0.63), using concurrent sampling data and the ESA cloud masking—i.e., the QA60 bitmask—approach, related the maximum peak height (MPH) index to phycocyanin concentrations. Expanding the temporal match using a one-day time window increased the available training dataset size and improved the fit of the linear regression model (R<sup>2</sup> of 0.71), highlighting the positive impact of increasing the training dataset on model fit. Applying the Cloud Score+ method for filtering cloud and cloud shadows further improved the fit of the phycocyanin estimation model, with an R<sup>2</sup> of 0.84, but did not result in substantial improvements in the model’s application. The fit of the Chl-a models was generally poorer, but these models still had good accuracy in detecting moderate and high Chl-a values. Future work will focus on exploring alternative algorithms that can incorporate diverse data sources and lake characteristics, contributing to a deeper understanding of the relationship between remote sensing data and water quality parameters. This research provides a valuable tool for cyanobacteria parameter estimation with confidence quantification to identify lakes at risk of algal blooms.
format Article
id doaj-art-fb381a2d4d384ba29d5d9e2ed04a2e58
institution OA Journals
issn 2072-4292
language English
publishDate 2024-09-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj-art-fb381a2d4d384ba29d5d9e2ed04a2e582025-08-20T01:55:49ZengMDPI AGRemote Sensing2072-42922024-09-011618350410.3390/rs16183504Estimating Chlorophyll-<i>a</i> and Phycocyanin Concentrations in Inland Temperate Lakes across New York State Using Sentinel-2 Images: Application of Google Earth Engine for Efficient Satellite Image ProcessingSara Akbarnejad Nesheli0Lindi J. Quackenbush1Lewis McCaffrey2Department of Environmental Resources Engineering, State University of New York College of Environmental Science and Forestry (ESF), Syracuse, NY 13210, USADepartment of Environmental Resources Engineering, State University of New York College of Environmental Science and Forestry (ESF), Syracuse, NY 13210, USANew York State Department of Environmental Conservation (NYSDEC), Syracuse, NY 13214, USAHarmful algae blooms (HABs) have been reported with greater frequency in lakes across New York State (NYS) in recent years. In situ sampling is used to assess water quality, but such observations are time intensive and therefore practically limited in their spatial extent. Previous research has used remote sensing imagery to estimate phytoplankton pigments (typically chlorophyll-<i>a</i> or phycocyanin) as HAB indicators. The primary goal of this study was to validate a remote sensing-based method to estimate cyanobacteria concentrations at high temporal (5 days) and spatial (10–20 m) resolution, to allow identification of lakes across NYS at a significant risk of algal blooms, thereby facilitating targeted field investigations. We used Google Earth Engine (GEE) as a cloud computing platform to develop an efficient methodology to process Sentinel-2 image collections at a large spatial and temporal scale. Our research used linear regression to model the correlation between in situ observations of chlorophyll-<i>a</i> (Chl-a) and phycocyanin and indices derived from Sentinel-2 data to evaluate the potential of remote sensing-derived inputs for estimating cyanobacteria concentrations. We tested the performance of empirical models based on seven remote-sensing-derived indices, two in situ measurements, two cloud mitigation approaches, and three temporal sampling windows across NYS lakes for 2019 and 2020. Our best base model (R<sup>2</sup> of 0.63), using concurrent sampling data and the ESA cloud masking—i.e., the QA60 bitmask—approach, related the maximum peak height (MPH) index to phycocyanin concentrations. Expanding the temporal match using a one-day time window increased the available training dataset size and improved the fit of the linear regression model (R<sup>2</sup> of 0.71), highlighting the positive impact of increasing the training dataset on model fit. Applying the Cloud Score+ method for filtering cloud and cloud shadows further improved the fit of the phycocyanin estimation model, with an R<sup>2</sup> of 0.84, but did not result in substantial improvements in the model’s application. The fit of the Chl-a models was generally poorer, but these models still had good accuracy in detecting moderate and high Chl-a values. Future work will focus on exploring alternative algorithms that can incorporate diverse data sources and lake characteristics, contributing to a deeper understanding of the relationship between remote sensing data and water quality parameters. This research provides a valuable tool for cyanobacteria parameter estimation with confidence quantification to identify lakes at risk of algal blooms.https://www.mdpi.com/2072-4292/16/18/3504cloud masksmall- and mid-size lakesmaximum peak height (MPH) index
spellingShingle Sara Akbarnejad Nesheli
Lindi J. Quackenbush
Lewis McCaffrey
Estimating Chlorophyll-<i>a</i> and Phycocyanin Concentrations in Inland Temperate Lakes across New York State Using Sentinel-2 Images: Application of Google Earth Engine for Efficient Satellite Image Processing
Remote Sensing
cloud mask
small- and mid-size lakes
maximum peak height (MPH) index
title Estimating Chlorophyll-<i>a</i> and Phycocyanin Concentrations in Inland Temperate Lakes across New York State Using Sentinel-2 Images: Application of Google Earth Engine for Efficient Satellite Image Processing
title_full Estimating Chlorophyll-<i>a</i> and Phycocyanin Concentrations in Inland Temperate Lakes across New York State Using Sentinel-2 Images: Application of Google Earth Engine for Efficient Satellite Image Processing
title_fullStr Estimating Chlorophyll-<i>a</i> and Phycocyanin Concentrations in Inland Temperate Lakes across New York State Using Sentinel-2 Images: Application of Google Earth Engine for Efficient Satellite Image Processing
title_full_unstemmed Estimating Chlorophyll-<i>a</i> and Phycocyanin Concentrations in Inland Temperate Lakes across New York State Using Sentinel-2 Images: Application of Google Earth Engine for Efficient Satellite Image Processing
title_short Estimating Chlorophyll-<i>a</i> and Phycocyanin Concentrations in Inland Temperate Lakes across New York State Using Sentinel-2 Images: Application of Google Earth Engine for Efficient Satellite Image Processing
title_sort estimating chlorophyll i a i and phycocyanin concentrations in inland temperate lakes across new york state using sentinel 2 images application of google earth engine for efficient satellite image processing
topic cloud mask
small- and mid-size lakes
maximum peak height (MPH) index
url https://www.mdpi.com/2072-4292/16/18/3504
work_keys_str_mv AT saraakbarnejadnesheli estimatingchlorophylliaiandphycocyaninconcentrationsininlandtemperatelakesacrossnewyorkstateusingsentinel2imagesapplicationofgoogleearthengineforefficientsatelliteimageprocessing
AT lindijquackenbush estimatingchlorophylliaiandphycocyaninconcentrationsininlandtemperatelakesacrossnewyorkstateusingsentinel2imagesapplicationofgoogleearthengineforefficientsatelliteimageprocessing
AT lewismccaffrey estimatingchlorophylliaiandphycocyaninconcentrationsininlandtemperatelakesacrossnewyorkstateusingsentinel2imagesapplicationofgoogleearthengineforefficientsatelliteimageprocessing