Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation Strategies

The rapidly increasing installed capacity of Wind Turbines (WTs) worldwide emphasizes the need for Operation and Maintenance (O&M) strategies favoring high availability, reliability, and cost-effective operation. Optimal decision-making and planning are supported by WT health condition analyses...

Full description

Saved in:
Bibliographic Details
Main Authors: Adaiton Oliveira-Filho, Monelle Comeau, James Cave, Charbel Nasr, Pavel Côté, Antoine Tahan
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:Energies
Subjects:
Online Access:https://www.mdpi.com/1996-1073/18/1/59
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850072339911278592
author Adaiton Oliveira-Filho
Monelle Comeau
James Cave
Charbel Nasr
Pavel Côté
Antoine Tahan
author_facet Adaiton Oliveira-Filho
Monelle Comeau
James Cave
Charbel Nasr
Pavel Côté
Antoine Tahan
author_sort Adaiton Oliveira-Filho
collection DOAJ
description The rapidly increasing installed capacity of Wind Turbines (WTs) worldwide emphasizes the need for Operation and Maintenance (O&M) strategies favoring high availability, reliability, and cost-effective operation. Optimal decision-making and planning are supported by WT health condition analyses based on data from the Supervisory Control and Data Acquisition (SCADA) system. However, SCADA data are highly imbalanced, with a predominance of healthy condition samples. Although this imbalance can negatively impact analyses such as detection, Condition Monitoring (CM), diagnosis, and prognosis, it is often overlooked in the literature. This review specifically addresses the problem of SCADA data imbalance, focusing on strategies to mitigate this condition. Five categories of such strategies were identified: Normal Behavior Models (NBMs), data-level strategies, algorithm-level strategies, cost-sensitive learning, and data augmentation techniques. This review evidenced that the choice among these strategies is mainly dictated by the availability of data and the intended analysis. Moreover, algorithm-level strategies are predominant in analyzing SCADA data because these strategies do not require the costly and time-consuming task of data labeling. An extensive public SCADA database could ease the problem of abnormal data scarcity and help handle the problem of data imbalance. However, long-dated requests to create such a database are still unaddressed.
format Article
id doaj-art-549ae2985c71499abd8fabfcd9c7879a
institution DOAJ
issn 1996-1073
language English
publishDate 2024-12-01
publisher MDPI AG
record_format Article
series Energies
spelling doaj-art-549ae2985c71499abd8fabfcd9c7879a2025-08-20T02:47:06ZengMDPI AGEnergies1996-10732024-12-011815910.3390/en18010059Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation StrategiesAdaiton Oliveira-Filho0Monelle Comeau1James Cave2Charbel Nasr3Pavel Côté4Antoine Tahan5Department of Mechanical Engineering, École de Technologie Supérieure, Université du Québec, 1100 Rue Notre Dame O, Montreal, QC H3C 1K3, CanadaPower Factors, 7005 Boulevard Taschereau, Brossard, QC J4Z 1A7, CanadaDepartment of Mechanical Engineering, École de Technologie Supérieure, Université du Québec, 1100 Rue Notre Dame O, Montreal, QC H3C 1K3, CanadaDepartment of Mechanical Engineering, École de Technologie Supérieure, Université du Québec, 1100 Rue Notre Dame O, Montreal, QC H3C 1K3, CanadaPower Factors, 7005 Boulevard Taschereau, Brossard, QC J4Z 1A7, CanadaDepartment of Mechanical Engineering, École de Technologie Supérieure, Université du Québec, 1100 Rue Notre Dame O, Montreal, QC H3C 1K3, CanadaThe rapidly increasing installed capacity of Wind Turbines (WTs) worldwide emphasizes the need for Operation and Maintenance (O&M) strategies favoring high availability, reliability, and cost-effective operation. Optimal decision-making and planning are supported by WT health condition analyses based on data from the Supervisory Control and Data Acquisition (SCADA) system. However, SCADA data are highly imbalanced, with a predominance of healthy condition samples. Although this imbalance can negatively impact analyses such as detection, Condition Monitoring (CM), diagnosis, and prognosis, it is often overlooked in the literature. This review specifically addresses the problem of SCADA data imbalance, focusing on strategies to mitigate this condition. Five categories of such strategies were identified: Normal Behavior Models (NBMs), data-level strategies, algorithm-level strategies, cost-sensitive learning, and data augmentation techniques. This review evidenced that the choice among these strategies is mainly dictated by the availability of data and the intended analysis. Moreover, algorithm-level strategies are predominant in analyzing SCADA data because these strategies do not require the costly and time-consuming task of data labeling. An extensive public SCADA database could ease the problem of abnormal data scarcity and help handle the problem of data imbalance. However, long-dated requests to create such a database are still unaddressed.https://www.mdpi.com/1996-1073/18/1/59wind turbineSCADA dataimbalanced datanormal behavior modeldata augmentation techniques
spellingShingle Adaiton Oliveira-Filho
Monelle Comeau
James Cave
Charbel Nasr
Pavel Côté
Antoine Tahan
Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation Strategies
Energies
wind turbine
SCADA data
imbalanced data
normal behavior model
data augmentation techniques
title Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation Strategies
title_full Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation Strategies
title_fullStr Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation Strategies
title_full_unstemmed Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation Strategies
title_short Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation Strategies
title_sort wind turbine scada data imbalance a review of its impact on health condition analyses and mitigation strategies
topic wind turbine
SCADA data
imbalanced data
normal behavior model
data augmentation techniques
url https://www.mdpi.com/1996-1073/18/1/59
work_keys_str_mv AT adaitonoliveirafilho windturbinescadadataimbalanceareviewofitsimpactonhealthconditionanalysesandmitigationstrategies
AT monellecomeau windturbinescadadataimbalanceareviewofitsimpactonhealthconditionanalysesandmitigationstrategies
AT jamescave windturbinescadadataimbalanceareviewofitsimpactonhealthconditionanalysesandmitigationstrategies
AT charbelnasr windturbinescadadataimbalanceareviewofitsimpactonhealthconditionanalysesandmitigationstrategies
AT pavelcote windturbinescadadataimbalanceareviewofitsimpactonhealthconditionanalysesandmitigationstrategies
AT antoinetahan windturbinescadadataimbalanceareviewofitsimpactonhealthconditionanalysesandmitigationstrategies