Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation Strategies
The rapidly increasing installed capacity of Wind Turbines (WTs) worldwide emphasizes the need for Operation and Maintenance (O&M) strategies favoring high availability, reliability, and cost-effective operation. Optimal decision-making and planning are supported by WT health condition analyses...
Saved in:
| Main Authors: | , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2024-12-01
|
| Series: | Energies |
| Subjects: | |
| Online Access: | https://www.mdpi.com/1996-1073/18/1/59 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850072339911278592 |
|---|---|
| author | Adaiton Oliveira-Filho Monelle Comeau James Cave Charbel Nasr Pavel Côté Antoine Tahan |
| author_facet | Adaiton Oliveira-Filho Monelle Comeau James Cave Charbel Nasr Pavel Côté Antoine Tahan |
| author_sort | Adaiton Oliveira-Filho |
| collection | DOAJ |
| description | The rapidly increasing installed capacity of Wind Turbines (WTs) worldwide emphasizes the need for Operation and Maintenance (O&M) strategies favoring high availability, reliability, and cost-effective operation. Optimal decision-making and planning are supported by WT health condition analyses based on data from the Supervisory Control and Data Acquisition (SCADA) system. However, SCADA data are highly imbalanced, with a predominance of healthy condition samples. Although this imbalance can negatively impact analyses such as detection, Condition Monitoring (CM), diagnosis, and prognosis, it is often overlooked in the literature. This review specifically addresses the problem of SCADA data imbalance, focusing on strategies to mitigate this condition. Five categories of such strategies were identified: Normal Behavior Models (NBMs), data-level strategies, algorithm-level strategies, cost-sensitive learning, and data augmentation techniques. This review evidenced that the choice among these strategies is mainly dictated by the availability of data and the intended analysis. Moreover, algorithm-level strategies are predominant in analyzing SCADA data because these strategies do not require the costly and time-consuming task of data labeling. An extensive public SCADA database could ease the problem of abnormal data scarcity and help handle the problem of data imbalance. However, long-dated requests to create such a database are still unaddressed. |
| format | Article |
| id | doaj-art-549ae2985c71499abd8fabfcd9c7879a |
| institution | DOAJ |
| issn | 1996-1073 |
| language | English |
| publishDate | 2024-12-01 |
| publisher | MDPI AG |
| record_format | Article |
| series | Energies |
| spelling | doaj-art-549ae2985c71499abd8fabfcd9c7879a2025-08-20T02:47:06ZengMDPI AGEnergies1996-10732024-12-011815910.3390/en18010059Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation StrategiesAdaiton Oliveira-Filho0Monelle Comeau1James Cave2Charbel Nasr3Pavel Côté4Antoine Tahan5Department of Mechanical Engineering, École de Technologie Supérieure, Université du Québec, 1100 Rue Notre Dame O, Montreal, QC H3C 1K3, CanadaPower Factors, 7005 Boulevard Taschereau, Brossard, QC J4Z 1A7, CanadaDepartment of Mechanical Engineering, École de Technologie Supérieure, Université du Québec, 1100 Rue Notre Dame O, Montreal, QC H3C 1K3, CanadaDepartment of Mechanical Engineering, École de Technologie Supérieure, Université du Québec, 1100 Rue Notre Dame O, Montreal, QC H3C 1K3, CanadaPower Factors, 7005 Boulevard Taschereau, Brossard, QC J4Z 1A7, CanadaDepartment of Mechanical Engineering, École de Technologie Supérieure, Université du Québec, 1100 Rue Notre Dame O, Montreal, QC H3C 1K3, CanadaThe rapidly increasing installed capacity of Wind Turbines (WTs) worldwide emphasizes the need for Operation and Maintenance (O&M) strategies favoring high availability, reliability, and cost-effective operation. Optimal decision-making and planning are supported by WT health condition analyses based on data from the Supervisory Control and Data Acquisition (SCADA) system. However, SCADA data are highly imbalanced, with a predominance of healthy condition samples. Although this imbalance can negatively impact analyses such as detection, Condition Monitoring (CM), diagnosis, and prognosis, it is often overlooked in the literature. This review specifically addresses the problem of SCADA data imbalance, focusing on strategies to mitigate this condition. Five categories of such strategies were identified: Normal Behavior Models (NBMs), data-level strategies, algorithm-level strategies, cost-sensitive learning, and data augmentation techniques. This review evidenced that the choice among these strategies is mainly dictated by the availability of data and the intended analysis. Moreover, algorithm-level strategies are predominant in analyzing SCADA data because these strategies do not require the costly and time-consuming task of data labeling. An extensive public SCADA database could ease the problem of abnormal data scarcity and help handle the problem of data imbalance. However, long-dated requests to create such a database are still unaddressed.https://www.mdpi.com/1996-1073/18/1/59wind turbineSCADA dataimbalanced datanormal behavior modeldata augmentation techniques |
| spellingShingle | Adaiton Oliveira-Filho Monelle Comeau James Cave Charbel Nasr Pavel Côté Antoine Tahan Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation Strategies Energies wind turbine SCADA data imbalanced data normal behavior model data augmentation techniques |
| title | Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation Strategies |
| title_full | Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation Strategies |
| title_fullStr | Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation Strategies |
| title_full_unstemmed | Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation Strategies |
| title_short | Wind Turbine SCADA Data Imbalance: A Review of Its Impact on Health Condition Analyses and Mitigation Strategies |
| title_sort | wind turbine scada data imbalance a review of its impact on health condition analyses and mitigation strategies |
| topic | wind turbine SCADA data imbalanced data normal behavior model data augmentation techniques |
| url | https://www.mdpi.com/1996-1073/18/1/59 |
| work_keys_str_mv | AT adaitonoliveirafilho windturbinescadadataimbalanceareviewofitsimpactonhealthconditionanalysesandmitigationstrategies AT monellecomeau windturbinescadadataimbalanceareviewofitsimpactonhealthconditionanalysesandmitigationstrategies AT jamescave windturbinescadadataimbalanceareviewofitsimpactonhealthconditionanalysesandmitigationstrategies AT charbelnasr windturbinescadadataimbalanceareviewofitsimpactonhealthconditionanalysesandmitigationstrategies AT pavelcote windturbinescadadataimbalanceareviewofitsimpactonhealthconditionanalysesandmitigationstrategies AT antoinetahan windturbinescadadataimbalanceareviewofitsimpactonhealthconditionanalysesandmitigationstrategies |