LAPSE:2023.30385
Published Article
LAPSE:2023.30385
Wind Turbine Fault Detection Using Highly Imbalanced Real SCADA Data
April 14, 2023
Wind power is cleaner and less expensive compared to other alternative sources, and it has therefore become one of the most important energy sources worldwide. However, challenges related to the operation and maintenance of wind farms significantly contribute to the increase in their overall costs, and, therefore, it is necessary to monitor the condition of each wind turbine on the farm and identify the different states of alarm. Common alarms are raised based on data acquired by a supervisory control and data acquisition (SCADA) system; however, this system generates a large number of false positive alerts, which must be handled to minimize inspection costs and perform preventive maintenance before actual critical or catastrophic failures occur. To this end, a fault detection methodology is proposed in this paper; in the proposed method, different data analysis and data processing techniques are applied to real SCADA data (imbalanced data) for improving the detection of alarms related to the temperature of the main gearbox of a wind turbine. An imbalanced dataset is a classification data set that contains skewed class proportions (more observations from one class than the other) which can cause a potential bias if it is not handled with caution. Furthermore, the dataset is time dependent introducing an additional variable to deal with when processing and splitting the data. These methods are aimed to reduce false positives and false negatives, and to demonstrate the effectiveness of well-applied preprocessing techniques for improving the performance of different machine learning algorithms.
Keywords
Fault Detection, imbalanced data, k nearest neighbors, Machine Learning, principal component analysis, SCADA, structural health monitoring, support vector machines, wind turbine
Suggested Citation
Velandia-Cardenas C, Vidal Y, Pozo F. Wind Turbine Fault Detection Using Highly Imbalanced Real SCADA Data. (2023). LAPSE:2023.30385
Author Affiliations
Velandia-Cardenas C: Control, Modeling, Identification and Applications (CoDAlab), Department of Mathematics, Escola d’Enginyeria de Barcelona Est (EEBE), Campus Diagonal-Besòs (CDB), Universitat Politècnica de Catalunya (UPC), Eduard Maristany 16, 08019 Barcelona, Spain [ORCID]
Vidal Y: Control, Modeling, Identification and Applications (CoDAlab), Department of Mathematics, Escola d’Enginyeria de Barcelona Est (EEBE), Campus Diagonal-Besòs (CDB), Universitat Politècnica de Catalunya (UPC), Eduard Maristany 16, 08019 Barcelona, Spain; [ORCID]
Pozo F: Control, Modeling, Identification and Applications (CoDAlab), Department of Mathematics, Escola d’Enginyeria de Barcelona Est (EEBE), Campus Diagonal-Besòs (CDB), Universitat Politècnica de Catalunya (UPC), Eduard Maristany 16, 08019 Barcelona, Spain; [ORCID]
Journal Name
Energies
Volume
14
Issue
6
First Page
1728
Year
2021
Publication Date
2021-03-20
Published Version
ISSN
1996-1073
Version Comments
Original Submission
Other Meta
PII: en14061728, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2023.30385
This Record
External Link

doi:10.3390/en14061728
Publisher Version
Download
Files
[Download 1v1.pdf] (1.5 MB)
Apr 14, 2023
Main Article
License
CC BY 4.0
Meta
Record Statistics
Record Views
129
Version History
[v1] (Original Submission)
Apr 14, 2023
 
Verified by curator on
Apr 14, 2023
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2023.30385
 
Original Submitter
Auto Uploader for LAPSE
Links to Related Works
Directly Related to This Work
Publisher Version