LAPSE:2023.6213
Published Article

LAPSE:2023.6213
Ensemble Machine Learning for Predicting the Power Output from Different Solar Photovoltaic Systems
February 23, 2023
Abstract
In this paper, ensemble-based machine learning models with gradient boosting machine and random forest are proposed for predicting the power production from six different solar PV systems. The models are based on three year’s performance of a 1.2 MW grid-integrated solar photo-voltaic (PV) power plant. After cleaning the data for errors and outliers, the model features were chosen on the basis of principal component analysis. Accuracies of the developed models were tested and compared with the performance of models based on other supervised learning algorithms, such as k-nearest neighbour and support vector machines. Though the accuracies of the models varied with the type of PV systems, in general, the machine learned models developed under the study could perform well in predicting the power output from different solar PV technologies under varying working environments. For example, the average root mean square error of the models based on the gradient boosting machines, random forest, k-nearest neighbour, and support vector machines are 17.59 kW, 17.14 kW, 18.74 kW, and 16.91 kW, respectively. Corresponding averages of mean absolute errors are 8.28 kW, 7.88 kW, 14.45 kW, and 6.89 kW. Comparing the different modelling methods, the decision-tree-based ensembled algorithms and support vector machine models outperformed the approach based on the k-nearest neighbour method. With these high accuracies and lower computational costs compared with the deep learning approaches, the proposed ensembled models could be good options for PV performance predictions used in real and near-real-time applications.
In this paper, ensemble-based machine learning models with gradient boosting machine and random forest are proposed for predicting the power production from six different solar PV systems. The models are based on three year’s performance of a 1.2 MW grid-integrated solar photo-voltaic (PV) power plant. After cleaning the data for errors and outliers, the model features were chosen on the basis of principal component analysis. Accuracies of the developed models were tested and compared with the performance of models based on other supervised learning algorithms, such as k-nearest neighbour and support vector machines. Though the accuracies of the models varied with the type of PV systems, in general, the machine learned models developed under the study could perform well in predicting the power output from different solar PV technologies under varying working environments. For example, the average root mean square error of the models based on the gradient boosting machines, random forest, k-nearest neighbour, and support vector machines are 17.59 kW, 17.14 kW, 18.74 kW, and 16.91 kW, respectively. Corresponding averages of mean absolute errors are 8.28 kW, 7.88 kW, 14.45 kW, and 6.89 kW. Comparing the different modelling methods, the decision-tree-based ensembled algorithms and support vector machine models outperformed the approach based on the k-nearest neighbour method. With these high accuracies and lower computational costs compared with the deep learning approaches, the proposed ensembled models could be good options for PV performance predictions used in real and near-real-time applications.
Record ID
Keywords
gradient boosting machine, k-nearest neighbour, Machine Learning, random forest, solar PV power prediction, support vector machines
Subject
Suggested Citation
Raj V, Dotse SQ, Sathyajith M, Petra MI, Yassin H. Ensemble Machine Learning for Predicting the Power Output from Different Solar Photovoltaic Systems. (2023). LAPSE:2023.6213
Author Affiliations
Raj V: Faculty of Integrated Technologies, Universiti Brunei Darussalam, Jalan Tungku Link, Gadong BE1410, Brunei [ORCID]
Dotse SQ: School of Sustainable Development, University of Environment and Sustainable Development, Private Mail Bag, Somanya, Ghana [ORCID]
Sathyajith M: Faculty of Engineering and Science, University of Agder, Jon Lilletunsvei 9, 4879 Grimstad, Norway
Petra MI: Faculty of Integrated Technologies, Universiti Brunei Darussalam, Jalan Tungku Link, Gadong BE1410, Brunei
Yassin H: Faculty of Integrated Technologies, Universiti Brunei Darussalam, Jalan Tungku Link, Gadong BE1410, Brunei
Dotse SQ: School of Sustainable Development, University of Environment and Sustainable Development, Private Mail Bag, Somanya, Ghana [ORCID]
Sathyajith M: Faculty of Engineering and Science, University of Agder, Jon Lilletunsvei 9, 4879 Grimstad, Norway
Petra MI: Faculty of Integrated Technologies, Universiti Brunei Darussalam, Jalan Tungku Link, Gadong BE1410, Brunei
Yassin H: Faculty of Integrated Technologies, Universiti Brunei Darussalam, Jalan Tungku Link, Gadong BE1410, Brunei
Journal Name
Energies
Volume
16
Issue
2
First Page
671
Year
2023
Publication Date
2023-01-06
ISSN
1996-1073
Version Comments
Original Submission
Other Meta
PII: en16020671, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2023.6213
This Record
External Link

https://doi.org/10.3390/en16020671
Publisher Version
Download
Meta
Record Statistics
Record Views
230
Version History
[v1] (Original Submission)
Feb 23, 2023
Verified by curator on
Feb 23, 2023
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2023.6213
Record Owner
Auto Uploader for LAPSE
Links to Related Works
(0.34 seconds) 0.01 + 0.02 + 0.15 + 0.08 + 0 + 0.02 + 0.01 + 0 + 0.01 + 0.02 + 0 + 0
