LAPSE:2023.12965
Published Article

LAPSE:2023.12965
Short-Term PV Power Forecasting Using a Regression-Based Ensemble Method
February 28, 2023
Abstract
One of the most critical aspects of integrating renewable energy sources into the smart grid is photovoltaic (PV) power generation forecasting. This ensemble forecasting technique combines several forecasting models to increase the forecasting accuracy of the individual models. This study proposes a regression-based ensemble method for day-ahead PV power forecasting. The general framework consists of three steps: model training, creating the optimal set of weights, and testing the model. In step 1, a Random forest (RF) with different parameters is used for a single forecasting method. Five RF models (RF1, RF2, RF3, RF4, and RF5) and a support vector machine (SVM) for classification are established. The hyperparameters for the regression-based method involve learners (linear regression (LR) or support vector regression (SVR)), regularization (least absolute shrinkage and selection operator (LASSO) or Ridge), and a penalty coefficient for regularization (λ). Bayesian optimization is performed to find the optimal value of these three hyperparameters based on the minimum function. The optimal set of weights is obtained in step 2 and each set of weights contains five weight coefficients and a bias. In the final step, the weather forecasting data for the target day is used as input for the five RF models and the average daily weather forecasting data is also used as input for the SVM classification model. The SVM output selects the weather conditions, and the corresponding set of weight coefficients from step 2 is combined with the output from each RF model to obtain the final forecasting results. The stacking recurrent neural network (RNN) is used as a benchmark ensemble method for comparison. Historical PV power data for a PV site in Zhangbin Industrial Area, Taiwan, with a 2000 kWp capacity is used to test the methodology. The results for the single best RF model, the stacking RNN, and the proposed method are compared in terms of the mean relative error (MRE), the mean absolute error (MAE), and the coefficient of determination (R2) to verify the proposed method. The results for the MRE show that the proposed method outperforms the best RF method by 20% and the benchmark method by 2%.
One of the most critical aspects of integrating renewable energy sources into the smart grid is photovoltaic (PV) power generation forecasting. This ensemble forecasting technique combines several forecasting models to increase the forecasting accuracy of the individual models. This study proposes a regression-based ensemble method for day-ahead PV power forecasting. The general framework consists of three steps: model training, creating the optimal set of weights, and testing the model. In step 1, a Random forest (RF) with different parameters is used for a single forecasting method. Five RF models (RF1, RF2, RF3, RF4, and RF5) and a support vector machine (SVM) for classification are established. The hyperparameters for the regression-based method involve learners (linear regression (LR) or support vector regression (SVR)), regularization (least absolute shrinkage and selection operator (LASSO) or Ridge), and a penalty coefficient for regularization (λ). Bayesian optimization is performed to find the optimal value of these three hyperparameters based on the minimum function. The optimal set of weights is obtained in step 2 and each set of weights contains five weight coefficients and a bias. In the final step, the weather forecasting data for the target day is used as input for the five RF models and the average daily weather forecasting data is also used as input for the SVM classification model. The SVM output selects the weather conditions, and the corresponding set of weight coefficients from step 2 is combined with the output from each RF model to obtain the final forecasting results. The stacking recurrent neural network (RNN) is used as a benchmark ensemble method for comparison. Historical PV power data for a PV site in Zhangbin Industrial Area, Taiwan, with a 2000 kWp capacity is used to test the methodology. The results for the single best RF model, the stacking RNN, and the proposed method are compared in terms of the mean relative error (MRE), the mean absolute error (MAE), and the coefficient of determination (R2) to verify the proposed method. The results for the MRE show that the proposed method outperforms the best RF method by 20% and the benchmark method by 2%.
Record ID
Keywords
clustering method, ensemble method, linear regression, PV power forecasting, Random forest, support vector machine
Suggested Citation
Lateko AAH, Yang HT, Huang CM. Short-Term PV Power Forecasting Using a Regression-Based Ensemble Method. (2023). LAPSE:2023.12965
Author Affiliations
Lateko AAH: Department of Electrical Engineering, National Cheng Kung University, Tainan 701, Taiwan; Department of Electrical Engineering, Muhammadiyah University of Makassar, Makassar 90221, Indonesia [ORCID]
Yang HT: Department of Electrical Engineering, National Cheng Kung University, Tainan 701, Taiwan [ORCID]
Huang CM: Department of Electrical Engineering, Kun Shan University, Tainan 710, Taiwan [ORCID]
Yang HT: Department of Electrical Engineering, National Cheng Kung University, Tainan 701, Taiwan [ORCID]
Huang CM: Department of Electrical Engineering, Kun Shan University, Tainan 710, Taiwan [ORCID]
Journal Name
Energies
Volume
15
Issue
11
First Page
4171
Year
2022
Publication Date
2022-06-06
ISSN
1996-1073
Version Comments
Original Submission
Other Meta
PII: en15114171, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2023.12965
This Record
External Link

https://doi.org/10.3390/en15114171
Publisher Version
Download
Meta
Record Statistics
Record Views
206
Version History
[v1] (Original Submission)
Feb 28, 2023
Verified by curator on
Feb 28, 2023
This Version Number
v1
Citations
Most Recent
This Version
URL Here
http://psecommunity.org/LAPSE:2023.12965
Record Owner
Auto Uploader for LAPSE
Links to Related Works
(0.27 seconds)
[[0.28 s]]
