LAPSE:2024.1037
Published Article
Research on Imbalanced Data Regression Based on Confrontation
June 7, 2024
Abstract
Regression models place high demands on data quality and balance to ensure accurate predictions. Real datasets, however, are commonly imbalanced in their distribution, which directly degrades the prediction accuracy of regression models. To address imbalanced data regression, we propose the IRGAN (imbalanced regression generative adversarial network) algorithm, which accounts for the continuity of the target value and the correlation of the data and builds on the ideas of optimization and confrontation. Considering the context information of the target data and the vanishing gradients of deep networks, we construct a generation module and design a composite loss function. In the early stages of training, the gap between the generated samples and the real samples is large, which easily leads to non-convergence. A correction module is therefore designed to learn the internal relationship between the state and action as well as the subsequent state and reward of the real samples, guide the generation module in generating samples, and alleviate non-convergence during training. The corrected samples and real samples are input into the discriminant module. On this basis, the confrontation idea is used to generate high-quality samples that balance the original samples. The proposed method is tested in the fields of aerospace, biology, physics, and chemistry. The similarity between the generated samples and the real samples is measured from multiple perspectives to evaluate the quality of the generated samples, which demonstrates the superiority of the generation module. Regression prediction on the balanced samples produced by the IRGAN algorithm shows that the proposed algorithm improves prediction accuracy on the imbalanced data regression problem.
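As a rough illustration of what "imbalanced" means for a continuous target, the sketch below bins target values and counts how many synthetic samples each bin would need to match the most populated bin. This is not the IRGAN method, whose details are in the publisher version; the function name `synthetic_samples_needed`, the equal-width binning, and the bin count are assumptions made only for this example.

```python
from collections import Counter


def synthetic_samples_needed(targets, n_bins=5):
    """Hypothetical helper: bin continuous targets into equal-width intervals
    and report how many extra samples each bin would need to reach the size
    of the most populated bin."""
    lo, hi = min(targets), max(targets)
    # Fall back to a width of 1.0 when all targets are identical.
    width = (hi - lo) / n_bins or 1.0
    # Clamp the maximum target into the last bin.
    bins = [min(int((t - lo) / width), n_bins - 1) for t in targets]
    counts = Counter(bins)
    largest = max(counts.values())
    # Bins with no real samples are reported too; they need `largest` samples.
    return {b: largest - counts.get(b, 0) for b in range(n_bins)}


# Ten targets clustered near zero and one rare, extreme target.
targets = [0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 5.0]
deficit = synthetic_samples_needed(targets, n_bins=5)
print(deficit)
```

In this toy data the rare high-value region is almost empty, which is the kind of gap a sample-generating approach would aim to fill before the regression model is trained.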
Keywords
imbalanced data, imbalanced regression, IRGAN
Suggested Citation
Liu X, Tian H. Research on Imbalanced Data Regression Based on Confrontation. (2024). LAPSE:2024.1037
Author Affiliations
Liu X: School of Control Science and Engineering, Tiangong University, Tianjin 300387, China; Tianjin Key Laboratory of Intelligent Control of Electrical Equipment, Tiangong University, Tianjin 300387, China
Tian H: School of Control Science and Engineering, Tiangong University, Tianjin 300387, China; Tianjin Key Laboratory of Intelligent Control of Electrical Equipment, Tiangong University, Tianjin 300387, China
Journal Name
Processes
Volume
12
Issue
2
First Page
375
Year
2024
Publication Date
2024-02-13
ISSN
2227-9717
Version Comments
Original Submission
Other Meta
PII: pr12020375, Publication Type: Journal Article
External Link

https://doi.org/10.3390/pr12020375
Publisher Version
License
CC BY 4.0
Version History
[v1] (Original Submission)
Jun 7, 2024
 
Verified by curator on
Jun 7, 2024
This Version Number
v1
Record URL
http://psecommunity.org/LAPSE:2024.1037
Record Owner
Auto Uploader for LAPSE