LAPSE:2020.0567
Published Article
LAPSE:2020.0567
Semi-Supervised Ensemble Classification Method Based on Near Neighbor and Its Application
Chuang Li, Yongfang Xie, Xiaofang Chen
June 10, 2020
Semi-supervised learning can be used to solve the problem of insufficient labeled samples in the process industry. However, in an actual scenario, traditional semi-supervised learning methods usually do not achieve satisfactory performance when the small number of labeled samples is subjective and inaccurate and some do not consider how to develop a strategy to expand the training set. In this paper, a new algorithm is proposed to alleviate the above two problems, and consequently, the information contained in unlabeled samples can be fully mined. First, the multivariate adaptive regression splines (MARS) and adaptive boosting (Adaboost) algorithms are adopted for co-training to make the most of the deep connection between samples and features. In addition, the strategies, pseudo-labeled dataset selection algorithm based on near neighbor degree (DSSA) and pseudo-labeled sample detection algorithm based on near neighbor degree selection (SPDA) are adopted to enlarge the dataset of labeled samples. When we select the samples from the pseudo-labeled data to join the training set, the confidence degree and the spatial relationship with labeled samples are considered, which are able to improve classifier accuracy. The results of tests on multiple University of California Irvine (UCI) datasets and an actual dataset in the aluminum electrolysis industry demonstrate the effectiveness of the proposed algorithm.
Keywords
adaboost, aluminum electrolysis, ensemble learning, multivariate adaptive regression splines, semi-supervised learning
Suggested Citation
Li C, Xie Y, Chen X. Semi-Supervised Ensemble Classification Method Based on Near Neighbor and Its Application. (2020). LAPSE:2020.0567
Author Affiliations
Li C: School of Automation, Central South University, Changsha 410083, China
Xie Y: School of Automation, Central South University, Changsha 410083, China
Chen X: School of Automation, Central South University, Changsha 410083, China
Journal Name
Processes
Volume
8
Issue
4
Article Number
E415
Year
2020
Publication Date
2020-04-01
Published Version
ISSN
2227-9717
Version Comments
Original Submission
Other Meta
PII: pr8040415, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2020.0567
This Record
External Link

doi:10.3390/pr8040415
Publisher Version
Download
Files
[Download 1v1.pdf] (1.8 MB)
Jun 10, 2020
Main Article
License
CC BY 4.0
Meta
Record Statistics
Record Views
464
Version History
[v1] (Original Submission)
Jun 10, 2020
 
Verified by curator on
Jun 10, 2020
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2020.0567
 
Original Submitter
Calvin Tsay
Links to Related Works
Directly Related to This Work
Publisher Version