Proceedings of ESCAPE 35ISSN: 2818-4734
Volume: 4 (2025)
Table of Contents
LAPSE:2025.0451
Published Article
LAPSE:2025.0451
Data-Driven Soft Sensors for Process Industries: Case Study on a Delayed Coker Unit
Wei Sun, James G. Brigman, Cheng Ji, Pratap Nair, Fangyuan Ma, Jingde Wang
June 27, 2025
Abstract
This research addresses the challenges associated with data-driven soft sensors in industrial applications, where successful implementations remain limited. The scarcity of practical applications can be attributed to variable operating conditions and frequent disturbances in real-time processes. Industrial data are often nonlinear, dynamic, and highly unbalanced, complicating efforts to capture the essential characteristics of underlying processes. To tackle these issues, we propose a comprehensive solution for industrial application, that encompasses feature selection, feature extraction, and model updating. Feature selection aims to pinpoint the independent variables that have a substantial impact on key performance indicators, including quality, safety, efficiency, reliability, and sustainability. By doing so, it simplifies the model and boosts its predictive accuracy. The process begins with screening variables based on process knowledge, followed by a thorough analysis of correlation and redundancy to eliminate redundant information, which can burden computational resources and degrade prediction accuracy. We propose a mutual information-based algorithm for feature selection that assesses the relevance and redundancy among process variables through a comprehensive correlation function. This algorithm ranks variables by their importance using a Greedy search method to identify the optimal set of variables. After selecting the optimal variables, feature extraction is carried out to derive internal features from this set and establish a relationship between these latent features and the output variables. Given the intricate nature of industrial processes, we employ deep learning techniques, specifically Long Short-Term Memory (LSTM) networks, which are a type of Recurrent Neural Network (RNN) well-suited for capturing long-term dependencies in sequential data. LSTMs excel at modeling temporal correlations due to their ability to maintain memory states that allow for learning from sequential data over extended periods. To address short-term nonstationary features resulting from process disturbances, we incorporate a differential unit into the latent layer of the LSTM network. Once trained, the model is updated during online applications to incorporate gradual changes in equipment and reaction agents. Quality-related data, although typically available only post-measurement, can be leveraged to fine-tune model parameters, ensuring sustained predictive accuracy over time. To validate our approach, we present a case study on a delayed coker unit, yielding promising long-term predictions for tube metal temperature and showcasing the potential of our methodology for industrial applications.
Keywords
feature extraction, feature selection, quality prediction
Suggested Citation
Sun W, Brigman JG, Ji C, Nair P, Ma F, Wang J. Data-Driven Soft Sensors for Process Industries: Case Study on a Delayed Coker Unit. Systems and Control Transactions 4:1860-1865 (2025) https://doi.org/10.69997/sct.185205
Author Affiliations
Sun W: College of Chemical Engineering, Beijing University of Chemical Technology, 100029, North Third Ring Road 15, Chaoyang District, Beijing, China
Brigman JG: Ingenero Inc. 4615 Southwest Freeway, Suite 320, Houston TX 77027, USA
Ji C: College of Chemical Engineering, Beijing University of Chemical Technology, 100029, North Third Ring Road 15, Chaoyang District, Beijing, China
Nair P: Ingenero Inc. 4615 Southwest Freeway, Suite 320, Houston TX 77027, USA
Ma F: College of Chemical Engineering, Beijing University of Chemical Technology, 100029, North Third Ring Road 15, Chaoyang District, Beijing, China
Wang J: College of Chemical Engineering, Beijing University of Chemical Technology, 100029, North Third Ring Road 15, Chaoyang District, Beijing, China
Journal Name
Systems and Control Transactions
Volume
4
First Page
1860
Last Page
1865
Year
2025
Publication Date
2025-07-01
Version Comments
Original Submission
Other Meta
PII: 1860-1865-1574-SCT-4-2025, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2025.0451
This Record
External Link

https://doi.org/10.69997/sct.185205
Article DOI
Download
Files
Jun 27, 2025
Main Article
License
CC BY-SA 4.0
Meta
Record Statistics
Record Views
1112
Version History
[v1] (Original Submission)
Jun 27, 2025
 
Verified by curator on
Jun 27, 2025
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2025.0451
 
Record Owner
PSE Press
Links to Related Works
Directly Related to This Work
Article DOI
References Cited
  1. Kadlec P, Gabrys B, Strandt S. Data-driven soft sensors in the process industry[J]. Computers & Chemical Engineering, 2009, 33: 795-814 https://doi.org/10.1016/j.compchemeng.2008.12.012
  2. Souza F A A, Araújo R, Mendes J. Review of soft sensor methods for regression applications[J]. Chemometrics and Intelligent Laboratory Systems, 2016, 152: 69-79 https://doi.org/10.1016/j.chemolab.2015.12.011
  3. Wang Z X, He Q P, Wang J. Comparison of variable selection methods for PLS-based soft sensor modeling[J]. Journal of Process Control, 2015, 26: 56-72 https://doi.org/10.1016/j.jprocont.2015.01.003
  4. Ji C, Sun W. A review on data-driven process monitoring methods: Characterization and mining of industrial data[J]. Processes, 2022, 10(2): 335 https://doi.org/10.3390/pr10020335
  5. Pearson K. VII. Mathematical contributions to the theory of evolution. - III. Regression, heredity, and panmixia[J]. Philosophical Transactions of the Royal Society A, 1896(187): 253-318 https://doi.org/10.1098/rsta.1896.0007
  6. Sun K, Huang S H, Wong D S H, et al. Design and application of a variable selection method for multilayer perceptron neural network with LASSO[J]. IEEE Transactions on Neural Networks and Learning Systems, 2017, 28(6): 1386-1396 https://doi.org/10.1109/TNNLS.2016.2542866
  7. Vergara J R, Estévez P A. A review of feature selection methods based on mutual information[J]. Neural Computing & Applications, 2014, 24: 175-186 https://doi.org/10.1007/s00521-013-1368-0
  8. Ji C, Ma F, Wang J, et al. Profitability related industrial-scale batch processes monitoring via deep learning based soft sensor development[J]. Computers & Chemical Engineering, 2023, 170: 108125 https://doi.org/10.1016/j.compchemeng.2022.108125
  9. Estevez P A, Tesmer M, Perez C A, et al. Normalized Mutual Information Feature Selection[J]. IEEE Transactions on Neural Networks and Learning Systems, 2009, 20(2): 189-201 https://doi.org/10.1109/TNN.2008.2005601
  10. Reshef D N, Reshef Y A, Finucane H K, et al. Detecting novel associations in large data sets[J]. Science, 2011, 334(6062): 1518-1524 https://doi.org/10.1126/science.1205438
  11. Freeman C, Kulic D, Basir O. An evaluation of classifier-specific filter measure performance for feature selection[J]. Pattern Recognition, 2015, 48(5): 1812-1826 https://doi.org/10.1016/j.patcog.2014.11.010
  12. Xiao L, Wang C, Dong Y, et al. A novel sub-models selection algorithm based on max-relevance and min-redundancy neighborhood mutual information[J]. Information Sciences, 2019, 486: 310-339 https://doi.org/10.1016/j.ins.2019.01.075
  13. Yin K, Zhai J, Xie A, et al. Feature selection using max dynamic relevancy and min redundancy[J]. Pattern Analysis and Applications, 2023, 26(2): 631-643 https://doi.org/10.1007/s10044-023-01138-y
  14. Bennasar M, Hicks Y, Setchi R. Feature selection using joint mutual information maximisation[J]. Expert Systems with Applications, 2015, 42(22): 8520-8532 https://doi.org/10.1016/j.eswa.2015.07.007
  15. Ji C, Ma F, Wan J, et al. A Conditional Entropy Based Feature Selection for Soft Sensor Development in Chemical Processes[J]. Chemical Engineering Transactions, 2023, 103: 61-66
  16. Che J, Yang Y, Li L, et al. Maximum relevance minimum common redundancy feature selection for nonlinear data[J]. Information Sciences, 2017, 409: 68-86 https://doi.org/10.1016/j.ins.2017.05.013
  17. Shannon C E. A mathematical theory of communication[J]. Bell Labs Technical Journal, 1948, 27(3): 379-423 https://doi.org/10.1002/j.1538-7305.1948.tb01338.x
  18. Kraskov A, Stögbauer H, Grassberger P. Estimating mutual information[J]. Physical Review E-Statistical, Nonlinear, and Soft Matter Physics, 2004, 69(6): 066138 https://doi.org/10.1103/PhysRevE.69.066138
  19. Moon Y I, Rajagopalan B, Lall U. Estimation of mutual information using kernel density estimators[J]. Physical Review E, 1995, 52(3): 2318 https://doi.org/10.1103/PhysRevE.52.2318
(0.08 seconds)

[0.09 s]