LAPSE:2025.0396
Published Article

LAPSE:2025.0396
Knowledge Discovery in Large-Scale Batch Processes through Explainable Boosted Models and Uncertainty Quantification: Application to Rubber Mixing
June 27, 2025
Abstract
Rubber mixing (RM) is a vital batch process producing high-quality composites, which serve as input material for manufacturing different types of final products, such as tires. Due to its complexity, this process faces two main challenges regarding the final quality: i) lack of online measurement and ii) limited comprehension of the influence of the different factors involved in the process. While data-driven and machine learning (ML) based soft-sensing methods have been widely applied to address the first challenge, the second challenge, to the best of the author's knowledge, has not yet been addressed in the rubber industry. This work presents a data-driven method for extracting knowledge and providing explainability in the quality prediction in RM processes. The method centers on an XGBoost model while leveraging high-dimensional data collected over extended time periods from one of Michelins complex mixing processes. First, a recursive feature elimination-based procedure is used for selecting relevant features, which reduces the number of input features used for building the ML model by 82% while improving its predictive performance by 17%. Secondly, SHapley Additive exPlanations (SHAP) techniques are employed to explain the ML models predictions through global and local analyses of feature interactions. The selected quality-related variables can be leveraged to improve process control and supervision. Finally, an uncertainty quantification (UQ) module, based on Split Conformal Prediction (SCP), is combined with the ML model, providing confidence intervals with 90% coverage and empirically verified theoretical guarantees. This module ensures prediction reliability and robustness in real applications.
Rubber mixing (RM) is a vital batch process producing high-quality composites, which serve as input material for manufacturing different types of final products, such as tires. Due to its complexity, this process faces two main challenges regarding the final quality: i) lack of online measurement and ii) limited comprehension of the influence of the different factors involved in the process. While data-driven and machine learning (ML) based soft-sensing methods have been widely applied to address the first challenge, the second challenge, to the best of the author's knowledge, has not yet been addressed in the rubber industry. This work presents a data-driven method for extracting knowledge and providing explainability in the quality prediction in RM processes. The method centers on an XGBoost model while leveraging high-dimensional data collected over extended time periods from one of Michelins complex mixing processes. First, a recursive feature elimination-based procedure is used for selecting relevant features, which reduces the number of input features used for building the ML model by 82% while improving its predictive performance by 17%. Secondly, SHapley Additive exPlanations (SHAP) techniques are employed to explain the ML models predictions through global and local analyses of feature interactions. The selected quality-related variables can be leveraged to improve process control and supervision. Finally, an uncertainty quantification (UQ) module, based on Split Conformal Prediction (SCP), is combined with the ML model, providing confidence intervals with 90% coverage and empirically verified theoretical guarantees. This module ensures prediction reliability and robustness in real applications.
Record ID
Keywords
explainable machine learning, quality monitoring, rubber mixing, uncertainty quantification
Suggested Citation
Berthier L, Shokry A, Moulines E, Ramelet G, Desroziers S. Knowledge Discovery in Large-Scale Batch Processes through Explainable Boosted Models and Uncertainty Quantification: Application to Rubber Mixing. Systems and Control Transactions 4:1518-1523 (2025) https://doi.org/10.69997/sct.183525
Author Affiliations
Berthier L: Centre de Mathématiques Appliquées (CMAP), Ecole Polytechnique, Institut Polytechnique de Paris, 91120 Palaiseau, France; Explore Team I4.0, Manufacture Française des Pneumatiques Michelin, 63100 Clermont-Ferrand, France
Shokry A: Centre de Mathématiques Appliquées (CMAP), Ecole Polytechnique, Institut Polytechnique de Paris, 91120 Palaiseau, France
Moulines E: Centre de Mathématiques Appliquées (CMAP), Ecole Polytechnique, Institut Polytechnique de Paris, 91120 Palaiseau, France
Ramelet G: Explore Team I4.0, Manufacture Française des Pneumatiques Michelin, 63100 Clermont-Ferrand, France
Desroziers S: Explore Team I4.0, Manufacture Française des Pneumatiques Michelin, 63100 Clermont-Ferrand, France
Shokry A: Centre de Mathématiques Appliquées (CMAP), Ecole Polytechnique, Institut Polytechnique de Paris, 91120 Palaiseau, France
Moulines E: Centre de Mathématiques Appliquées (CMAP), Ecole Polytechnique, Institut Polytechnique de Paris, 91120 Palaiseau, France
Ramelet G: Explore Team I4.0, Manufacture Française des Pneumatiques Michelin, 63100 Clermont-Ferrand, France
Desroziers S: Explore Team I4.0, Manufacture Française des Pneumatiques Michelin, 63100 Clermont-Ferrand, France
Journal Name
Systems and Control Transactions
Volume
4
First Page
1518
Last Page
1523
Year
2025
Publication Date
2025-07-01
Version Comments
Original Submission
Other Meta
PII: 1518-1523-1510-SCT-4-2025, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2025.0396
This Record
External Link

https://doi.org/10.69997/sct.183525
Article DOI
Download
Meta
Record Statistics
Record Views
1214
Version History
[v1] (Original Submission)
Jun 27, 2025
Verified by curator on
Jun 27, 2025
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2025.0396
Record Owner
PSE Press
Links to Related Works
References Cited
- Diancai Yang, Yi Liu, Yugang Fan, and Haiqing Wang. Online prediction of mooney viscosity in industrial rubber mixing process via adaptive kernel learning method. 48th IEEE CDC, pages 404 - 409, 01 (2010) https://doi.org/10.1109/CDC.2009.5400936
- Chunying Zhang, Sun Chen, Fang Wu, and Kai Song. A novel q-based online model updating strategy and its application in statistical process control for rubber mixing. Chin. J. Chem. Eng., 23(5):796-803 (2015) https://doi.org/10.1016/j.cjche.2014.11.029
- Weiya Jin, Yi Liu, and Zengliang Gao. Fast property prediction in an industrial rubber mixing process with local elm model. J. App. Poly. Sc., 134:45391, 06 (2017) https://doi.org/10.1002/app.45391
- Shuihua Zheng, Kaixin Liu, Yili Xu, Hao Chen, Xuelei Zhang, and Yi Liu. Robust soft sensor with deep kernel learning for quality prediction in rubber mixing processes. Sensors, 20(3) (2020) https://doi.org/10.3390/s20030695
- Aysun Urhan and Burak Alakent. Integrating adaptive moving window and just-in-time learning paradigms for soft-sensor design. Neurocomputing, 392:23-37 (2020) https://doi.org/10.1016/j.neucom.2020.01.083
- International Organization for Standardization, ISO 6502-1. Rubber - Measurement of vulcanization characteristics using curemeters - Part 1: Introduction (2018)
- Tianqi Chen and Carlos Guestrin. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 785-794 (2016) https://doi.org/10.1145/2939672.2939785
- Isabelle Guyon and Andr'e Elisseeff. An introduction to variable and feature selection. J. Mach. Learn. Res., 3(null):1157-1182, 3 (2003)
- Scott M. Lundberg and Su-In Lee. A unified approach to interpreting model predictions. In Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS'17, pages 4768-4777 (2017)
- Anastasios N. Angelopoulos and Stephen Bates. A gentle introduction to conformal prediction and distribution-free uncertainty quantification. CoRR, abs/2107.07511 (2021)
(0.08 seconds)
[0.09 s]

