LAPSE:2025.0457v1
Published Article

LAPSE:2025.0457v1
Hybrid model development for Succinic Acid fermentation: relevance of ensemble learning for enhancing model prediction
June 27, 2025
Abstract
Sustainable development goals have spurred advancements in bioprocess design, driven by improved process monitoring, data storage, and computational power. High-fidelity models are essential for advanced process system engineering, yet accurate parametric models for bioprocessing remain challenging due to overparameterization, often resulting in poor predictive accuracy. Hybrid modeling, combining parametric and non-parametric methods, offers a promising solution by enhancing accuracy while maintaining interpretability. This study explores hybrid models for succinic acid fermentation by Escherichia coli, a critical process for sustainable bio-based chemical production. The research presents a structured exploration of hybrid model architectures and their robustness under varying conditions. Experimental data were preprocessed to remove noise and outliers, and hybrid model structures were developed with differing levels of hybridization (from one to all reaction rates). Kinetic parameters were recalibrated and compared against original values. Machine learning algorithms, including Artificial Neural Networks, Support Vector Machines, and Gaussian Processes, were tested, with tuning strategies applied to original or recalibrated parameters. Due to a considerable variability in individual model performance for validation, an ensemble learning approach was proposed to enhance robustness. Results demonstrate that despite not solving the overparametrization issues, all hybrid models outperform the original parametric model, with the best-performing hybrid model achieving a 52.3% lower RMSE of validation, avoiding overfitting. Ensemble approaches further improved predictions, reducing RMSE by up to 62.3% compared to individual parametric models. This highlights hybrid modelings potential to enhance bioprocess prediction accuracy, even with limited data, supporting future advancements in bioprocess scale-up, digitalization, and sustainable biorefinery implementations.
Sustainable development goals have spurred advancements in bioprocess design, driven by improved process monitoring, data storage, and computational power. High-fidelity models are essential for advanced process system engineering, yet accurate parametric models for bioprocessing remain challenging due to overparameterization, often resulting in poor predictive accuracy. Hybrid modeling, combining parametric and non-parametric methods, offers a promising solution by enhancing accuracy while maintaining interpretability. This study explores hybrid models for succinic acid fermentation by Escherichia coli, a critical process for sustainable bio-based chemical production. The research presents a structured exploration of hybrid model architectures and their robustness under varying conditions. Experimental data were preprocessed to remove noise and outliers, and hybrid model structures were developed with differing levels of hybridization (from one to all reaction rates). Kinetic parameters were recalibrated and compared against original values. Machine learning algorithms, including Artificial Neural Networks, Support Vector Machines, and Gaussian Processes, were tested, with tuning strategies applied to original or recalibrated parameters. Due to a considerable variability in individual model performance for validation, an ensemble learning approach was proposed to enhance robustness. Results demonstrate that despite not solving the overparametrization issues, all hybrid models outperform the original parametric model, with the best-performing hybrid model achieving a 52.3% lower RMSE of validation, avoiding overfitting. Ensemble approaches further improved predictions, reducing RMSE by up to 62.3% compared to individual parametric models. This highlights hybrid modelings potential to enhance bioprocess prediction accuracy, even with limited data, supporting future advancements in bioprocess scale-up, digitalization, and sustainable biorefinery implementations.
Record ID
Keywords
Fermentation, Hybrid modelling, Machine Learning, Modelling, Modelling and Simulations, Reaction Engineering, Succinic Acid Kinetics
Subject
Suggested Citation
Herrera-Ruiz JF, Fontalvo J, Prado-Rubio OA. Hybrid model development for Succinic Acid fermentation: relevance of ensemble learning for enhancing model prediction. Systems and Control Transactions 4:1896-1901 (2025) https://doi.org/10.69997/sct.153338
Author Affiliations
Herrera-Ruiz JF: Departamento de Ingeniería Química, Universidad Nacional de Colombia 170003 Manizales, Colombia
Fontalvo J: Departamento de Ingeniería Química, Universidad Nacional de Colombia 170003 Manizales, Colombia
Prado-Rubio OA: Departamento de Ingeniería Química, Universidad Nacional de Colombia 170003 Manizales, Colombia
Fontalvo J: Departamento de Ingeniería Química, Universidad Nacional de Colombia 170003 Manizales, Colombia
Prado-Rubio OA: Departamento de Ingeniería Química, Universidad Nacional de Colombia 170003 Manizales, Colombia
Journal Name
Systems and Control Transactions
Volume
4
First Page
1896
Last Page
1901
Year
2025
Publication Date
2025-07-01
Version Comments
Original Submission
Other Meta
PII: 1896-1901-1679-SCT-4-2025, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2025.0457v1
This Record
External Link

https://doi.org/10.69997/sct.153338
Article DOI
Download
Meta
Record Statistics
Record Views
740
Version History
[v1] (Original Submission)
Jun 27, 2025
Verified by curator on
Jun 27, 2025
This Version Number
v1
Citations
Most Recent
This Version
URL Here
http://psecommunity.org/LAPSE:2025.0457v1
Record Owner
PSE Press
Links to Related Works
References Cited
- Dai Z, Guo F, Zhang S, Zhang W, Yang Q, Dong W, et al. Bio-based succinic acid: an overview of strain development, substrate utilization, and downstream purification. Biofuels, Bioproducts and Biorefining 2020;14:965-85. https://doi.org/10.1002/bbb.2063
- de Azevedo CR, Díaz VG, Prado-Rubio OA, Willis MJ, Préat V, Oliveira R, et al. Hybrid Semiparametric Modeling: A Modular Process Systems Engineering Approach for the Integration of Available Knowledge Sources. Systems Engineering in the Fourth Industrial Revolution, Wiley; 2019, p. 345-73. https://doi.org/10.1002/9781119513957.ch14
- Leonov P. Bio-succinic acid production from alternative feedstock. Denmark Technical University, 2022
- F. Vigato, J.M. Woodley, M. Alvarado-Morales, Modeling the effect of CO2 limitation in continuous fermentation for biosuccinic acid production, Journal of CO2 Utilization 79 (2024) 102651. https://doi.org/10.1016/j.jcou.2023.102651
- P. Shah, M.Z. Sheriff, M.S.F. Bangi, C. Kravaris, J.S. Il Kwon, C. Botre, J. Hirota, Deep neural network-based hybrid modeling and experimental validation for an industry-scale fermentation process: Identification of time-varying dependencies among parameters, Chemical Engineering Journal 441 (2022). https://doi.org/10.1016/j.cej.2022.135643
- Chaleewong T, Khunnonkwao P, Puchongkawarin C, Jantama K. Kinetic modeling of succinate production from glucose and xylose by metabolically engineered Escherichia coli KJ12201. Biochem Eng J 2022;185:108487. https://doi.org/10.1016/j.bej.2022.108487
- Sánchez-Rendón JC, Morales-Rodriguez R, Matallana-Pérez LG, Prado-Rubio OA. Assessing Parameter Relative Importance in Bioprocesses Mathematical Models through Dynamic Sensitivity Analysis, 2020, p. 1711-6. https://doi.org/10.1016/B978-0-12-823377-1.50286-X
- Levenspiel O. The monod equation: A revisit and a generalization to product inhibition situations. Biotechnol Bioeng 1980;22:1671-87. https://doi.org/10.1002/bit.260220810

