Proceedings of ESCAPE 35ISSN: 2818-4734
Volume: 4 (2025)
Table of Contents
LAPSE:2025.0318v1
Published Article
LAPSE:2025.0318v1
Accelerating Solvent Design Optimisation with Group-Contribution Machine Learning Surrogate Classifiers
Lifeng Zhang, Benoît Chachuat, Claire S. Adjiman
June 27, 2025
Abstract
Asserting the phase stability of multi-component mixtures is an important task in computer-aided mixture/blend design (CAMbD), but it is often hindered by the lack of reliable and tractable models. In this paper, we propose a group-contribution machine-learning (GC-ML) method to predict phase coexistence for a large set of ternary mixtures consisting of two solvents and one (fixed) solute. Each solvent is represented by a vector of functional group numbers, encoded by integer values. The solvent vectors are combined with mixture composition and temperature to form the input features to a GC-ML surrogate classifier, which distinguishes between four types of stable phase configurations as possible outputs: liquid (L), solid-liquid (SL), liquid-liquid (LL) or solid-liquid-liquid (SLL). To explore the performance of the trained GC-ML multi-classifier, it is embedded as a surrogate phase-stability constraint in the optimisation of an ibuprofen crystallisation process. A two-step solution strategy is proposed, iterating between a surrogate-based subproblem and a rigorous UNIFAC-based subproblem, to design binary solvent mixtures that improve the yield of ibuprofen. A high classification accuracy score of over 0.96 is achieved in identifying the correct phase configurations with the surrogate model, making it possible to accurately predict whole phase diagrams for numerous mixtures. Moreover, reliable solutions to the crystallisation design problem are generated with the two-step strategy, yielding objective function values close to those with the UNIFAC-based model at a lower computational cost. These results reveal the value of the proposed surrogate model in guiding the search for better solvent mixtures in CAMbD applications.
Keywords
Group contribution, Machine Learning, Optimisation, Phase stability, Solvent design
Suggested Citation
Zhang L, Chachuat B, Adjiman CS. Accelerating Solvent Design Optimisation with Group-Contribution Machine Learning Surrogate Classifiers. Systems and Control Transactions 4:1035-1040 (2025) https://doi.org/10.69997/sct.166568
Author Affiliations
Zhang L: Department of Chemical Engineering, The Sargent Centre for Process Systems Engineering, Imperial College London, London SW7 2AZ, United Kingdom
Chachuat B: Department of Chemical Engineering, The Sargent Centre for Process Systems Engineering, Imperial College London, London SW7 2AZ, United Kingdom
Adjiman CS: Department of Chemical Engineering, The Sargent Centre for Process Systems Engineering, Imperial College London, London SW7 2AZ, United Kingdom
Journal Name
Systems and Control Transactions
Volume
4
First Page
1035
Last Page
1040
Year
2025
Publication Date
2025-07-01
Version Comments
Original Submission
Other Meta
PII: 1035-1040-1274-SCT-4-2025, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2025.0318v1
This Record
External Link

https://doi.org/10.69997/sct.166568
Article DOI
Download
Files
Jun 27, 2025
Main Article
License
CC BY-SA 4.0
Meta
Record Statistics
Record Views
1345
Version History
[v1] (Original Submission)
Jun 27, 2025
 
Verified by curator on
Jun 27, 2025
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2025.0318v1
 
Record Owner
PSE Press
Links to Related Works
Directly Related to This Work
Article DOI
References Cited
  1. Jonuzaj, S. et al. The formulation of optimal mixtures with generalized disjunctive programming: A solvent design case study. AIChE J. 62, 1616-1633 (2016) https://doi.org/10.1002/aic.15122
  2. Gani, R. Computer-Aided Methods and Tools for Chemical Product Design. Chem. Eng. Res. Des. 82, 1494-1504 (2004) https://doi.org/10.1205/cerd.82.11.1494.52032
  3. Baker, L. E., Pierce, A. C. & Luks, K. D. Gibbs Energy Analysis of Phase Equilibria. Soc. Pet. Eng. J. 22, 731-742 (1982) https://doi.org/10.2118/9806-PA
  4. Mitsos, A. & Barton, P. I. A dual extremum principle in thermodynamics. AIChE J. 53, 2131-2147 (2007) https://doi.org/10.1002/aic.11230
  5. McDonald, C. M. & Floudas, C. A. Global optimization for the phase stability problem. AIChE J. 41, 1798-1814 (1995) https://doi.org/10.1002/aic.690410715
  6. Karunanithi, A. T., Achenie, L. E. K. & Gani, R. A computer-aided molecular design framework for crystallization solvent design. Chem. Eng. Sci. 61, 1247-1260 (2006) https://doi.org/10.1016/j.ces.2005.08.031
  7. Schweidtmann, A. M. & Mitsos, A. Deterministic Global Optimization with Artificial Neural Networks Embedded. J. Optim. Theory Appl. 180, 925-948 (2019) https://doi.org/10.1007/s10957-018-1396-0
  8. Schmitz, J. E., Zemp, R. J. & Mendes, M. J. Artificial neural networks for the solution of the phase stability problem. Fluid Phase Equilibria 245, 83-87 (2006) https://doi.org/10.1016/j.fluid.2006.02.013
  9. Lopez-Ramirez, E. et al. Artificial Neural Networks (ANNs) for Vapour-Liquid-Liquid Equilibrium (VLLE) Predictions in N-Octane/Water Blends. Processes 11, 2026 (2023) https://doi.org/10.3390/pr11072026
  10. Gaganis, V. & Varotsis, N. Non-iterative phase stability calculations for process simulation using discriminating functions. Fluid Phase Equilibria 314, 69-77 (2012) https://doi.org/10.1016/j.fluid.2011.10.021
  11. Karia, T. et al. Classifier surrogates to ensure phase stability in optimisation-based design of solvent mixtures. Digit. Chem. Eng. 100200 (2024) https://doi.org/10.2139/ssrn.4898054
  12. Zhang, L. et al. Comparative study of classifier models to assert phase stability in multicomponent mixtures. in Computer Aided Chemical Engineering (eds. Manenti, F. & Reklaitis, G. V.) vol. 53 1465-1470 (Elsevier, 2024) https://doi.org/10.1016/B978-0-443-28824-1.50245-3
  13. Published Parameters UNIFAC - DDBST GmbH. https://www.ddbst.com/published-parameters-unifac.html
  14. Mann, V., Gani, R. & Venkatasubramanian, V. Group contribution-based property modeling for chemical product design: A perspective in the AI era. Fluid Phase Equilibria 568, 113734 (2023) https://doi.org/10.1016/j.fluid.2023.113734
  15. Paduszynski, K. & Domanska, U. Viscosity of Ionic Liquids: An Extensive Database and a New Group Contribution Model Based on a Feed-Forward Artificial Neural Network. J. Chem. Inf. Model. 54, 1311-1324 (2014) https://doi.org/10.1021/ci500206u
  16. Prat, D. et al. CHEM21 selection guide of classical- and less classical-solvents. Green Chem. 18, 288-296 (2016) https://doi.org/10.1039/C5GC01008J
  17. Walker, P. J., Riedemann, A. & Wang, Z.-G. GCIdentifier.jl: A Julia package for identifyingmolecular fragments from SMILES. J. Open Source Softw. 9, 6453 (2024) https://doi.org/10.21105/joss.06453
  18. Haslam, A. J. et al. Expanding the Applications of the SAFT-? Mie Group-Contribution Equation of State: Prediction of Thermodynamic Properties and Phase Behavior of Mixtures. J. Chem. Eng. Data 65, 5862-5890 (2020) https://doi.org/10.1021/acs.jced.0c00746
  19. Sobol', I. M. On the distribution of points in a cube and the approximate evaluation of integrals. USSR Comput. Math. Math. Phys. 7, 86-112 (1967) https://doi.org/10.1016/0041-5553(67)90144-9
  20. Chaparro, G. & Mejía, A. Phasepy: A Python based framework for fluid phase equilibria and interfacial properties computation. J. Comput. Chem. 41, 2504-2526 (2020) https://doi.org/10.1002/jcc.26405
  21. Paszke, A. et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library. Preprint at http://arxiv.org/abs/1912.01703 (2019)
  22. Chicco, D. & Jurman, G. The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genomics 21, 6 (2020) https://doi.org/10.1186/s12864-019-6413-7
  23. Ceccon, F. et al. OMLT: Optimization & Machine Learning Toolkit
  24. Bynum, M. L. et al. Pyomo - Optimization Modeling in Python. vol. 67 (Springer International Publishing, Cham, 2021) https://doi.org/10.1007/978-3-030-68928-5_5
  25. Watson, O. L. et al. Computer Aided Design of Solvent Blends for Hybrid Cooling and Antisolvent Crystallization of Active Pharmaceutical Ingredients. Org. Process Res. Dev. 25, 1123-1142 (2021) https://doi.org/10.1021/acs.oprd.0c00516
  26. SBB. https://www.gams.com.html
  27. Drud, A. CONOPT: A GRG code for large sparse dynamic nonlinear optimization problems. Math. Program. 31, 153-191 (1985) https://doi.org/10.1007/BF02591747
(0.1 seconds)

[0.1 s]