Proceedings of ESCAPE 35ISSN: 2818-4734
Volume: 4 (2025)
Table of Contents
LAPSE:2025.0432
Published Article
LAPSE:2025.0432
Computational Assessment of Molecular Synthetic Accessibility using Economic Indicators
Friedrich Hastedt, Klaus Hellgardt, Sophia Yaliraki, Antonio del Rio Chanona, Dongda Zhang
June 27, 2025
Abstract
The rapid advancement of computational drug discovery has enabled the generation of vast virtual libraries of promising drug candidates. However, evaluating the synthetic accessibility (SA) of these compounds remains a critical bottleneck. While computer-aided synthesis planning (CASP) tools can provide synthesis routes to the candidate, their computational demands make them impractical for large-scale screening. Existing rapid SA scoring methods, struggle to generalize to out-of-distribution molecules and do not account for economic viability. To address these challenges, we present MolPrice, an accurate and reliable price prediction tool. By introducing a novel self-supervised learning approach, MolPrice achieves robust generalization to diverse molecular structures of various complexities. Our comprehensive analysis of model architectures and molecular representations reveals that substructure-based features strongly correlate with market prices, supporting the relationship between synthetic complexity and economic value. MolPrice performs well on the standard literature SA benchmark, showcasing its ability for SA estimation. MolPrice thus serves as both an accurate molecular price predictor and a rapid synthetic accessibility assessment tool, enhancing the efficiency of modern drug discovery pipelines.
Keywords
Machine Learning, Molecular Complexity, Retrosynthesis, Synthetic Accessibility, Virtual Screening
Suggested Citation
Hastedt F, Hellgardt K, Yaliraki S, Chanona ADR, Zhang D. Computational Assessment of Molecular Synthetic Accessibility using Economic Indicators. Systems and Control Transactions 4:1744-1749 (2025) https://doi.org/10.69997/sct.175859
Author Affiliations
Hastedt F: Imperial College London, Exhibition Rd, South Kensington, London SW7 2BX, United Kingdom
Hellgardt K: University of Manchester, Oxford Rd, Manchester M13 9PL, United Kingdom
Yaliraki S: Imperial College London, Exhibition Rd, South Kensington, London SW7 2BX, United Kingdom
Chanona ADR: Imperial College London, Exhibition Rd, South Kensington, London SW7 2BX, United Kingdom
Zhang D: University of Manchester, Oxford Rd, Manchester M13 9PL, United Kingdom
Journal Name
Systems and Control Transactions
Volume
4
First Page
1744
Last Page
1749
Year
2025
Publication Date
2025-07-01
Version Comments
Original Submission
Other Meta
PII: 1744-1749-1295-SCT-4-2025, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2025.0432
This Record
External Link

https://doi.org/10.69997/sct.175859
Article DOI
Download
Files
Jun 27, 2025
Main Article
License
CC BY-SA 4.0
Meta
Record Statistics
Record Views
1152
Version History
[v1] (Original Submission)
Jun 27, 2025
 
Verified by curator on
Jun 27, 2025
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2025.0432
 
Record Owner
PSE Press
Links to Related Works
Directly Related to This Work
Article DOI
References Cited
  1. Du Y, Jamasb AR, Guo J, et al. Machine learning-aided generative molecular design. Nat Mach Intell. 2024;6(6):589-604. https://doi.org/10.1038/s42256-024-00843-5
  2. Coley CW, Green WH, Jensen KF. Machine Learning in Computer-Aided Synthesis Planning. Acc Chem Res. 2018;51(5):1281-1289. https://doi.org/10.1021/acs.accounts.8b00087
  3. Liu CH, Korablyov M, Jastrzebski S, Wlodarczyk-Pruszynski P, Bengio Y, Segler M. RetroGNN: Fast Estimation of Synthesizability for Virtual Screening and De Novo Design by Learning from Slow Retrosynthesis Software. J Chem Inf Model. 2022;62(10):2293-2300. https://doi.org/10.1021/acs.jcim.1c01476
  4. Ertl P, Schuffenhauer A. Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions. J Cheminformatics. 2009;1(1):8. https://doi.org/10.1186/1758-2946-1-8
  5. Coley CW, Rogers L, Green WH, Jensen KF. SCScore: Synthetic Complexity Learned from a Reaction Corpus. J Chem Inf Model. 2018;58(2):252-261. https://doi.org/10.1021/acs.jcim.7b00622
  6. Thakkar A, Chadimová V, Bjerrum EJ, Engkvist O, Reymond JL. Retrosynthetic accessibility score (RAscore) - rapid machine learned synthesizability classification from AI driven retrosynthetic planning. Chem Sci. 2021;12(9):3339-3349. https://doi.org/10.1039/D0SC05401A
  7. Sanchez-Garcia R, Havasi D, Takács G, et al. CoPriNet: graph neural networks provide accurate and rapid compound price prediction for molecule prioritisation. Digit Discov. 2023;2(1):103-111. https://doi.org/10.1039/D2DD00071G
  8. Landrum G. RDKit: Open-source chemoinformatics. https://www.rdkit.org/docs
  9. Yu J, Wang J, Zhao H, et al. Organic Compound Synthetic Accessibility Prediction Based on the Graph Attention Mechanism. J Chem Inf Model. 2022;62(12):2973-2986. https://doi.org/10.1021/acs.jcim.2c00038
  10. Probst D, Reymond JL. A probabilistic molecular fingerprint for big data settings. J Cheminformatics. 2018;10(1):66. https://doi.org/10.1186/s13321-018-0321-8
  11. Ramsundar B, Eastman, Walters P, Pande V, Leswing K, Wu Z. Deep Learning for the Life Sciences. O'Reilly Media
  12. Lu J, Xia S, Lu J, Zhang Y. Dataset Construction to Explore Chemical Space with 3D Geometry and Deep Learning. J Chem Inf Model. 2021;61(3):1095-1104. https://doi.org/10.1021/acs.jcim.1c00007
  13. Fromer JC, Graff DE, Coley CW. Pareto optimization to accelerate multi-objective virtual screening. Digit Discov. 2024;3(3):467-481. https://doi.org/10.1039/D3DD00227F
(0.1 seconds)

[0.1 s]