PSE Community.org

ISSN: 2818-4734
Volume: 5 (2026)
Table of Contents

LAPSE:2026.0315

Published Article

LAPSE:2026.0315

Chemical Language Transformers for the Inverse Design of Novel Surfactants

Alexander W. Rogers, Ruediger Zillmer, Amanda Lane, Adam Kowalski, Dongda Zhang

June 12, 2026

Abstract
Rapid, sustainable redesign of large functional molecules demands efficient exploration of vast chemical spaces. Chemical language models (CLMs), especially transformers, can learn long-range structure-property relationships and enable fast candidate generation after training. However, inverse molecular design is ill-posed - many structures can meet the same target - and conditioned generation often decodes to invalid or off-spec molecules. To address this challenge, we propose a CLM-based inverse design framework that optimises latent representations toward target properties and explicitly evaluates round-trip fidelity, i.e., whether decoded candidates remain on-target after decoding and forward re-evaluation. To improve reliability, we introduce post-decoding beam re-ranking using round-trip consistency and a predictor-guided minimal-edit repair step that corrects invalid near-misses while preserving closeness to the target property. We demonstrate the approach on surfactant critical micelle concentration (CMC) design, benchmarking existing large pretrained CLMs against our lightweight domain-trained CLM. The framework produces a high proportion of valid and diverse molecules (~90%) while maintaining target property error near 1%. Moreover, atom-level saliency analysis confirms that the generated structures follow established surfactant design rules, supporting interpretable structure-property control. Overall, the framework provides an efficient and broadly applicable solution to reliable inverse design of novel functional molecules.

Record ID

LAPSE:2026.0315

Keywords

chemical language models, interpretable AI, Inverse molecular design, surfactants, transformers

Subject

Modelling and Simulations

Suggested Citation

Rogers AW, Zillmer R, Lane A, Kowalski A, Zhang D. Chemical Language Transformers for the Inverse Design of Novel Surfactants. Systems and Control Transactions 5:903-909 (2026) https://doi.org/10.69997/sct.161720

Author Affiliations

Rogers AW: The University of Manchester, Department of Chemical Engineering, Manchester, UK [ORCID]
Zillmer R: Unilever, R&D Port Sunlight, Liverpool, UK
Lane A: Unilever, R&D Port Sunlight, Liverpool, UK
Kowalski A: Unilever, R&D Port Sunlight, Liverpool, UK
Zhang D: The University of Manchester, Department of Chemical Engineering, Manchester, UK. Unilever, R&D Port Sunlight, Liverpool, UK [ORCID]
[Login] to see author email addresses.

Journal Name

Systems and Control Transactions

Volume

First Page

903

Last Page

909

Year

2026

Publication Date

2026-06-12

DOI: