PSE Community.org

ISSN: 2818-4734
Volume: 5 (2026)
Table of Contents

LAPSE:2026.0393

Published Article

LAPSE:2026.0393

A Multimodal Framework Integrating Procedural Texts and Visual Perception for Laboratory Safety Monitoring

Shuo Xu, Jinsong Zhao

June 12, 2026

Abstract
Laboratory safety is procedure-dependent: required personal protective equipment (PPE) and permissible actions vary across experiments and across experimental steps, yet most vision-based monitoring remains appearance-driven and often produces generic warnings without reliable procedural context. We propose a multimodal framework for step-aware safety monitoring in laboratory videos. The framework first localizes procedural context through clip-level step prediction and protocol alignment to identify the experiment and current step. Given this context, it retrieves step-specific safety constraints, extracts evidence of step-relevant equipment and interactions using an equipment database, and prompts a video-capable vision-language model (VLM) to generate structured (JSON) monitoring reports supported by retrieved constraints and visual evidence. Experiments on protocol-annotated molecular biology lab videos show that our approach improves the mean score from 0.4352 to 0.6430 and reduces the missing rate from 65.00% to 33.75% relative to a video-only baseline, demonstrating more faithful and step-specific safety judgments.

Record ID

LAPSE:2026.0393

Keywords

Artificial Intelligence, Laboratory Safety Monitoring, Vision-Language Model

Subject

Modelling and Simulations

Suggested Citation

Xu S, Zhao J. A Multimodal Framework Integrating Procedural Texts and Visual Perception for Laboratory Safety Monitoring. Systems and Control Transactions 5:1503-1512 (2026) https://doi.org/10.69997/sct.104078

Author Affiliations

Xu S: Tsinghua University, Department of Chemical Engineering, Beijing, China. State Key Laboratory of Chemical Engineering and Low-Carbon Technology, Tsinghua University
Zhao J: Tsinghua University, Department of Chemical Engineering, Beijing, China. State Key Laboratory of Chemical Engineering and Low-Carbon Technology, Tsinghua University
[Login] to see author email addresses.

Journal Name

Systems and Control Transactions

Volume

First Page

1503

Last Page

1512

Year

2026

Publication Date

2026-06-12

DOI: