Proceedings of ESCAPE 35ISSN: 2818-4734
Volume: 4 (2025)
Table of Contents
LAPSE:2025.0170v1
Published Article
LAPSE:2025.0170v1
Diagnosing Faults in Wastewater Systems: A Data-Driven Approach to Handle Imbalanced Big Data
M. Zadkarami, K.V. Gernaey, A.A. Safavi, P. Ramin
June 27, 2025
Abstract
Process monitoring is essential in industrial settings to ensure system functionality, necessitating the identification and understanding of fault causes. While a substantial body of research focuses on fault detection, fault diagnosis has received significantly less attention. Typically, faults originate either from abnormal instrument behavior, indicating the need for calibration or replacement, or from process faults, signaling a malfunction within the system. A primary objective of this study is to apply the proposed fault diagnosis methodology to a benchmark that closely mirrors real-world conditions. Specifically, we introduce a fault diagnosis framework for a wastewater treatment plant (WWTP) that effectively addresses the challenges posed by imbalanced big data commonly encountered in large-scale systems. In our study, four distinct fault scenarios were investigated: fault-free conditions, process faults only, sensor faults only, and simultaneous sensor and process faults. To enhance our research, we explored various techniques and approaches to optimally address the fault diagnosis problem. Implementing an undersampling technique based on clustering and determining appropriate hyperparameters proved challenging, alongside deciding the most effective approach for undersampling. Additionally, advanced time-series models, including Long Short-Term Memory (LSTM) and Bidirectional LSTM (BiLSTM), were employed, with particular emphasis on identifying the correct regularization factor and number of epochs being critical. When optimal model parameters are selected, each fault scenario achieves an accuracy of at least 90%, and the maximum simulation time does not exceed 15 minutes. Furthermore, we developed a Python-based GUI for this study to enable users to fully explore the capabilities of the proposed fault diagnosis algorithm.
Keywords
Artificial Intelligence, Big Data, Industry 40, Process Monitoring, Wastewater
Suggested Citation
Zadkarami M, Gernaey K, Safavi A, Ramin P. Diagnosing Faults in Wastewater Systems: A Data-Driven Approach to Handle Imbalanced Big Data. Systems and Control Transactions 4:123-128 (2025) https://doi.org/10.69997/sct.114532
Author Affiliations
Zadkarami M: School of Electrical and Computer Engineering, Shiraz University, Iran
Gernaey K: Process and Systems Engineering Center (PROSYS), Department of Chemical and Biochemical Engineering, Technical University of Denmark (DTU), Denmark
Safavi A: School of Electrical and Computer Engineering, Shiraz University, Iran
Ramin P: Process and Systems Engineering Center (PROSYS), Department of Chemical and Biochemical Engineering, Technical University of Denmark (DTU), Denmark
Journal Name
Systems and Control Transactions
Volume
4
First Page
123
Last Page
128
Year
2025
Publication Date
2025-07-01
Version Comments
Original Submission
Other Meta
PII: 0123-0128-1289-SCT-4-2025, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2025.0170v1
This Record
External Link

https://doi.org/10.69997/sct.114532
Article DOI
Download
Files
Jun 27, 2025
Main Article
License
CC BY-SA 4.0
Meta
Record Statistics
Record Views
682
Version History
[v1] (Original Submission)
Jun 27, 2025
 
Verified by curator on
Jun 27, 2025
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2025.0170v1
 
Record Owner
PSE Press
Links to Related Works
Directly Related to This Work
Article DOI
References Cited
  1. Gandomi A, Haider M. Beyond the hype: Big data concepts, methods, and analytics. International journal of information management. 2015 Apr 1;35(2):137-44 https://doi.org/10.1016/j.ijinfomgt.2014.10.007
  2. Zadkarami M, Safavi AA, Taheri M, Salimi FF. Data driven leakage diagnosis for oil pipelines: An integrated approach of factor analysis and deep neural network classifier. Transactions of the Institute of Measurement and Control. 2020 Oct;42(14):2708-18 https://doi.org/10.1177/0142331220928145
  3. Zadkarami M, Safavi AA, Gernaey KV, Ramin P. A Process Monitoring Framework for Imbalanced Big Data: A Wastewater Treatment Plant Case Study. IEEE Access. 2024 Sep https://doi.org/10.1109/ACCESS.2024.3454516
  4. Zadkarami M, Gernaey KV, Safavi AA, Ramin P. Big Data Analytics for Advanced Fault Detection in Wastewater Treatment Plants. InComputer Aided Chemical Engineering 2024 Jan 1 (Vol. 53, pp. 1831-1836). Elsevier https://doi.org/10.1016/B978-0-443-28824-1.50306-9
  5. Ramin P, Flores-Alsina X, Topalian SO, Jeppsson U, Gernaey K. Fault detection in a benchmark simulation model for wastewater treatment plants. InComputer Aided Chemical Engineering 2022 Jan 1 (Vol. 49, pp. 1363-1368). Elsevier https://doi.org/10.1016/B978-0-323-85159-6.50227-X