Browse
Keywords
Records with Keyword: Machine Learning
126. LAPSE:2024.1145
Artificial Intelligence for Hybrid Modeling in Fluid Catalytic Cracking (FCC)
June 21, 2024 (v1)
Subject: Modelling and Simulations
Keywords: CPFD, CREC riser simulator, FCC, Machine Learning
This study reports a novel hybrid model for the prediction of six critical process variables of importance in an industrial-scale FCC (fluid catalytic cracking) riser reactor: vacuum gas oil (VGO) conversion, outlet riser temperature, light cycle oil (LCO), gasoline, light gases, and coke yields. The proposed model is developed via the integration of a computational particle-fluid dynamics (CPFD) methodology with artificial intelligence (AI). The adopted methodology solves the first principle model (FPM) equations numerically using the CPFD Barracuda Virtual Reactor 22.0® software. Based on 216 of these CPFD simulations, the performance of an industrial-scale FCC riser reactor unit was assessed using VGO catalytic cracking kinetics developed at CREC-UWO. The dataset obtained with CPFD is employed for the training and testing of a machine learning (ML) algorithm. This algorithm is based on a multiple output feedforward neural network (FNN) selected to allow one to establish correlations... [more]
127. LAPSE:2024.1124
Power Generation Prediction for Photovoltaic System of Hose-Drawn Traveler Based on Machine Learning Models
June 21, 2024 (v1)
Subject: Modelling and Simulations
Keywords: hose-drawn traveler, Machine Learning, prediction model, PV power generation
A photovoltaic (PV)-powered electric motor is used for hose-drawn traveler driving instead of a water turbine to achieve high transmission efficiency. PV power generation (PVPG) is affected by different meteorological conditions, resulting in different power generation of PV panels for a hose-drawn traveler. In the above situation, the hose-drawn traveler may experience deficit power generation. The reasonable determination of the PV panel capacity is crucial. Predicting the PVPG is a prerequisite for the reasonable determination of the PV panel capacity. Therefore, it is essential to develop a method for accurately predicting PVPG. Extreme gradient boosting (XGBoost) is currently an outstanding machine learning model for prediction performance, but its hyperparameters are difficult to set. Thus, the XGBoost model based on particle swarm optimization (PSO-XGBoost) is applied for PV power prediction in this study. The PSO algorithm is introduced to optimize hyperparameters in XGBoost mo... [more]
128. LAPSE:2024.1104
A Study of the Physical Characteristics and Defects of Green Coffee Beans That Influence the Sensory Notes Using Machine Learning Models
June 21, 2024 (v1)
Subject: Materials
Keywords: green coffee, Machine Learning, physical properties, sensory notes
This paper presents a detailed analysis of the relation between physical characteristics and defects of green coffee beans and the sensory profile that influence the sensory notes of fragrance, aroma, flavor, and aftertaste of coffee. Machine learning models were used to identify the variables of importance and identify the ways in which these variables affect the sensory note of coffee, to determine which algorithm and its hyperparameters have greater precision in determining the sensory values of coffee such as floral, fruity, herbal, nutty, caramel, chocolate, spicy, resinous, pyrolytic, earthy, fermented, and phenolic. The result indicates the relationship and importance that exist between the physical variables, defects, and size of the green coffee bean, with respect to their respective sensory notes. The data of the proposed system demonstrate that by combining the scores of several experts, a precision can be achieved analogously to that obtained by cupping experts; therefore,... [more]
129. LAPSE:2024.0931
Using a Machine Learning Regression Approach to Predict the Aroma Partitioning in Dairy Matrices
June 7, 2024 (v1)
Subject: Food & Agricultural Processes
Keywords: aroma release, explainable artificial intelligence, food reformulation, Machine Learning, regression
Aroma partitioning in food is a challenging area of research due to the contribution of several physical and chemical factors that affect the binding and release of aroma in food matrices. The partition coefficient measured by the Kmg value refers to the partition coefficient that describes how aroma compounds distribute themselves between matrices and a gas phase, such as between different components of a food matrix and air. This study introduces a regression approach to predict the Kmg value of aroma compounds of a wide range of physicochemical properties in dairy matrices representing products of different compositions and/or processing. The approach consists of data cleaning, grouping based on the temperature of Kmg analysis, pre-processing (log transformation and normalization), and, finally, the development and evaluation of prediction models with regression methods. We compared regression analysis with linear regression (LR) to five machine-learning-based regression algorithms:... [more]
130. LAPSE:2024.0916
Data-Driven Process Monitoring and Fault Diagnosis: A Comprehensive Survey
June 7, 2024 (v1)
Subject: Process Monitoring
Keywords: fault detection and diagnosis, latent variable modeling, Machine Learning, Multivariate Statistics, process monitoring, process systems engineering
This paper presents a comprehensive review of the historical development, the current state of the art, and prospects of data-driven approaches for industrial process monitoring. The subject covers a vast and diverse range of works, which are compiled and critically evaluated based on the different perspectives they provide. Data-driven modeling techniques are surveyed and categorized into two main groups: multivariate statistics and machine learning. Representative models, namely principal component analysis, partial least squares and artificial neural networks, are detailed in a didactic manner. Topics not typically covered by other reviews, such as process data exploration and treatment, software and benchmarks availability, and real-world industrial implementations, are thoroughly analyzed. Finally, future research perspectives are discussed, covering aspects related to system performance, the significance and usefulness of the approaches, and the development environment. This work... [more]
131. LAPSE:2024.0864
A Novel Ensemble Machine Learning Model for Oil Production Prediction with Two-Stage Data Preprocessing
June 7, 2024 (v1)
Subject: Modelling and Simulations
Keywords: CEEMDAN algorithm, Machine Learning, oil production prediction, random forest algorithm, TCN-GRU-MA model
Petroleum production forecasting involves the anticipation of fluid production from wells based on historical data. Compared to traditional empirical, statistical, or reservoir simulation-based models, machine learning techniques leverage inherent relationships among historical dynamic data to predict future production. These methods are characterized by readily available parameters, fast computational speeds, high precision, and time−cost advantages, making them widely applicable in oilfield production. In this study, time series forecast models utilizing robust and efficient machine learning techniques are formulated for the prediction of production. We have fused the two-stage data preprocessing methods and the attention mechanism into the temporal convolutional network-gated recurrent unit (TCN-GRU) model. Firstly, the random forest (RF) algorithm is employed to extract key dynamic production features that influence output, serving to reduce data dimensionality and mitigate overfit... [more]
132. LAPSE:2024.0847
Exploring the REEs Energy Footprint: Interlocking AI/ML with an Empirical Approach for Analysis of Energy Consumption in REEs Production
June 7, 2024 (v1)
Subject: Modelling and Simulations
Keywords: Artificial Intelligence, energy consumption, Machine Learning, processing, rare earths
Rare earth elements (REEs including Sc, Y) are critical minerals for developing sustainable energy sources. The gradual transition adopted in developed and developing countries to meet energy targets has propelled the need for REEs in addition to critical metals (CMs). The rise in demand which has propelled REEs into the spotlight is driven by the crucial role these REEs play in technologies that aim to reduce our carbon footprint in the atmosphere. Regarding decarbonized technologies in the energy sector, REEs are widely applied for use in NdFeB permanent magnets, which are crucial parts of wind turbines and motors of electric vehicles. The underlying motive behind exploring the energy and carbon footprint caused by REEs production is to provide a more complete context and rationale for REEs usage that is more holistic. Incorporating artificial intelligence (AI)/machine learning (ML) models with empirical approaches aids in flowsheet validation, and thus, it presents a vivid holistic... [more]
133. LAPSE:2024.0826
Optimizing Short-Term Photovoltaic Power Forecasting: A Novel Approach with Gaussian Process Regression and Bayesian Hyperparameter Tuning
June 7, 2024 (v1)
Subject: Optimization
Keywords: Bayesian optimization, Gaussian process regression, Machine Learning, PV power forecasting, solar radiation intensity
The inherent volatility of PV power introduces unpredictability to the power system, necessitating accurate forecasting of power generation. In this study, a machine learning (ML) model based on Gaussian process regression (GPR) for short-term PV power output forecasting is proposed. With its benefits in handling nonlinear relationships, estimating uncertainty, and generating probabilistic forecasts, GPR is an appropriate approach for addressing the problems caused by PV power generation’s irregularity. Additionally, Bayesian optimization to identify optimal hyper-parameter combinations for the ML model is utilized. The research leverages solar radiation intensity data collected at 60-min and 30-min intervals over periods of 1 year and 6 months, respectively. Comparative analysis reveals that the data set with 60-min intervals performs slightly better than the 30-min intervals data set. The proposed GPR model, coupled with Bayesian optimization, demonstrates superior performance compar... [more]
134. LAPSE:2024.0805
Research on the Scaling Mechanism and Countermeasures of Tight Sandstone Gas Reservoirs Based on Machine Learning
June 7, 2024 (v1)
Subject: Modelling and Simulations
Keywords: enhanced oil recovery, Machine Learning, scale prevention measures, scaling mechanism, tight sandstone gas reservoirs
The Sulige gas field is a typical “three lows” (low permeability, low pressure, and low abundance) tight sandstone gas reservoir, with formation pressures often characterized by abnormally high or low pressures. The complex geological features of the reservoir further deviate from conventional understanding, impacting the effective implementation of wellbore blockage removal measures. Therefore, it is imperative to establish the wellbore blockage mechanism, prediction model, and effective prevention measures for the target area. In this study, based on field data, we first experimentally analyzed the water quality and types of blockage in the target area. Subsequently, utilizing a BP neural network model, we established a model for predicting the risk of wellbore blockage and analyzing mitigation measures in the target reservoir. The model’s prediction results, consistent with on-site actual results, demonstrate its reliability and accuracy. Experimental results show that the water qua... [more]
135. LAPSE:2024.0745
Smart Strategic Management for the Cold Plasma Process Using ORP Monitoring and Total Organic Carbon Correlation
June 6, 2024 (v1)
Subject: Modelling and Simulations
Keywords: cold plasma, Machine Learning, oxidation–reduction potential, sensor, TOC
Assessing oxidation−reduction potential (ORP) is of paramount importance in the efficient management of wastewater within both chemical and biological treatment processes. However, despite its critical role, insufficient information exists about how reactive chemical species generated by cold plasma (CP) in chemical treatment are associated with ORP and air flow rate. Therefore, we aim to identify the correlation between ORP and the removal of organic pollutants when using CP treatment. Additionally, we introduce a machine-learning-based operation to predict removal efficiency in the CP process. Results reveal a significant correlation of over 0.9 between real-time ORP and total organic carbon (TOC), which underscores the efficacy of ORP as a key parameter. This approach made it possible to control OH radical generation by regulating the air flow rate of the CP. This study posits that smart management facilitated by machine learning has the potential to enhance the economic viability o... [more]
136. LAPSE:2024.0665
Data-Driven Method for Vacuum Prediction in the Underwater Pump of a Cutter Suction Dredger
June 6, 2024 (v1)
Subject: Numerical Methods and Statistics
Keywords: cutter suction dredger, forecast, Machine Learning, vacuum for underwater pump
Vacuum is an important parameter in cutter suction dredging operations because the equipment is underwater and can easily fail. It is necessary to analyze other parameters related to the vacuum to make real-time predictions about it, which can improve the construction efficiency of the dredger under abnormal working conditions. In this paper, a data-driven method for predicting the vacuum of the underwater pump of the cutter suction dredger (CSD) is proposed with the help of big data, machine learning, data mining, and other technologies, and based on the historical data of “Hua An Long” CSD. The method eliminates anomalous data, standardizes the data set, and then relies on theory and engineering experience to achieve feature extraction using the Spearman correlation coefficient. Then, six machine learning methods were employed in this study to train and predict the data set, namely, lasso regression (lasso), elastic network (Enet), gradient boosting decision tree (including tradition... [more]
137. LAPSE:2024.0651
Diesel Adulteration Detection with a Machine Learning-Enhanced Laser Sensor Approach
June 6, 2024 (v1)
Subject: Energy Systems
Keywords: COMSOL Multiphysics, diesel adulteration, kerosene, light reflection/refraction, Machine Learning, models, refractive index, sensor
This paper introduces a novel and cost-effective method for detecting adulterated diesel, specifically targeting contamination with kerosene, by leveraging machine learning and the refractive index values of mixed diesel samples. It proposes a laser-based sensor, employing COMSOL simulations for synthetic data generation to facilitate machine learning training. This innovative approach not only streamlines the detection process by eliminating the need for expensive equipment and specialized personnel but also enables on-site testing without extensive sample preparation. The sensor’s design, utilizing light refraction and reflection principles, allows for the accurate measurement of diesel adulteration levels. Validation results showcase the machine learning models’ high precision in predicting adulteration percentages, as evidenced by an R-squared value of 0.999 and a mean absolute error of 0.074. This research signifies a leap in sensor technology, offering a practical solution for ra... [more]
138. LAPSE:2024.0435
Artificial Neural Network Modeling in the Presence of Uncertainty for Predicting Hydrogenation Degree in Continuous Nitrile Butadiene Rubber Processing
June 5, 2024 (v1)
Subject: Numerical Methods and Statistics
Keywords: hydrogenation, Machine Learning, mechanistic modeling, static mixer reactor, uncertainty
The transition from batch to continuous production in the catalytic hydrogenation of nitrile butadiene rubber (NBR) into hydrogenated NBR (HNBR) marks a significant advance for applications under demanding conditions. This study introduces a continuous process utilizing a static mixer (SM) reactor, which notably achieves a hydrogenation conversion rate exceeding 97%. We thoroughly review a mechanistic model of the SM reactor to elucidate the internal dynamics governing the hydrogenation process and address the inherent uncertainties in key parameters such as the Peclet number (Pe), dimensionless time (θτ), reaction coefficient (R), and flow rate coefficient (q). A comprehensive dataset generated from varied parameter values serves as the basis for training an artificial neural network (ANN), which is then compared against traditional models including linear regression, decision tree, and random forest in terms of efficacy. Our results clearly demonstrate the ANN’s superiority in predic... [more]
139. LAPSE:2024.0427
Machine Learning Algorithms That Emulate Controllers Based on Particle Swarm Optimization—An Application to a Photobioreactor for Algal Growth
June 5, 2024 (v1)
Subject: Optimization
Particle Swarm Optimization (PSO) algorithms within control structures are a realistic approach; their task is often to predict the optimal control values working with a process model (PM). Owing to numerous numerical integrations of the PM, there is a big computational effort that leads to a large controller execution time. The main motivation of this work is to decrease the computational effort and, consequently, the controller execution time. This paper proposes to replace the PSO predictor with a machine learning model that has “learned” the quasi-optimal behavior of the couple (PSO and PM); the training data are obtained through closed-loop simulations over the control horizon. The new controller should preserve the process’s quasi-optimal control. In identical conditions, the process evolutions must also be quasi-optimal. The multiple linear regression and the regression neural networks were considered the predicting models. This paper first proposes algorithms for collecting and... [more]
140. LAPSE:2024.0413
Investigating Precise Decision-Making in Greenhouse Environments Based on Intelligent Optimization Algorithms
June 5, 2024 (v1)
Subject: Environment
Keywords: greenhouse environment, Machine Learning, model library, precise decision-making, tomato
The precise control of a greenhouse environment is vital in production. Currently, environmental control in traditional greenhouse production relies on experience, making it challenging to accurately control it, leading to environmental stress, resource waste, and pollution. Hence, this paper proposes a decision-making greenhouse environment control strategy that employs an existing monitoring system and intelligent algorithms to enhance greenhouse productivity and reduce costs. Specifically, a model library is created based on machine learning algorithms, and an intelligent optimization algorithm is designed based on the Non-Dominated Sorting Genetic Algorithm III (NSGA-3) and an expert experience knowledge base. Then, optimal environmental decision-making solutions under different greenhouse environments are obtained by adjusting the greenhouse environmental parameters. Our method’s effectiveness is verified through a simulated fertilization plan that was simulated for a real greenho... [more]
141. LAPSE:2024.0356
Optimization of Abnormal Hydraulic Fracturing Conditions of Unconventional Natural Gas Reservoirs Based on a Surrogate Model
June 5, 2024 (v1)
Subject: Optimization
Keywords: abnormal conditions, differential evolution, Machine Learning, probability optimization, unconventional gas
Abnormal conditions greatly reduce the efficiency of hydraulic fracturing of unconventional gas reservoirs. Optimizing the fracturing scheme is crucial to minimize the likelihood of abnormal operational conditions, such as pressure channeling, casing deformation, and proppant plugging. This paper proposes a novel machine learning-based method for optimizing abnormal conditions during hydraulic fracturing of unconventional natural gas reservoirs. Firstly, the main controlling factors of abnormal conditions are selected through a hybrid controlling analysis, upon which a surrogate model is established for predicting the occurrence probability of abnormal conditions, rather than whether abnormal conditions happen or not. Subsequently, a machine learning-based optimization algorithm is developed to minimize the occurrence probability of abnormal conditions, acknowledging their inevitability during the fracturing process. The optimal results demonstrate the proposed method outperforms tradi... [more]
142. LAPSE:2024.0276
Supplementary Materials to “Learn-To-Design: Reinforcement Learning-Assisted Chemical Process Optimization”
April 8, 2024 (v1)
Subject: Process Design
Keywords: Artificial Intelligence, Carbon Capture, Machine Learning, Process Design Optimization, Reinforcement Learning, Simulation-based Optimization
This paper proposes an AI-assisted approach aimed at accelerating chemical process design through causal incremental reinforcement learning (CIRL) where an intelligent agent is interacting iteratively with a process simulation environment (e.g., Aspen HYSYS, DWSIM, etc.). The proposed approach is based on an incremental learnable optimizer capable of guiding multi-objective optimization towards optimal design variable configurations, depending on several factors including the problem complexity, selected RL algorithm and hyperparameters tuning. One advantage of this approach is that the agent-simulator interaction significantly reduces the vast search space of design variables, leading to an accelerated and optimized design process. This is a generic causal approach that enables the exploration of new process configurations and provides actionable insights to designers to improve not only the process design but also the design process across various applications. The approach was valid... [more]
143. LAPSE:2024.0263
Machine Learning Aided Prediction of Glass-Forming Ability of Metallic Glass
February 19, 2024 (v1)
Subject: Process Design
Keywords: glass-forming ability, Machine Learning, metallic glass, optimal design
The prediction of the glass-forming ability (GFA) of metallic glasses (MGs) can accelerate the efficiency of their development. In this paper, a dataset was constructed using experimental data collected from the literature and books, and a machine learning-based predictive model was established to predict the GFA. Firstly, a classification model based on the size of the critical diameter (Dmax) was established to determine whether an alloy system could form a glass state, with an accuracy rating of 0.98. Then, regression models were established to predict the crystallization temperature (Tx), glass transition temperature (Tg), and liquidus temperature (Tl) of MGs. The R2 of the prediction model obtained in the test set was greater than 0.89, which showed that the model had good prediction accuracy. The key features used by the regression models were analyzed using variance, correlation, embedding, recursive, and exhaustive methods to select the most important features. Furthermore, to... [more]
144. LAPSE:2024.0068
Exploring Bayesian Optimization for Photocatalytic Reduction of CO2
January 12, 2024 (v1)
Subject: Optimization
Keywords: Bayesian optimization, design of experiment, Machine Learning, photocatalytic reduction, reaction optimization
The optimization of photocatalysis is complex, as heterogenous catalysis makes its kinetic modeling or design of experiment (DOE) significantly more difficult than homogeneous reactions. On the other hand, Bayesian optimization (BO) has been found to be efficient in the optimization of many complex chemical problems but has rarely been studied in photocatalysis. In this paper, we developed a BO platform and applied it to the optimization of three photocatalytic CO2 reduction systems that have been kinetically modeled in previous studies. Three decision variables, namely, partial pressure of CO2, partial pressure of H2O, and reaction time, were used to optimize the reaction rate. We first compared BO with the traditional DOE methods in the Khalilzadeh and Tan systems and found that the optimized reaction rates predicted by BO were 0.7% and 11.0% higher, respectively, than the best results of optimization by DOE, and were significantly better than the original experimental data, which we... [more]
145. LAPSE:2023.36921
A Comprehensive Prediction Method for Pore Pressure in Abnormally High-Pressure Blocks Based on Machine Learning
November 30, 2023 (v1)
Subject: Other
Keywords: empirical models, Extra Trees, KNN, LightGBM, Machine Learning, overpressure, pore pressure prediction, Random Forest, well logs
In recent years, there has been significant research and practical application of machine learning methods for predicting reservoir pore pressure. However, these studies frequently concentrate solely on reservoir blocks exhibiting normal-pressure conditions. Currently, there exists a scarcity of research addressing the prediction of pore pressure within reservoir blocks characterized by abnormally high pressures. In light of this, the present paper introduces a machine learning-based approach to predict pore pressure within reservoir blocks exhibiting abnormally high pressures. The methodology is demonstrated using the X block as a case study. Initially, the combination of the density−sonic velocity crossplot and the Bowers method is favored for elucidating the overpressure-to-compact mechanism within the X block. The elevated pressure within the lower reservoir is primarily attributed to the pressure generated during hydrocarbon formation. The Bowers method has been chosen to forecast... [more]
146. LAPSE:2023.36862
A Fault Warning Approach Using an Enhanced Sand Cat Swarm Optimization Algorithm and a Generalized Neural Network
November 30, 2023 (v1)
Subject: Numerical Methods and Statistics
Keywords: Fault Detection, generalized neural network, industrial systems, Machine Learning, sand cat swarm optimization
With the continuous development and complexity of industrial systems, various types of industrial equipment and systems face increasing risks of failure during operation. Important to these systems is fault warning technology, which can timely detect anomalies before failures and take corresponding preventive measures, thereby reducing production interruptions and maintenance costs, improving production efficiency, and enhancing equipment reliability. Machine learning techniques have proven highly effective for fault detection in modern production processes. Among numerous machine learning algorithms, the generalized neural network stands out due to its simplicity, effectiveness, and applicability to various fault warning scenarios. However, the increasing complexity of systems and equipment presents significant challenges to the generalized neural network. In real-world scenarios, it suffers from drawbacks such as difficulties in determining parameters and getting trapped in local opt... [more]
147. LAPSE:2023.36718
Green Techniques for Detecting Microplastics in Marine with Emphasis on FTIR and NIR Spectroscopy—Short Review
September 21, 2023 (v1)
Subject: Modelling and Simulations
Keywords: ecotoxicological testing, health impact, Machine Learning, marine pollution, microplastics, microplastics analysis, novel methods
The amount of microplastics (MPs) present in marine ecosystems are a growing concern, with potential impacts on human health because they are associated with an increase in the ecotoxicity of certain foods, such as fish. As a result, there has been a growing interest in developing effective methods for the analysis of MPs in marine waters. Traditional methods for MP analysis involve visual inspection and manual sorting, which can be time-consuming and subject to human error. However, novel methods have been developed that offer more efficient and accurate analyses. One such method is based on spectroscopy, such as Fourier transform infrared spectroscopy (FTIR). Another method involves the use of fluorescent dyes, which can selectively bind to microplastics and allow for their detection under UV light. Additionally, machine learning approaches have been developed to analyze large volumes of water samples for MP detection and classification. These methods involve the use of specialized a... [more]
148. LAPSE:2023.36709
Counting Abalone with High Precision Using YOLOv3 and DeepSORT
September 21, 2023 (v1)
Subject: Modelling and Simulations
Keywords: abalone, abalone counting, abalone detection, deep learning, Machine Learning
In this research work, an approach using You Only Look Once version three (YOLOv3)-TensorFlow for abalone detection and Deep Simple Online Real-time Tracking (DeepSORT) for abalone tracking in conveyor belt systems is proposed. The conveyor belt system works in coordination with the cameras used to detect abalones. Considering the computational effectiveness and improved detection algorithms, this proposal is promising compared to the previously proposed methods. Some of these methods have low effectiveness and accuracy, and they provide an incorrect counting rate because some of the abalones tend to entangle, resulting in counting two or more abalones as one. Conducting detection and tracking research is crucial to achieve modern solutions for small- and large-scale fishing industries that enable them to accomplish higher automation, non-invasiveness, and low cost. This study is based on the development and improvement of counting analysis tools for automation in the fishing industry.... [more]
149. LAPSE:2023.36683
Critical Analysis of Risk Factors and Machine-Learning-Based Gastric Cancer Risk Prediction Models: A Systematic Review
September 20, 2023 (v1)
Subject: Biosystems
Keywords: classification algorithm, gastric cancer, Machine Learning, predictive factors, risk prediction model
The gastric cancer risk prediction model used for large-scale gastric cancer screening and individual risk stratification is an artificial intelligence tool that combines clinical diagnostic data with a classification algorithm. The ability to automatically make a quantitative assessment of complex clinical data contributes to increased accuracy for diagnosis with higher efficiency, significantly reducing the incidence of advanced gastric cancer. Previous studies have explored the predictive performance of gastric cancer risk prediction models, as well as the predictive factors and algorithms between each model, but have reached controversial conclusions. Thus, the performance of current machine-learning-based gastric cancer risk prediction models alongside the clinical relevance of different predictive factors needs to be evaluated to help build more efficient and feasible models in the future. In this systematic review, we summarize the current research progress related to the gastri... [more]
150. LAPSE:2023.36652
An Interpretable Predictive Model for Health Aspects of Solvents via Rough Set Theory
September 20, 2023 (v1)
Subject: Modelling and Simulations
Keywords: health indices, Machine Learning, organic solvents, rough set theory, rough set-based machine learning
This paper presents a machine learning (ML) approach to predict the potential health issues of solvents by uncovering the hidden relationship between substances and toxicity. Solvent selection is a crucial step in industrial processes. However, prolonged exposure to solvents has been found to pose significant risks to human health. To mitigate these hazards, it is crucial to develop a predictive model for health performance by identifying the contributing factors to solvent toxicity. This research aims to develop a predictive model for health issues related to solvent toxicity. Among various algorithms in ML, Rough Set Machine Learning (RSML) was chosen for this work due to its interpretable nature of the generated models. The models have been developed through data collection on the toxicity of various organic solvents, the construction of predictive models with decision rules, and model verification. The results reveal correlations between solvent toxicity and the Balaban index, vale... [more]
[Show All Keywords]
[0.06 s]


