LAPSE:2024.0931
Published Article
LAPSE:2024.0931
Using a Machine Learning Regression Approach to Predict the Aroma Partitioning in Dairy Matrices
June 7, 2024
Abstract
Aroma partitioning in food is a challenging area of research due to the contribution of several physical and chemical factors that affect the binding and release of aroma in food matrices. The partition coefficient measured by the Kmg value refers to the partition coefficient that describes how aroma compounds distribute themselves between matrices and a gas phase, such as between different components of a food matrix and air. This study introduces a regression approach to predict the Kmg value of aroma compounds of a wide range of physicochemical properties in dairy matrices representing products of different compositions and/or processing. The approach consists of data cleaning, grouping based on the temperature of Kmg analysis, pre-processing (log transformation and normalization), and, finally, the development and evaluation of prediction models with regression methods. We compared regression analysis with linear regression (LR) to five machine-learning-based regression algorithms: Random Forest Regressor (RFR), Gradient Boosting Regression (GBR), Extreme Gradient Boosting (XGBoost, XGB), Support Vector Regression (SVR), and Artificial Neural Network Regression (NNR). Explainable AI (XAI) was used to calculate feature importance and therefore identify the features that mainly contribute to the prediction. The top three features that were identified are log P, specific gravity, and molecular weight. For the prediction of the Kmg in dairy matrices, R2 scores of up to 0.99 were reached. For 37.0 °C, which resembles the temperature of the mouth, RFR delivered the best results, and, at lower temperatures of 7.0 °C, typical for a household fridge, XGB performed best. The results from the models work as a proof of concept and show the applicability of a data-driven approach with machine learning to predict the Kmg value of aroma compounds in different dairy matrices.
Keywords
aroma release, explainable artificial intelligence, food reformulation, Machine Learning, regression
Suggested Citation
Anker M, Borsum C, Zhang Y, Zhang Y, Krupitzer C. Using a Machine Learning Regression Approach to Predict the Aroma Partitioning in Dairy Matrices. (2024). LAPSE:2024.0931
Author Affiliations
Anker M: Department of Food Informatics and Computational Science Hub, University of Hohenheim, 70599 Stuttgart, Germany [ORCID]
Borsum C: Department of Food Informatics and Computational Science Hub, University of Hohenheim, 70599 Stuttgart, Germany; Department of Process Engineering (Essential Oils, Natural Cosmetics), University of Applied Sciences Kempten, 87435 Kempten, Germany [ORCID]
Zhang Y: Department of Flavor Chemistry, University of Hohenheim, 70599 Stuttgart, Germany
Zhang Y: Department of Flavor Chemistry, University of Hohenheim, 70599 Stuttgart, Germany [ORCID]
Krupitzer C: Department of Food Informatics and Computational Science Hub, University of Hohenheim, 70599 Stuttgart, Germany [ORCID]
Journal Name
Processes
Volume
12
Issue
2
First Page
266
Year
2024
Publication Date
2024-01-26
ISSN
2227-9717
Version Comments
Original Submission
Other Meta
PII: pr12020266, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2024.0931
This Record
External Link

https://doi.org/10.3390/pr12020266
Publisher Version
Download
Files
Jun 7, 2024
Main Article
License
CC BY 4.0
Meta
Record Statistics
Record Views
470
Version History
[v1] (Original Submission)
Jun 7, 2024
 
Verified by curator on
Jun 7, 2024
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2024.0931
 
Record Owner
Auto Uploader for LAPSE
Links to Related Works
Directly Related to This Work
Publisher Version