LAPSE:2025.0535
Published Article

LAPSE:2025.0535
Machine Learning Models for Predicting the Amount of Nutrients Required in a Microalgae Cultivation System
June 27, 2025
Abstract
Effective prediction of nutrient demands is crucial for optimising microalgae growth, maximising productivity and minimising the waste of resources. With the increasing amount of data related to microalgae cultivation systems, data mining and machine learning models to extract additional knowledge have gained popularity. In the development of such models, a data preprocessing stage is necessary due to the poor data quality. At this stage, cleaning and outlier removal techniques are employed to eliminate missing data and outliers, respectively. Afterwards, data splitting and cross-validation strategies are employed to ensure that the models are trained and evaluated with representative subsets of the data. Principal component analysis is also applied to simplify complex environmental datasets by reducing the number of features while retaining as much information as possible. To further improve prediction capabilities, ensemble methods are incorporated, leveraging multiple models to achieve a higher overall performance. Stacking, a popular ensemble technique, is used to combine the outputs of individual models into a single meta-model. The application of these machine learning methods has been demonstrated using a dataset acquired from the cultivation of the microalgae Dunaliella in a flat-panel photobioreactor. The results have shown that the data mining workflow, in combination with different machine learning models, was able to describe the nutrients requirements enforcing good performance of microalgae Dunaliella ?-carotene production in the carotenogenic phase.
Effective prediction of nutrient demands is crucial for optimising microalgae growth, maximising productivity and minimising the waste of resources. With the increasing amount of data related to microalgae cultivation systems, data mining and machine learning models to extract additional knowledge have gained popularity. In the development of such models, a data preprocessing stage is necessary due to the poor data quality. At this stage, cleaning and outlier removal techniques are employed to eliminate missing data and outliers, respectively. Afterwards, data splitting and cross-validation strategies are employed to ensure that the models are trained and evaluated with representative subsets of the data. Principal component analysis is also applied to simplify complex environmental datasets by reducing the number of features while retaining as much information as possible. To further improve prediction capabilities, ensemble methods are incorporated, leveraging multiple models to achieve a higher overall performance. Stacking, a popular ensemble technique, is used to combine the outputs of individual models into a single meta-model. The application of these machine learning methods has been demonstrated using a dataset acquired from the cultivation of the microalgae Dunaliella in a flat-panel photobioreactor. The results have shown that the data mining workflow, in combination with different machine learning models, was able to describe the nutrients requirements enforcing good performance of microalgae Dunaliella ?-carotene production in the carotenogenic phase.
Record ID
Keywords
Data Mining, Dunaliella carotenogenesis, Machine Learning, Microalgae Cultivation
Subject
Suggested Citation
Freitas GR, Badenes SM, Oliveira R, Martins FG. Machine Learning Models for Predicting the Amount of Nutrients Required in a Microalgae Cultivation System. Systems and Control Transactions 4:2385-2390 (2025) https://doi.org/10.69997/sct.105325
Author Affiliations
Freitas GR: LEPABE, Department of Chemical Engineering, Faculty of Engineering, University of Porto, Porto, Portugal; ALiCE, Associate Laboratory in Chemical Engineering, Faculty of Engineering, University of Porto, Porto, Portugal; A4F Algae for future, Campus do
Badenes SM: A4F Algae for future, Campus do Lumiar, Estrada do Paço do Lumiar, Lisbon, Portugal
Oliveira R: UCIBIO, Department of Chemistry, NOVA School of Science and Technology, NOVA University Lisbon, Caparica, Portugal; Associate Laboratory i4HB - Institute for Health and Bioeconomy, NOVA School of Science and Technology, Universidade NOVA de Lisboa, Portug
Martins FG: LEPABE, Department of Chemical Engineering, Faculty of Engineering, University of Porto, Porto, Portugal; ALiCE, Associate Laboratory in Chemical Engineering, Faculty of Engineering, University of Porto, Porto, Portugal
Badenes SM: A4F Algae for future, Campus do Lumiar, Estrada do Paço do Lumiar, Lisbon, Portugal
Oliveira R: UCIBIO, Department of Chemistry, NOVA School of Science and Technology, NOVA University Lisbon, Caparica, Portugal; Associate Laboratory i4HB - Institute for Health and Bioeconomy, NOVA School of Science and Technology, Universidade NOVA de Lisboa, Portug
Martins FG: LEPABE, Department of Chemical Engineering, Faculty of Engineering, University of Porto, Porto, Portugal; ALiCE, Associate Laboratory in Chemical Engineering, Faculty of Engineering, University of Porto, Porto, Portugal
Journal Name
Systems and Control Transactions
Volume
4
First Page
2385
Last Page
2390
Year
2025
Publication Date
2025-07-01
Version Comments
Original Submission
Other Meta
PII: 2385-2390-1421-SCT-4-2025, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2025.0535
This Record
External Link

https://doi.org/10.69997/sct.105325
Article DOI
Download
Meta
Record Statistics
Record Views
878
Version History
[v1] (Original Submission)
Jun 27, 2025
Verified by curator on
Jun 27, 2025
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2025.0535
Record Owner
PSE Press
Links to Related Works
References Cited
- Ge'ron A. HANDS-ON MACHINE LEARNING WITH SCIKIT-LEARN AND TENSORFLOW: CONCEPTS, TOOLS, AND TECHNIQUES TO BUILD INTELLIGENT SYSTEMS. O'Reilly (2018)
- Witten I H, Frank E, Hall M A. DATA MINING: PRACTICAL MACHINE LEARNING TOOLS AND TECHNIQUES. Morgan Kaufmann Publishers (2011)
- Reimann R, Zeng B, Jakopec M, Burdukiewicz M, Petrick I, Schierack P, Rödiger S. Classification of dead and living microalgae Chlorella vulgaris by bioimage informatics and machine learning. Algal Res 48 (2020) https://doi.org/10.1016/j.algal.2020.101908
- Singh V, Verma M, Chivate M S, Mishra V. Machine learning-based optimisation of microalgae biomass production by using wastewater. J Environ Chem Eng 11 (2023) https://doi.org/10.1016/j.jece.2023.111387
- Meenatchisundaram K, Gowd S C, Lee J, Barathi S, Rajendran K. Data-driven model development for prediction and optimization of biomass yield of microalgae-based wastewater treatment. Sustain En Tech and Assmt 63 (2024) https://doi.org/10.1016/j.seta.2024.103670
- Barbosa M, Inácio L G, Afonso C, Maranhão P. The microalga Dunaliella and its applications: a review. Applied Phycology 4:99-120 (2023) https://doi.org/10.1080/26388081.2023.2222318
- Ye Z W, Jiang J G, Wu G H. Biosynthesis and regulation of carotenoids in Dunaliella: Progresses and prospects. Biotechnol Adv 26:352-360 (2008) https://doi.org/10.1016/j.biotechadv.2008.03.004
- Carvalho A P, Meireles L A, Malcata F X. Microalgal reactors: A review of enclosed system designs and performances. Biotechnol Prog 22:1490-1506 (2006) https://doi.org/10.1021/bp060065r
- Han J, Kamber M, Pei J. DATA MINING: CONCEPTS AND TECHNIQUES. 3rd Edition, Morgan Kaufmann Publishers. Elsevier (2012)
- Alpaydin E. INTRODUCTION TO MACHINE LEARNING. The MIT Press (2010)
- Ding C, Peng H. Minimum Redundancy Feature Selection from Microarray Gene Expression Data. Proceedings of the Computational Systems Bioinformatics (2003)
- Mazzelli A, Cicci A, Di Caprio F, Altimari P, Toro L, Iaquaniello G, Pagnanelli F. Multivariate modeling for microalgae growth in outdoor photobioreactors. Algal Res 45 (2020) https://doi.org/10.1016/j.algal.2019.101663
- Kennard R W, Stone L A. Computer Aided Design of Experiments. Technometrics 11:137-148 (1969) https://doi.org/10.1080/00401706.1969.10490666
(0.09 seconds)
[0.09 s]

