LAPSE:2023.2768
Published Article

LAPSE:2023.2768
Prediction of Heavy Metal Concentrations in Contaminated Sites from Portable X-ray Fluorescence Spectrometer Data Using Machine Learning
February 21, 2023
Abstract
Portable X-ray fluorescence (pXRF) spectrometers provide simple, rapid, nondestructive, and cost-effective analysis of the metal contents in soils. The current method for improving pXRF measurement accuracy is soil sample preparation, which inevitably consumes significant amounts of time. To eliminate the influence of sample preparation on PXRF measurements, this study evaluates the performance of pXRF measurements in the prediction of eight heavy metals’ contents through machine learning algorithm linear regression (LR) and multivariate adaptive regression spline (MARS) models. Soil samples were collected from five industrial sites and separated into high-value and low-value datasets with pXRF measurements above or below the background values. The results showed that for Cu and Cr, the MARS models were better than the LR models at prediction (the MARS-R2 values were 0.88 and 0.78; the MARS-RPD values were 2.89 and 2.11). For the pXRF low-value dataset, the multivariate MARS models improved the pXRF measurement accuracy, with the R2 values improved from 0.032 to 0.39 and the RPD values increased by 0.02 to 0.37. For the pXRF high-value dataset, the univariate MARS models predicted the content of Cu and Cr with less calculation. Our study reveals that machine learning methods can better predict the Cu and Cr of large samples from multiple contaminated sites.
Portable X-ray fluorescence (pXRF) spectrometers provide simple, rapid, nondestructive, and cost-effective analysis of the metal contents in soils. The current method for improving pXRF measurement accuracy is soil sample preparation, which inevitably consumes significant amounts of time. To eliminate the influence of sample preparation on PXRF measurements, this study evaluates the performance of pXRF measurements in the prediction of eight heavy metals’ contents through machine learning algorithm linear regression (LR) and multivariate adaptive regression spline (MARS) models. Soil samples were collected from five industrial sites and separated into high-value and low-value datasets with pXRF measurements above or below the background values. The results showed that for Cu and Cr, the MARS models were better than the LR models at prediction (the MARS-R2 values were 0.88 and 0.78; the MARS-RPD values were 2.89 and 2.11). For the pXRF low-value dataset, the multivariate MARS models improved the pXRF measurement accuracy, with the R2 values improved from 0.032 to 0.39 and the RPD values increased by 0.02 to 0.37. For the pXRF high-value dataset, the univariate MARS models predicted the content of Cu and Cr with less calculation. Our study reveals that machine learning methods can better predict the Cu and Cr of large samples from multiple contaminated sites.
Record ID
Keywords
heavy metals, in situ pXRF, multivariate adaptive regression splines (MARS), rapid field screening, site investigation
Subject
Suggested Citation
Xia F, Fan T, Chen Y, Ding D, Wei J, Jiang D, Deng S. Prediction of Heavy Metal Concentrations in Contaminated Sites from Portable X-ray Fluorescence Spectrometer Data Using Machine Learning. (2023). LAPSE:2023.2768
Author Affiliations
Xia F: Nanjing Institute of Environmental Science, Ministry of Ecology and Environment, Nanjing 210042, China; Key Laboratory of Soil Environmental Management and Pollution Control, Ministry of Ecology and Environment, Nanjing 210042, China [ORCID]
Fan T: Nanjing Institute of Environmental Science, Ministry of Ecology and Environment, Nanjing 210042, China; Key Laboratory of Soil Environmental Management and Pollution Control, Ministry of Ecology and Environment, Nanjing 210042, China
Chen Y: Nanjing Institute of Environmental Science, Ministry of Ecology and Environment, Nanjing 210042, China; Key Laboratory of Soil Environmental Management and Pollution Control, Ministry of Ecology and Environment, Nanjing 210042, China
Ding D: Nanjing Institute of Environmental Science, Ministry of Ecology and Environment, Nanjing 210042, China; Key Laboratory of Soil Environmental Management and Pollution Control, Ministry of Ecology and Environment, Nanjing 210042, China [ORCID]
Wei J: Nanjing Institute of Environmental Science, Ministry of Ecology and Environment, Nanjing 210042, China; Key Laboratory of Soil Environmental Management and Pollution Control, Ministry of Ecology and Environment, Nanjing 210042, China
Jiang D: Nanjing Institute of Environmental Science, Ministry of Ecology and Environment, Nanjing 210042, China; Key Laboratory of Soil Environmental Management and Pollution Control, Ministry of Ecology and Environment, Nanjing 210042, China
Deng S: Nanjing Institute of Environmental Science, Ministry of Ecology and Environment, Nanjing 210042, China; Key Laboratory of Soil Environmental Management and Pollution Control, Ministry of Ecology and Environment, Nanjing 210042, China [ORCID]
Fan T: Nanjing Institute of Environmental Science, Ministry of Ecology and Environment, Nanjing 210042, China; Key Laboratory of Soil Environmental Management and Pollution Control, Ministry of Ecology and Environment, Nanjing 210042, China
Chen Y: Nanjing Institute of Environmental Science, Ministry of Ecology and Environment, Nanjing 210042, China; Key Laboratory of Soil Environmental Management and Pollution Control, Ministry of Ecology and Environment, Nanjing 210042, China
Ding D: Nanjing Institute of Environmental Science, Ministry of Ecology and Environment, Nanjing 210042, China; Key Laboratory of Soil Environmental Management and Pollution Control, Ministry of Ecology and Environment, Nanjing 210042, China [ORCID]
Wei J: Nanjing Institute of Environmental Science, Ministry of Ecology and Environment, Nanjing 210042, China; Key Laboratory of Soil Environmental Management and Pollution Control, Ministry of Ecology and Environment, Nanjing 210042, China
Jiang D: Nanjing Institute of Environmental Science, Ministry of Ecology and Environment, Nanjing 210042, China; Key Laboratory of Soil Environmental Management and Pollution Control, Ministry of Ecology and Environment, Nanjing 210042, China
Deng S: Nanjing Institute of Environmental Science, Ministry of Ecology and Environment, Nanjing 210042, China; Key Laboratory of Soil Environmental Management and Pollution Control, Ministry of Ecology and Environment, Nanjing 210042, China [ORCID]
Journal Name
Processes
Volume
10
Issue
3
First Page
536
Year
2022
Publication Date
2022-03-09
ISSN
2227-9717
Version Comments
Original Submission
Other Meta
PII: pr10030536, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2023.2768
This Record
External Link

https://doi.org/10.3390/pr10030536
Publisher Version
Download
Meta
Record Statistics
Record Views
242
Version History
[v1] (Original Submission)
Feb 21, 2023
Verified by curator on
Feb 21, 2023
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2023.2768
Record Owner
Auto Uploader for LAPSE
Links to Related Works
