Pursuit of the Ultimate Regression Model for Samarium(III), Europium(III), and LiCl Using Laser-Induced Fluorescence, Design of Experiments, and a Genetic Algorithm for Feature Selection

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

Laser-induced fluorescence spectroscopy, Raman scattering, and partial least squares regression models were optimized for the quantification of samarium (0-150 μg mL-1), europium (0-75 μg mL-1), and lithium chloride (0.1-12 M) with a transformational preprocessing strategy. Selecting combinations of preprocessing methods to optimize the prediction performance of regression models is frequently a major bottleneck for chemometric analysis. Here, we propose an optimization tool using an innovative combination of optimal experimental designs for selecting preprocessing transformation and a genetic algorithm (GA) for feature selection. A D-optimal design containing 26 samples (i.e., combinations of preprocessing strategies) and a user-defined design (576 samples) did not statistically lower the root mean square error of the prediction (RMSEP). The greatest improvement in prediction performance was achieved when a GA was used for feature selection. This feature selection greatly lowered RMSEP statistics by an average of 53%, resulting in the top models with percent RMSEP values of 0.91, 3.5, and 2.1% for Sm(III), Eu(III), and LiCl, respectively. These results indicate that preprocessing corrections (e.g., scatter, scaling, noise, and baseline) alone cannot realize the optimal regression model; feature selection is a more crucial aspect to consider. This unique approach provides a powerful tool for approaching the true optimum prediction performance and can be applied to numerous fields of spectroscopy and chemometrics to rapidly construct models.

Original languageEnglish
Pages (from-to)2281-2290
Number of pages10
JournalACS Omega
Volume8
Issue number2
DOIs
StatePublished - Jan 17 2023

Funding

This manuscript has been authored by UT-Battelle LLC under the contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The publisher, by accepting the article for publication, acknowledges that the US government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript or allow others to do so, for US government purposes. DOE will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan ( http://energy.gov/downloads/doe-public-access-plan ). Acknowledgments The authors wish to thank Laetitia Delmau for helpful discussions regarding lanthanide speciation and Kyle Morgan for assistance. This work was supported by the Cf Program at Oak Ridge National Laboratory. 252 This research was supported by the US Department of Energy Isotope Program, managed by the Office of Science. Oak Ridge National Laboratory is managed by UT-Battelle, LLC, for the U. S. Department of Energy under contract DE-AC05-00OR22725. This work used resources at the Radiochemical Engineering Development Center operated by the Oak Ridge National Laboratory.

FundersFunder number
U.S. Department of EnergyDE-AC05-00OR22725
Office of Science
Oak Ridge National Laboratory252
UT-Battelle

    Fingerprint

    Dive into the research topics of 'Pursuit of the Ultimate Regression Model for Samarium(III), Europium(III), and LiCl Using Laser-Induced Fluorescence, Design of Experiments, and a Genetic Algorithm for Feature Selection'. Together they form a unique fingerprint.

    Cite this