Treffer: Visualization of a Machine Learning Framework toward Highly Sensitive Qualitative Analysis by SERS.

Title:
Visualization of a Machine Learning Framework toward Highly Sensitive Qualitative Analysis by SERS.
Authors:
Luo SH; State Key Laboratory for Physical Chemistry of Solid Surfaces, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen, Fujian 361005, China.; State Key Laboratory of Marine Environmental Science, Fujian Provincial Key Laboratory for Coastal Ecology and Environmental Studies, Center for Marine Environmental Chemistry & Toxicology, College of the Environment and Ecology, Xiamen University, Xiamen 361102, China., Wang WL; State Key Laboratory of Marine Environmental Science, Fujian Provincial Key Laboratory for Coastal Ecology and Environmental Studies, Center for Marine Environmental Chemistry & Toxicology, College of the Environment and Ecology, Xiamen University, Xiamen 361102, China., Zhou ZF; State Key Laboratory of Marine Environmental Science, Fujian Provincial Key Laboratory for Coastal Ecology and Environmental Studies, Center for Marine Environmental Chemistry & Toxicology, College of the Environment and Ecology, Xiamen University, Xiamen 361102, China., Xie Y; Fujian Key Laboratory of Sensing and Computing for Smart City, School of Information Science and Engineering, Xiamen University, Xiamen, Fujian 361005, China.; Shenzhen Research Institute of Xiamen University, Xiamen University, Shenzhen 518000, China., Ren B; State Key Laboratory for Physical Chemistry of Solid Surfaces, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen, Fujian 361005, China., Liu GK; State Key Laboratory of Marine Environmental Science, Fujian Provincial Key Laboratory for Coastal Ecology and Environmental Studies, Center for Marine Environmental Chemistry & Toxicology, College of the Environment and Ecology, Xiamen University, Xiamen 361102, China., Tian ZQ; State Key Laboratory for Physical Chemistry of Solid Surfaces, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen, Fujian 361005, China.
Source:
Analytical chemistry [Anal Chem] 2022 Jul 19; Vol. 94 (28), pp. 10151-10158. Date of Electronic Publication: 2022 Jul 06.
Publication Type:
Journal Article; Research Support, Non-U.S. Gov't
Language:
English
Journal Info:
Publisher: American Chemical Society Country of Publication: United States NLM ID: 0370536 Publication Model: Print-Electronic Cited Medium: Internet ISSN: 1520-6882 (Electronic) Linking ISSN: 00032700 NLM ISO Abbreviation: Anal Chem Subsets: MEDLINE
Imprint Name(s):
Original Publication: Washington, American Chemical Society.
Substance Nomenclature:
0 (Polycyclic Aromatic Hydrocarbons)
Entry Date(s):
Date Created: 20220706 Date Completed: 20220720 Latest Revision: 20220801
Update Code:
20250114
DOI:
10.1021/acs.analchem.2c01450
PMID:
35794045
Database:
MEDLINE

Weitere Informationen

Surface-enhanced Raman spectroscopy (SERS), providing near-single-molecule-level fingerprint information, is a powerful tool for the trace analysis of a target in a complicated matrix and is especially facilitated by the development of modern machine learning algorithms. However, both the high demand of mass data and the low interpretability of the mysterious black-box operation significantly limit the well-trained model to real systems in practical applications. Aiming at these two issues, we constructed a novel machine learning algorithm-based framework (Vis-CAD), integrating visual random forest, characteristic amplifier, and data augmentation. The introduction of data augmentation significantly reduced the requirement of mass data, and the visualization of the random forest clearly presented the captured features, by which one was able to determine the reliability of the algorithm. Taking the trace analysis of individual polycyclic aromatic hydrocarbons in a mixture as an example, a trustworthy accuracy no less than 99% was realized under the optimized condition. The visualization of the algorithm framework distinctly demonstrated that the captured feature was well correlated to the characteristic Raman peaks of each individual. Furthermore, the sensitivity toward the trace individual could be improved by least 1 order of magnitude as compared to that with the naked eye. The proposed algorithm distinguished by the lesser demand of mass data and the visualization of the operation process offers a new way for the indestructible application of machine learning algorithms, which would bring push-to-the-limit sensitivity toward the qualitative and quantitative analysis of trace targets, not only in the field of SERS, but also in the much wider spectroscopy world. It is implemented in the Python programming language and is open-source at https://github.com/3331822w/Vis-CAD.