Treffer: Role of Optimal Features Selection with Machine Learning Algorithms for Chest X-ray Image Analysis.

Title:
Role of Optimal Features Selection with Machine Learning Algorithms for Chest X-ray Image Analysis.
Authors:
Manav M; Department of Physics, GLA University, Mathura, Uttar Pradesh, India.; Department of Radiotherapy, S N Medical College, Agra, Uttar Pradesh, India., Goyal M; Department of Physics, GLA University, Mathura, Uttar Pradesh, India., Kumar A; Department of Radiotherapy, S N Medical College, Agra, Uttar Pradesh, India.
Source:
Journal of medical physics [J Med Phys] 2023 Apr-Jun; Vol. 48 (2), pp. 195-203. Date of Electronic Publication: 2023 Jun 29.
Publication Type:
Journal Article
Language:
English
Journal Info:
Publisher: Medknow Publications Country of Publication: India NLM ID: 9441104 Publication Model: Print-Electronic Cited Medium: Print ISSN: 0971-6203 (Print) Linking ISSN: 09716203 NLM ISO Abbreviation: J Med Phys Subsets: PubMed not MEDLINE
Imprint Name(s):
Publication: Mumbai : Medknow Publications
Original Publication: Bombay : The Association, [1994-
References:
Expert Syst Appl. 2021 Nov 30;183:115452. (PMID: 34177133)
AJR Am J Roentgenol. 2020 May;214(5):1078-1082. (PMID: 32108495)
Sci Rep. 2021 May 10;11(1):9887. (PMID: 33972584)
Multimed Tools Appl. 2022;81(10):13731-13750. (PMID: 35221781)
Front Public Health. 2021 Mar 04;9:568603. (PMID: 33748054)
Int J Imaging Syst Technol. 2021 Mar;31(1):5-15. (PMID: 32904960)
J King Saud Univ Comput Inf Sci. 2022 Jun;34(6):3226-3235. (PMID: 38620614)
Sci Rep. 2020 Sep 21;10(1):15364. (PMID: 32958781)
Comput Biol Med. 2021 Dec;139:105002. (PMID: 34749094)
Multimed Tools Appl. 2022;81(19):27631-27655. (PMID: 35368858)
J Med Phys. 2022 Jul-Sep;47(3):279-286. (PMID: 36684704)
JAMA. 2020 Aug 25;324(8):782-793. (PMID: 32648899)
Comput Methods Programs Biomed. 2020 Oct;194:105532. (PMID: 32446037)
Phys Med. 2021 Mar;83:9-24. (PMID: 33662856)
Multimed Tools Appl. 2021;80(13):19931-19946. (PMID: 33686333)
Appl Soft Comput. 2020 Dec;97:106580. (PMID: 32837453)
Clin Transl Med. 2020 Aug;10(4):e158. (PMID: 32898340)
Cognit Comput. 2021 Mar 4;:1-13. (PMID: 33688379)
Expert Syst Appl. 2021 Mar 1;165:113909. (PMID: 32868966)
Comput Biol Med. 2021 May;132:104319. (PMID: 33799220)
Health Inf Sci Syst. 2021 Apr 1;9(1):15. (PMID: 33824721)
Radiology. 2020 Aug;296(2):E113-E114. (PMID: 32105562)
Contributed Indexing:
Keywords: Artificial intelligence; chest X-ray; feature selection; image classification; machine learning
Entry Date(s):
Date Created: 20230814 Latest Revision: 20240923
Update Code:
20250114
PubMed Central ID:
PMC10419742
DOI:
10.4103/jmp.jmp_104_22
PMID:
37576090
Database:
MEDLINE

Weitere Informationen

Introduction: The objective of the present study is to classify chest X-ray (CXR) images into COVID-positive and normal categories with the optimal number of features extracted from the images. The successful optimal feature selection algorithm that can represent images and the classification algorithm with good classification ability has been determined as the result of experiments.
Materials and Methods: This study presented a framework for the automatic detection of COVID-19 from the CXR images. To enhance small details, textures, and contrast of the images, contrast limited adaptive histogram equalization was used. Features were extracted from the first-order statistics, Gray-Level Co-occurrence Matrix, Gray-Level Run Length Matrix, local binary pattern, Law's Texture Energy Measures, Discrete Wavelet Transform, and Zernikes' Moments using an image feature extraction tool "pyFeats. For the feature selection, three nature-inspired optimization algorithms, Grey Wolf Optimization, Particle Swarm Optimization (PSO), and Genetic Algorithm, were used. For classification, Random Forest classifier, K-Nearest Neighbour classifier, support vector machine (SVM) classifier, and light gradient boosting model classifier were used.
Results and Discussion: For all the feature selection methods, the SVM classifier gives the most accurate and precise result compared to other classification models. Furthermore, in feature selection methods, PSO gives the best result as compared to other methods for feature selection. Using the combination of the SVM classifier with the PSO method, it was observed that the accuracy, precision, recall, and F1-score were 100%.
Conclusion: The result of the study indicates that with optimal features with the best choice of the classifier algorithm, the most accurate computer-aided diagnosis of CXR can be achieved. The approach presented in this study with optimal features may be utilized as a complementary tool to assist the radiologist in the early diagnosis of disease and making a more accurate decision.
(Copyright: © 2023 Journal of Medical Physics.)

There are no conflicts of interest. Supplementary Table 1Extracted features from X-raysFeature extracted fromFeatures extracted (features name given here as mentioned in zoofs python library)FOS features“FOS_Mean,” “FOS_Variance,” “FOS_Median,” “FOS_Mode,” “FOS_Skewness,” “FOS_Kurtosis,” “FOS_Energy,” “FOS_Entropy,” “FOS_MinimalGrayLevel,” “FOS_MaximalGrayLevel,” “FOS_CoefficientOfVariation,” “FOS_10Percentile,” “FOS_25Perce-ntile,” “FOS_75Percentile,” “FOS_90Percentile,” “FOS_HistogramWidth”GLCM“GLCM_ASM,” “GLCM_Contrast,” “GLCM_Correlation,” “GLCM_SumOfSquaresVariance,” “GLCM_InverseDifferenceMoment,” “GLCM_SumAverage,” “GLCM_SumVariance,” “GLCM_SumEntropy,” “GLCM_Entropy,” “GLCM_DifferenceVariance,” “GLCM_DifferenceEntropy,” “GLCM_Information1,” “GLCM_Information2,” “GLCM_MaximalCorrelationCoefficient”LTE“LTE_LL_7,” “LTE_EE_7,” “LTE_SS_7,” “LTE_LE_7,” “LTE_ES_7,” “LTE_LS_7”GLRLM“GLRLM_ShortRunEmphasis,” “GLRLM_LongRunEmphasis,” “GLRLM_GrayLevelNo-Uniformity,” “GLRLM_RunLengthNonUniformity”, “GLRLM_RunPercentage”, “GLRLM_LowGrayLevelRunEmphasis”, “GLRLM_HighGrayLevelRunEmphasis,” “GLRLM_Short owGrayLevelEmphasis,” “GLRLM_ShortRunHighGrayLevelEmphasis,” “GLRLM_LongRunLowGrayLevelEmphasis,” “GLRLM_LongRunHighGrayLevelEmphasis”LBP“LBP_R_1_P_8_energy,” “LBP_R_1_P_8_entropy,” “LBP_R_2_P_16_energy,” “LBP_R_2_P_16_entropy,” “LBP_R_3_P_24_energy,” “LBP_R_3_P_24_entropy”DWT“DWT_bior3.3_level_1_da_mean,” “DWT_bior3.3_level_1_da_std,” “DWT_bior3.3_level_1_dd_mean,” “DWT_bior3.3_level_1_dd_std,” “DWT_bior3.3_level_1_ad_mean,” “DWT_bior3.3_level_1_ad_std,” “DWT_bior3.3_level_2_da_mean,” “DWT_bior3.3_level_2_da_std,” “DWT_bior3.3_level_2_dd_mean,” “DWT_bior3.3_level_2_dd_std,” “DWT_bior3.3_level_2_ad_mean,” “DWT_bior3.3_level_2_ad_std,” “DWT_bior3.3_level_3_da_mean,” “DWT_bior3.3_level_3_da_std,” “DWT_bior3.3_level_3_dd_mean,” “DWT_bior3.3_level_3_dd_std,” “DWT_bior3.3_level_3_ad_mean,” “DWT_bior3.3_level_3_ad_std”ZM“zenikes_0,” “zenikes_1,” “zenikes_2,” “zenikes_3,” “zenikes_4,” “zenikes_5,” “zenikes_6,” “zenikes_7,” “zenikes_8,” “zenikes_9,” “zenikes_10,” “zenikes_11,” “zenikes_12,” “zenikes_13,” “zenikes_14,” “zenikes_15,” “zenikes_16,” “zenikes_17,” “zenikes_18,” “zenikes_19,” “zenikes_20,” “zenikes_21,” “zenikes_22,” “zenikes_23,” “zenikes_24”FOS: First-order statistics, GLCM: Gray-level co-occurrence matrix, GLRLM: Gray-level run length matrix, LBP: Local binary pattern, DWT: Discrete wavelet transform, ZM: Zernike’s moments, LTE: Law’s Texture Energy