Result: Comparison of Support Vector Machine and Decision Tree Algorithm Performance with Undersampling Approach in Predicting Heart Disease Based on Lifestyle

Title:
Comparison of Support Vector Machine and Decision Tree Algorithm Performance with Undersampling Approach in Predicting Heart Disease Based on Lifestyle
Source:
Journal of Applied Informatics and Computing, Vol 9, Iss 2, Pp 318-327 (2025)
Publisher Information:
Politeknik Negeri Batam, 2025.
Publication Year:
2025
Document Type:
Academic journal Article
ISSN:
2548-6861
DOI:
10.30871/jaic.v9i2.8941
Rights:
CC BY SA
Accession Number:
edsair.doi.dedup.....947cdc19ed62549f0cbbea47e1e117ee
Database:
OpenAIRE

Further Information

Heart disease is one of the leading causes of death in the world with risk factors such as atherosclerosis, high blood pressure, and smoking. Early diagnosis is essential to reduce mortality and improve patients' quality of life. This study evaluates the performance of two machine learning algorithms, namely Support Vector Machine (SVM) and Decision Tree (DT), in predicting heart disease risk by applying undersampling techniques to handle data imbalance. The K-fold cross-validation method with K=10 and hyperparameter tuning were applied to obtain the optimal performance of both models. The results showed that SVM without undersampling achieved 92% accuracy, while with undersampling the accuracy decreased to 76%. DT without undersampling has 91% accuracy, while with undersampling the accuracy reaches 75%. The undersampling technique successfully improved the balance in recognizing minority classes, although it reduced the overall accuracy. This finding confirms that SVM is more reliable in predicting heart disease in datasets with unbalanced class distribution.