Result: Comparing Algorithms for Predictive Data Analytics

Title:
Comparing Algorithms for Predictive Data Analytics
Publisher Information:
Universitat Politècnica de Catalunya 2023-06-26
Document Type:
Electronic Resource Electronic Resource
Availability:
Open access content. Open access content
Open Access
Note:
application/pdf
English
Other Numbers:
HGF oai:upcommons.upc.edu:2117/394524
180254
1409474662
Contributing Source:
UNIV POLITECNICA DE CATALUNYA
From OAIster®, provided by the OCLC Cooperative.
Accession Number:
edsoai.on1409474662
Database:
OAIster

Further Information

The master's degree thesis is composed of theoretical and practical parts. The theoretical part describes the basics of predictive data analytics and machine learning algorithms for classification such as Logistic Regression, Decision Tree, Random Forest, SVM, and KNN. We also describe different evaluation metrics such as Recall, Precision, Accuracy, F1 Score, Cohen's Kappa, Hamming Loss, and Jaccard Index that are used to measure the performance of these algorithms. Additionally, we record the time taken for the training and prediction processes to provide insights into algorithm scalability. The key part master's thesis is the practical part that compares these algorithms with a self-implemented tool that shows results for different evaluation metrics on seven datasets. First, we describe the implementation of an application for testing where we measure evaluation metrics scores. We tested these algorithms on all seven datasets using Python libraries such as scikit-learn. Finally, we analyze the results obtained and provide final conclusions.