Treffer: Predictive Analytics for Student Online Learning Performance Using Machine Learning and Data Mining Techniques

Title:
Predictive Analytics for Student Online Learning Performance Using Machine Learning and Data Mining Techniques
Language:
English
Source:
International Journal on E-Learning. 2024 23(3):269-283.
Availability:
Association for the Advancement of Computing in Education. P.O. Box 719, Waynesville, NC 28786. Tel: 828-246-9558; Fax: 828-246-9557; e-mail: info@aace.org; Web site: http://www.aace.org
Peer Reviewed:
Y
Page Count:
15
Publication Date:
2024
Document Type:
Fachzeitschrift Journal Articles<br />Reports - Research
Education Level:
Higher Education
Postsecondary Education
Geographic Terms:
ISSN:
1537-2456
1943-5932
Entry Date:
2025
Accession Number:
EJ1457663
Database:
ERIC

Weitere Informationen

Machine learning and data mining techniques have been widely used in educational settings to identify the important features that tend to influence students' learning performance and predict their future performance. However, there is little to no research done in the context of Singapore's education. Hence, this study aims to fill the gap by developing an early detection model for weaker students using various machine learning techniques and investigate potential factors that may affect students' learning performance. All the data analysis and model development are done using Python. Firstly, exploratory data analysis is performed to analyse the dataset. Secondly, data pre-processing and feature engineering were performed. Next, three models were developed using decision tree, random forest, and neural network. The performance of the developed models was then evaluated. From our analysis, we found that neural network achieved the highest accuracy of 74.46%. From the confusion matrix, most of the values are within or near the matrix diagonal, indicating that the model is a good fit. Further model improvements could also be done, such as pruning of the decision trees, or the use of ensemble models such as soft voting classifiers to prevent overfitting and improve overall accuracy.

As Provided