Treffer: Text Mining Applied to Distance Higher Education: A Systematic Literature Review

Title:
Text Mining Applied to Distance Higher Education: A Systematic Literature Review
Language:
English
Authors:
Patrícia Takaki (ORCID 0000-0003-4777-9683), Moisés Lima Dutra
Source:
Education and Information Technologies. 2024 29(9):10851-10878.
Availability:
Springer. Available from: Springer Nature. One New York Plaza, Suite 4600, New York, NY 10004. Tel: 800-777-4643; Tel: 212-460-1500; Fax: 212-460-1700; e-mail: customerservice@springernature.com; Web site: https://link.springer.com/
Peer Reviewed:
Y
Page Count:
28
Publication Date:
2024
Document Type:
Fachzeitschrift Journal Articles<br />Information Analyses
Education Level:
Higher Education
Postsecondary Education
DOI:
10.1007/s10639-023-12235-0
ISSN:
1360-2357
1573-7608
Entry Date:
2024
Accession Number:
EJ1430274
Database:
ERIC

Weitere Informationen

Much of the data produced and consumed by students, teachers, and educational managers is in textual format. Text Mining (TM) and Natural Language Processing (NLP) have been applied in the educational context in different ways. Ideally, such applications combine computational, linguistic, pedagogical, and psychological aspects. This article aims to gather and analyze scientific publications that have applied TM and NLP techniques in textual corpora from distance-higher education through a Systematic Literature Review. Eight scientific databases were searched (ACM DL, Scopus, Web of Science, IEEE Xplore, ArXiv, SpringerLink, ScienceDirect, and ERIC), and publications from 2017 to 2021 were selected. 718 unique publications were screened to identify primary research capable of characterizing this scientific area. 52 resulting publications were fully analyzed, and some consolidated results include: 38% of works had the professors as end users, followed by students (27%) and managers (25%); the English language was present in 50% of publications, followed by the Portuguese language (13,5%) and others languages; the text mining tasks most used were text classification (27%), sentiment analysis (17%), information extraction (15%), chatbot (15%) and topic modeling (13%); LDA (Latent Dirichlet Analysis) was the technique most used (19%); the Python language was the programming language most prevalent (42%), and 54% of works do not mention any educational construct or theory. Thus, this article presents an unprecedented overview of the field of Educational Text Mining (ETM) in distance higher education and analyses the main results obtained, aiming for future research in the area.

As Provided