Treffer: What Are the Differences between Student and ChatGPT-Generated Pseudocode? Detecting AI-Generated Pseudocode in High School Programming Using Explainable Machine Learning

Title:
What Are the Differences between Student and ChatGPT-Generated Pseudocode? Detecting AI-Generated Pseudocode in High School Programming Using Explainable Machine Learning
Language:
English
Source:
Education and Information Technologies. 2025 30(11):14853-14892.
Availability:
Springer. Available from: Springer Nature. One New York Plaza, Suite 4600, New York, NY 10004. Tel: 800-777-4643; Tel: 212-460-1500; Fax: 212-460-1700; e-mail: customerservice@springernature.com; Web site: https://link.springer.com/
Peer Reviewed:
Y
Page Count:
40
Publication Date:
2025
Sponsoring Agency:
National Science Foundation (NSF)
Office of Elementary and Secondary Education (OESE) (ED), Education Innovation and Research (EIR)
Contract Number:
2201394
S411C230070
Document Type:
Fachzeitschrift Journal Articles<br />Reports - Research
Education Level:
High Schools
Secondary Education
DOI:
10.1007/s10639-025-13385-z
ISSN:
1360-2357
1573-7608
Entry Date:
2025
Accession Number:
EJ1478859
Database:
ERIC

Weitere Informationen

The ability of large language models (LLMs) to generate code has raised concerns in computer science education, as students may use tools like ChatGPT for programming assignments. While much research has focused on higher education, especially for languages like Java and Python, little attention has been given to K-12 settings, particularly for pseudocode. This study seeks to bridge this gap by developing explainable machine learning models for detecting pseudocode plagiarism in online programming education. A comprehensive pseudocode dataset was constructed, comprising 7,838 pseudocode submissions from 2,578 high school students enrolled in an online programming foundations course from 2020 to 2023, along with 6,300 pseudocode samples generated by three versions of ChatGPT. An ensemble model (EM) was then proposed to detect AI-generated pseudocode and was compared with six other baseline models. SHapley Additive exPlanations were used to explain how these models differentiate AI-generated pseudocode from student submissions. The results show that students' submissions have higher similarity with GPT-3 than with the other two GPT models. The proposed model can achieve a high accuracy score of 98.97%. The differences between AI-generated pseudocode and student submissions lies in several aspects: AI-generated pseudocode often begins with more complex verbs and features shorter sentence lengths. It frequently includes clear numerical or word-based indicators of sequence and tends to incorporate more comments throughout the code. This research provides practical insights for online programming and contributes to developing educational technologies and methods that strengthen academic integrity in such courses.

As Provided