This article introduces a model for accurately predicting students' final grades in the CS1 course by utilizing their grades from the first half of the course. The methodology includes three phases: training, testing, and validation, employing four regression algorithms: AdaBoost, Random Forest, Support Vector Regression (SVR), and XGBoost. Notably, the SVR algorithm outperformed the others, achieving an impressive R-squared (R2) value ranging from 72% to 91%. The discussion section focuses on four crucial aspects: the selection of data features and the percentage of course grades used for training, the comparison between predicted and actual values to demonstrate reliability, and the model's performance compared to existing literature models, highlighting its effectiveness.
Keywords: CS1; Course grade; Machine learning; Predicting final grade; Regression model.
©2023 Hidalgo Suarez et al.