Support Vector Machine for Predicting Student Dropout Under Different Normalization Methods
Loading...
Links to Files
Author/Creator
Author/Creator ORCID
Date
2025-01-16
Type of Work
Department
Program
Citation of Original Publication
Boteju, Gehan, Leon Tang, and Michael Scott Brown. "Support Vector Machine for Predicting Student Dropout Under Different Normalization Methods". 2024 IEEE International Conference on Big Data (BigData). December 2024, 8633–36. https://doi.org/10.1109/BigData62323.2024.10825023.
Rights
© 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Abstract
Student dropout in universities brings significant challenges that impacts both individual futures and institutional effectiveness. Early prediction of potential dropouts is crucial for timely intervention, but it is complex because of the nature of the problem influenced by diverse socioeconomic factors. This paper utlizies Support Vector Machines (SVMs) to predict student dropout with an emphasis on exploring the efficacy of various data normalization methods to optimize prediction accuracy. Using a dataset from the UC Irvine repository, this study compares 9 different normalization techniques such as Min Max Scaler, Standard Scaler, and Power Transformer, among others, to determine their impact on the predictive performance of SVMs. Results demonstrate substantial variations in model accuracy depending on the normalization method used to show the importance of detailed selection of data preprocessing techniques. The best normalization method was the One Hot Scaler which produced an average F1 score of 0.779. This work enhances the ability to identify at-risk students earlier but also the understanding of how data normalization influences predictive modeling in educational settings.