Support Vector Machine for Predicting Student Dropout Under Different Normalization Methods

dc.contributor.authorBoteju, Gehan
dc.contributor.authorTang, Leon
dc.contributor.authorBrown, Michael Scott
dc.date.accessioned2025-03-11T14:43:04Z
dc.date.available2025-03-11T14:43:04Z
dc.date.issued2025-01-16
dc.description2024 IEEE International Conference on Big Data (BigData), Washington, DC, USA, 15-18 December 2024
dc.description.abstractStudent dropout in universities brings significant challenges that impacts both individual futures and institutional effectiveness. Early prediction of potential dropouts is crucial for timely intervention, but it is complex because of the nature of the problem influenced by diverse socioeconomic factors. This paper utlizies Support Vector Machines (SVMs) to predict student dropout with an emphasis on exploring the efficacy of various data normalization methods to optimize prediction accuracy. Using a dataset from the UC Irvine repository, this study compares 9 different normalization techniques such as Min Max Scaler, Standard Scaler, and Power Transformer, among others, to determine their impact on the predictive performance of SVMs. Results demonstrate substantial variations in model accuracy depending on the normalization method used to show the importance of detailed selection of data preprocessing techniques. The best normalization method was the One Hot Scaler which produced an average F1 score of 0.779. This work enhances the ability to identify at-risk students earlier but also the understanding of how data normalization influences predictive modeling in educational settings.
dc.description.urihttps://ieeexplore.ieee.org/document/10825023
dc.format.extent4 pages
dc.genreconference papers and proceedings
dc.genrepostprints
dc.identifierdoi:10.13016/m2wejx-whog
dc.identifier.citationBoteju, Gehan, Leon Tang, and Michael Scott Brown. "Support Vector Machine for Predicting Student Dropout Under Different Normalization Methods". 2024 IEEE International Conference on Big Data (BigData). December 2024, 8633–36. https://doi.org/10.1109/BigData62323.2024.10825023.
dc.identifier.urihttp://doi.org/10.1109/BigData62323.2024.10825023
dc.identifier.urihttp://hdl.handle.net/11603/37800
dc.language.isoen_US
dc.publisherIEEE
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Information Systems Department
dc.relation.ispartofUMBC Student Collection
dc.relation.ispartofUMBC History Department
dc.rights© 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.subjectVectors
dc.subjectData Pre-processing
dc.subjectData Scaling
dc.subjectData Normalization
dc.subjectAccuracy
dc.subjectPredicting Student Success
dc.subjectPredictive models
dc.subjectData preprocessing
dc.subjectPower transformers
dc.subjectBig Data
dc.subjectSupport Vector Machine
dc.subjectSocioeconomics
dc.subjectSupport vector machines
dc.subjectStandards
dc.subjectData models
dc.titleSupport Vector Machine for Predicting Student Dropout Under Different Normalization Methods
dc.typeText

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
SVM_NormalizationMethods_5_Extended_Abstract.pdf
Size:
415.02 KB
Format:
Adobe Portable Document Format