Title
Balancing sequential data to predict students at-risk using adversarial networks
Abstract
AbstractHighlights •This study presents a novel adversarial-based approach to up-sample the sequential data in an educational setting.•The proposed method generates new student sequences such that the past behavior of each student is encapsulated in its next sequence.•The data from the Open University (UK) is transformed into a sequential format and used as a case study to eliminate the imbalance in students' academic performances.•The proposed approach outperforms the conventional state-of-the-art Random Over-sampling and Synthetic Minority Over-sampling techniques with an improved AUC of 7.07% and 6.53%, respectively. AbstractClass imbalance is a challenging problem especially in a supervised learning setup, as most classification algorithms are designed for balanced class distributions. Although various up-sampling approaches exist for eliminating the class imbalance, however, they do not handle the complexities of sequential data. In this study, using the data of over 30,000 students from the Open University (UK), we implement a deep-learning-based approach using adversarial networks, Sequential Conditional Generative Adversarial Network (SC-GAN) that encapsulates the past behavior of each student for its previous sequences and generates synthetic student records for the next timestamp. The proposed approach is devised to generate instances, which are augmented with the actual data to eliminate class imbalance. A performance comparison of the proposed SC-GAN with the standard up-sampling methods is also presented and the results validate the proposed method with an improved AUC of 7.07% and 6.53%, respectively, when compared with conventional Random Over-sampling and Sythetic Minority Oversampling techniques.Graphical abstractDisplay Omitted
Year
DOI
Venue
2021
10.1016/j.compeleceng.2021.107274
Periodicals
Keywords
DocType
Volume
Students At-Risk, CGAN, Class Imbalance, Sequential Data, Time-Series, Sythetic Minority Oversampling technique
Journal
93
Issue
ISSN
Citations 
C
0045-7906
0
PageRank 
References 
Authors
0.34
0
7
Name
Order
Citations
PageRank
Hajra Waheed140.74
Muhammad Anas Hasnul200.34
Saeed-Ul Hassan300.34
Naif Radi Aljohani415927.35
Salem Alelyani500.34
Ernest Edifor600.34
Raheel Nawaz735.57