Comparative Analysis of Ensemble Methods for Non-Linear Classification: A Data Mining Approach in Education

Autores/as

  • AHMED EL AOUNG Laboratory of Engineering Sciences, Faculty of Sciences, Ibnou Zohr University, Agadir
  • MOURAD AZHARI Center of Guidance and Educational Planning, Rabat, Morocco
  • MOHAMED HACHIMI Laboratory of Engineering Sciences, Faculty of Sciences, Ibnou Zohr University, Agadir, Morocco.

DOI:

https://doi.org/10.5269/bspm.82810

Resumen

Anticipating students orientation is a real challenge that defies simple linar logic. Educational data is apparently noisy , micsellaneous , and challenging to model using classical approaches. In this study, we test the hypothesis that Ensemble Methods offer a far superior path
to convergence when dealing with this complexity compared to traditional single-model classifiers.
Specifically, we pitted Bagging (Random Forest) and Boosting (Gradient Boosting - GBM) algorithms against standard baselines (SVM, Logistic Regression) using a real-world dataset of 798 high school students from the Souss-Massa region (Morocco). Therefore, not only was the objective to increase the level of accuracy, but to analyze how these models stabilize the loss function across 12 socio-academic variables.
The statistical verdict is clear: ensemble architectures handle non-linearity far more effectively. \textbf{Random Forest} secured the top spot with an Accuracy of 76.6% and an AUC of 0.865, significantly outperforming standalone Decision Trees (73.4%). The GBM concurrently showed exceptional robustness with a stable AUC of 0.859. Most importantly, our feature analysis proves that orientation isn't driven by a simple average grade; instead, Mathematics and Languages act as dominant, non-linear filters. These findings implies that for modeling complex human choices, ensemble learning is technically more reliable than linear approximations.

Referencias

Angeioplastis, A., Aliprantis, J., Konstantakis, M., and Tsimpiris, A. Predicting student performance and enhancing learning outcomes: a data-driven approach using educational data mining techniques. Computers, 14(3), 83, (2025).
Bouihi, B., Bousselham, A., Aoula, E., Ennibras, F., and Deraoui, A. Prediction of higher education student dropout based on regularized regression models. Engineering, Technology and Applied Science Research, 14(6), 17811-17815, (2024).
Damopolii, W. W., Priyasadie, N., and Zahra, A. Educational data mining in predicting student final grades. Int. J, 10(1), (2021).
El Mrabet, H., and Ait Moussa, A. A framework for predicting academic orientation using supervised machine learning. Journal of Ambient Intelligence and Humanized Computing, 14(12), 16539-16549, (2023).
Goh, E., Gallo, R. J., Strong, E., Weng, Y., Kerman, H., Freed, J. A., ... and Rodman, A. GPT-4 assistance for improvement of physician performance on patient care tasks: a randomized controlled trial. Nature Medicine, 31(4), 1233-1238, (2025).
Khosravi, A., and Azarnik, A. Leveraging educational data mining: XGBoost and random forest for predicting student achievement. International Journal of Data Science and Advanced Analytics, 6(2), 387-393, (2024).
Messaoudi, N., Naciri, J.K., Bahloul, B. Students' Results Prediction Using Machine Learning Algorithms and Online Learning during the COVID-19 Pandemic. International Journal of Modern Education and Computer Science (IJMECS), 16(4), 17-34, (2024).
Nafea, A. A., Mishlish, M., AL-Ani, M. M., Alheeti, K. M. A., and Mohammed, H. J. Enhancing Student'sPerformance Classification Using Ensemble Modeling. Iraqi Journal For Computer Science and Mathematics, 4(4), 16, (2023).
Pallathadka, H., Wenda, A., Ramirez-Asís, E., Asís-López, M., Flores-Albornoz, J., and Phasinam, K. Classification and prediction of student performance data using various machine learning algorithms. Materials today: proceedings, 80, 3782-3785, (2023).
Åževgin, H. A comparative study of ensemble methods in the field of education: Bagging and Boosting algorithms. International Journal of Assessment Tools in Education, 10(3), 544-562, (2023).
Tang, B., Li, S., and Zhao, C. Predicting the performance of students using deep ensemble learning. Journal of intelligence, 12(12), 124, (2024).

Descargas

Publicado

2026-07-01

Número

Sección

Conf. Issue: Recent Advances in Applied Mathematics, Modeling, and Engineering

Cómo citar

EL AOUNG, A., AZHARI, M., & HACHIMI, M. (2026). Comparative Analysis of Ensemble Methods for Non-Linear Classification: A Data Mining Approach in Education. Boletim Da Sociedade Paranaense De Matemática, 44(18), 1-9. https://doi.org/10.5269/bspm.82810