Model Interpretability in the Educational Data Mining Context: A Systematic Literature Mapping

Authors

DOI:

https://doi.org/10.5753/rbie.2026.6984

Keywords:

Educational Data Mining, Interpretability, Explainability, Explainable AI, Fairness, Systematic Literature Mapping, Systematic Literature Review

Abstract

Machine learning (ML) techniques in the Educational Data Mining (EDM) context enable the development of increasingly efficient prediction models. This evolution benefits from the improvement of ML techniques, the current massive collection of heterogeneous data, and the high availability of computational power. As a result, efficient models are obtained; however, they are complex and challenging to comprehend. Therefore, the field of model interpretability, also known as Explainable Artificial Intelligence (XAI) research, becomes a pivotal piece in consolidating and giving credibility to the solutions developed in the EDM context. In this sense, we presented a systematic mapping to understand how studies in the area of EDM address interpretability, aiming to answer questions related to the (i) interpretability context, (ii) interpretability methods, metrics, and objectives, (iii) education levels, (iv) educational data, and (v) ML techniques. To achieve this goal, we conducted a Systematic Literature Mapping (SLM), which involved defining a research protocol with planning, conducting, and reporting phases. These phases included defining research questions, establishing digital reference libraries, and establishing inclusion and exclusion criteria. The findings indicate that, despite the peculiarities of each study, interpretability is frequently addressed as a post hoc component rather than as a core objective of model design, limiting its systematic evaluation and comparative analysis across models. There is a need for studies in which interpretability is a central objective, including the comparison of interpretability across models from different ML techniques, the exploration or proposal of interpretability metrics (particularly agnostic ones), and the investigation of the relationship between interpretability and algorithmic fairness. Overall, this study offers a comprehensive perspective on the applicability of interpretability methods in various educational contexts, synthesizes best practices and limitations in measuring and comparing model interpretability, and highlights the importance of involving stakeholders in the development of transparent and effective EDM applications.

Downloads

Não há dados estatísticos.

Referências

Abdalkareem, M., & Min-Allah, N. (2024). Explainable models for predicting academic pathways for high school students in saudi arabia. IEEE Access, 12, 30604–30626. https://doi.org/10.1109/ACCESS.2024.3369586 [GS Search].

Adadi, A., & Berrada, M. (2018). Peeking inside the black-box: A survey on explainable artificial intelligence (XAI). IEEE Access, 6, 52138–52160. https://doi.org/10.1109/ACCESS.2018.2870052 [GS Search].

Alamri, R., & Alharbi, B. (2021). Explainable student performance prediction models: A systematic review. IEEE Access, 9, 33132–33143. https://doi.org/10.1109/ACCESS.2021.3061368 [GS Search].

Alharbi, B. (2022). Back to basics: An interpretable multi-class grade prediction framework. Arabian Journal for Science and Engineering, 47(2), 2171–2186. https://doi.org/10.1007/s13369-021-06153-x [GS Search].

Al-Jallad, N. T., Ning, X., Khairalla, M. A., & Al-Qaness, M. A. (2019). Rule mining models for predicting dropout/ stopout and switcher at college using satisfaction and ses features. International Journal of Management in Education, 13(2), 97–118. https://doi.org/10.1504/IJMIE.2019.098182 [GS Search].

Alonso, J. M., & Casalino, G. (2019). Explainable artificial intelligence for human-centric data analysis in virtual learning environments. Communications in Computer and Information Science, 1091, 125–138. https://doi.org/10.1007/978-3-030-31284-8_10 [GS Search].

Alvarez-Garcia, M., Arenas-Parra, M., & Ibar-Alonso, R. (2024). Uncovering student profiles. an explainable cluster analysis approach to PISA 2022. Computers & Education, 223, 105166. https://doi.org/10.1016/j.compedu.2024.105166 [GS Search].

Alwarthan, S., Aslam, N., & Khan, I. U. (2022). An explainable model for identifying at-risk student at higher education. IEEE Access, 10, 107649–107668. https://doi.org/10.1109/ACCESS.2022.3211070 [GS Search].

Araujo, I. (2021). Uma revisão sobre o uso de frameworks de interpretabilidade em aprendizado de máquina. Anais do XIV Encontro Unificado de Computação do Piauí e XI Simpósio de Sistemas de Informação, 105–112. https://doi.org/10.5753/enucompi.2021.17760 [GS Search].

Arévalo-Cordovilla, F. E., & Peña, M. (2024). Comparative analysis of machine learning models for predicting student success in online programming courses: A study based on lms data and external factors. Mathematics, 12(20), 3272. https://doi.org/10.3390/math12203272 [GS Search].

Armstrong, P. (2010). Bloom's taxonomy. Vanderbilt University Center for Teaching, 12(05), 2023. [GS Search].

Aytekin, M. C., & Saygın, Y. (2025). Discovering prerequisite relations using large language models. Interactive Learning Environments, 33(2), 1670–1688. https://doi.org/10.1080/10494820.2024.2375338 [GS Search].

Baker, R. S. J. d. (2010). Data mining. In P. Peterson, E. Baker, & B. McGaw (Eds.), International encyclopedia of education (third edition) (Third Edition, pp. 112–118). Elsevier. https://doi.org/https://doi.org/10.1016/B978-0-08-044894-7.01318-X [GS Search].

Baker, R. S., & Yacef, K. (2009). The state of educational data mining in 2009: A review and future visions. Journal of Educational Data Mining, 1(1), 3–17. https://doi.org/10.5281/zenodo.3554657 [GS Search].

Bakhshinategh, B., Zaiane, O. R., ElAtia, S., & Ipperciel, D. (2018). Educational data mining applications and tasks: A survey of the last 10 years. Education and Information Technologies, 23, 537–553. https://doi.org/10.1007/s10639-017-9616-z [GS Search].

Burkart, N., & Huber, M. F. (2021). A survey on the explainability of supervised machine learning. Journal of Artificial Intelligence Research, 70, 245–317. https://doi.org/10.1613/jair.1.12228 [GS Search].

Carvalho, C., Mattos, J., & Aguiar, M. (2023). Avaliação da interpretabilidade de modelos por meio da clusterização de explicações no contexto da predição de evasão no ensino superior. Anais do XXXIV Simpósio Brasileiro de Informática na Educação, 1191–1201. https://doi.org/10.5753/sbie.2023.234435 [GS Search].

Carvalho, D. V., Pereira, E. M., & Cardoso, J. S. (2019). Machine learning interpretability: A survey on methods and metrics. Electronics, 8(8). https://doi.org/10.3390/electronics8080832 [GS Search].

Cavus, M., & Kuzilek, J. (2024). An effect analysis of the balancing techniques on the counterfactual explanations of student success prediction models. Journal of Measurement and Evaluation in Education and Psychology, 15, 302–317. https://doi.org/10.21031/epod.1526704 [GS Search].

Chau, V. T. N., & Phung, N. H. (2021). A cumulative increasing kemelized nearest-neighbor bagging method for early course-level study performance prediction. 2021 7th International Conference on Engineering, Applied Sciences and Technology (ICEAST), 91–96. https://doi.org/10.1109/iceast52143.2021.9426259 [GS Search].

Chen, F., & Cui, Y. (2020). LogCF: Deep collaborative filtering with process data for enhanced learning outcome modeling. Journal of Educational Data Mining, 12(4), 66–99. https://doi.org/10.5281/zenodo.4399685 [GS Search].

Chen, S., Pian, Y., & Zheng, Y. (2023). Challenges and strategies for designing more effective educational data mining applications. 2023 Twelfth International Conference of Educational Innovation through Technology (EITT), 175–179. https://doi.org/10.1109/EITT61659.2023.00040 [GS Search].

Choi, W. C., Lam, C.-T., & Mendes, A. J. (2024). Analyzing the interpretability of machine learning prediction on student performance using shapley additive explanations. 2024 IEEE International Conference on Teaching, Assessment and Learning for Engineering (TALE), 1–8. https://doi.org/10.1109/TALE62452.2024.10834292 [GS Search].

Chou, T.-N. (2021). Apply explainable AI to sustain the assessment of learning effectiveness. IMCIC 2021 - 12th International Multi-Conference on Complexity, Informatics and Cybernetics, Proceedings, 2, 113–118. [GS Search].

Chou, T.-N. (2023). Apply an integrated responsible AI framework to sustain the assessment of learning effectiveness. International Conference on Computer Supported Education, CSEDU - Proceedings, 2, 142–149. https://doi.org/10.5220/0012058400003470 [GS Search].

Cohausz, L. (2022). Towards real interpretability of student success prediction combining methods of XAI and social science. Proceedings of the 15th International Conference on Educational Data Mining, 361–367. https://doi.org/10.5281/zenodo.6853069 [GS Search].

Colak Oz, H., Güven, Ç., & Nápoles, G. (2023). School dropout prediction and feature importance exploration in Malawi using household panel data: Machine learning approach. Journal of Computational Social Science, 6(1), 245–287. https://doi.org/10.1007/s42001-022-00195-3 [GS Search].

Colpo, M. P., Primo, T. T., & Aguiar, M. S. (2024). Lessons learned from the student dropout patterns on covid-19 pandemic: An analysis supported by machine learning. British Journal of Educational Technology, 55(2), 560–585. https://doi.org/10.1111/bjet.13380 [GS Search].

Deck, L., Schoeffer, J., De-Arteaga, M., & Kühl, N. (2024). A critical survey on fairness benefits of explainable AI. Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, 1579–1595. https://doi.org/10.1145/3630106.3658990 [GS Search].

Dermeval, D., Coelho, J., & Bittencourt, I. I. (2020). Mapeamento sistemático e revisão sistemática da literatura em informática na educação. In P. A. Jaques, S. Siqueira, I. Bittencourt, & M. Pimentel (Eds.), Metodologia de pesquisa científica em informática na educação: Abordagem quantitativa (Vol. 2). SBC. [Link] [GS Search].

Gama Neto, M., Vasconcelos, G., & Zanchettin, C. (2021). Mineração de dados aplicada à predição do desempenho de escolas e técnicas de interpretabilidade dos modelos. Anais do XXXII Simpósio Brasileiro de Informática na Educação, 773–782. https://doi.org/10.5753/sbie.2021.217421 [GS Search].

Gámez-Granados, J. C., Esteban, A., Rodriguez-Lozano, F. J., & Zafra, A. (2023). An algorithm based on fuzzy ordinal classification to predict students' academic performance. Applied Intelligence, 53(22), 27537–27559. https://doi.org/10.1007/s10489-023-04810-2 [GS Search].

Garcia, S., Fernandez, A., Luengo, J., & Herrera, F. (2009). A study of statistical techniques and performance measures for genetics-based machine learning: Accuracy and interpretability. Soft Computing, 13(10), 959–977. https://doi.org/10.1007/s00500-008-0392-y [GS Search].

Garson, G. D. (1991). Interpreting neural-network connection weights. AI Expert, 6(4), 46–51. [GS Search].

Geng, J., Yang, H., & Hu, S. (2023). Noise-filtering enhanced deep cognitive diagnosis model for latent skill discovering. Intelligent Automation and Soft Computing, 37(2), 1311–1324. https://doi.org/10.32604/iasc.2023.038481 [GS Search].

Ghimire, S., Abdulla, S., Joseph, L. P., Prasad, S., Murphy, A., Devi, A., Barua, P. D., Deo, R. C., Acharya, R., & Yaseen, Z. M. (2024). Explainable artificial intelligence-machine learning models to estimate overall scores in tertiary preparatory general science course. Computers and Education: Artificial Intelligence, 7. https://doi.org/10.1016/j.caeai.2024.100331 [GS Search].

Guleria, P., & Sood, M. (2023). Explainable AI and machine learning: Performance evaluation and explainability of classifiers on educational data mining inspired career counseling. Education and Information Technologies, 28(1), 1081–1116. https://doi.org/10.1007/s10639-022-11221-2 [GS Search].

Gupta, A., Garg, D., & Kumar, P. (2022). Mining sequential learning trajectories with hidden markov models for early prediction of at-risk students in e-learning environments. IEEE Transactions on Learning Technologies, 15(6), 783–797. https://doi.org/10.1109/TLT.2022.3197486 [GS Search].

Gusmão, R., Gusmão, C., & Dias, M. (2021). A qualidade da educação para além do IDEB: Um estudo através de técnicas de mineração de dados. Anais do XXXII Simpósio Brasileiro de Informática na Educação, 803–812. https://doi.org/10.5753/sbie.2021.218177 [GS Search].

Hegazi, M. O., & Abugroon, M. A. (2016). The state of the art on educational data mining in higher education. International Journal of Computer Trends and Technology, 31(1), 46–56. [GS Search].

Henseler, J., Hubona, G., & Ray, P. A. (2016). Using PLS path modeling in new technology research: Updated guidelines. Industrial Management & Data Systems, 116(1), 2–20. https://doi.org/10.1108/IMDS-09-2015-0382 [GS Search].

Hooper, S. E., Hecker, K. G., & Artemiou, E. (2023). Using machine learning in veterinary medical education: An introduction for veterinary medicine educators. Veterinary Sciences, 10(9). https://doi.org/10.3390/vetsci10090537 [GS Search].

Hooshyar, D., Huang, Y.-M., & Yang, Y. (2022). A three-layered student learning model for prediction of failure risk in online learning. Human-centric Computing and Information Sciences, 12. https://doi.org/10.22967/HCIS.2022.12.028 [GS Search].

Huang, T., Geng, J., Yang, H., Hu, S., Ou, X., Hu, J., & Yang, Z. (2024). Interpretable neuro-cognitive diagnostic approach incorporating multidimensional features. Knowledge-Based Systems, 304. https://doi.org/10.1016/j.knosys.2024.112432 [GS Search].

Hwang, H., & Takane, Y. (2004). Generalized structured component analysis. Psychometrika, 69(1), 81–99. https://doi.org/10.1007/BF02295841 [GS Search].

Islam, M. R., Nitu, A. M., Marjan, M. A., Uddin, M. P., Afjal, M. I., & Al Mamun, M. A. (2024). Enhancing tertiary students' programming skills with an explainable educational data mining approach. PLoS ONE, 19(9 September). https://doi.org/10.1371/journal.pone.0307536 [GS Search].

Jang, Y., Choi, S., Jung, H., & Kim, H. (2022). Practical early prediction of students' performance using machine learning and eXplainable AI. Education and Information Technologies, 27(9), 12855–12889. https://doi.org/10.1007/s10639-022-11120-6 [GS Search].

Jeon, B., Shafran, E., Breitfeller, L., Levin, J., & Rosé, C. P. (2019). Time-series insights into the process of passing or failing online university courses using neural-induced interpretable student states. Proceedings of the 12th International Conference on Educational Data Mining, 330–335. https://doi.org/10.48550/arXiv.1905.00422 [GS Search].

Joshi, A., Saggar, P., Jain, R., Sharma, M., Gupta, D., & Khanna, A. (2021). CatBoost – an ensemble machine learning model for prediction and classification of student academic performance. Advances in Data Science and Adaptive Analysis, 13(03N04). https://doi.org/10.1142/S2424922X21410023 [GS Search].

Khosravi, H., Shum, S. B., Chen, G., Conati, C., Tsai, Y.-S., Kay, J., Knight, S., Martinez-Maldonado, R., Sadiq, S., & Gaševic, D. (2022). Explainable artificial intelligence in education. Computers and Education: Artificial Intelligence, 3, 100074. https://doi.org/10.1016/j.caeai.2022.100074 [GS Search].

Kitchenham, B. A., & Charters, S. (2007, July). Guidelines for performing systematic literature reviews in software engineering (tech. rep. No. EBSE 2007-001). Keele University and Durham University Joint Report. [Link] [GS Search].

Krathwohl, D. R. (2002). A revision of Bloom's taxonomy: An overview. Theory into practice, 41(4), 212–218. https://doi.org/10.1207/s15430421tip4104_2 [GS Search].

Kumar, P., & Sharma, M. (2020). Predicting academic performance of international students using machine learning techniques and human interpretable explanations using lime—case study of an indian university. Advances in Intelligent Systems and Computing, 1087, 289–303. https://doi.org/10.1007/978-981-15-1286-5_25 [GS Search].

Lallé, S., Yalçın, Ö. N., & Conati, C. (2021). Combining data-driven models and expert knowledge for personalized support to foster computational thinking skills. LAK21: 11th International Learning Analytics and Knowledge Conference, 375–385. https://doi.org/10.1145/3448139.3448175 [GS Search].

Langer, M., Oster, D., Speith, T., Hermanns, H., Kästner, L., Schmidt, E., Sesing, A., & Baum, K. (2021). What do we want from explainable artificial intelligence (XAI)? – a stakeholder perspective on XAI and a conceptual model guiding interdisciplinary XAI research. Artificial Intelligence, 296, 103473. https://doi.org/10.1016/j.artint.2021.103473 [GS Search].

Lee, Y. (2023). Identifying prerequisite courses in undergraduate biology using machine learning. Journal of Data Science, 21(4), 745–760. https://doi.org/10.6339/22-JDS1068 [GS Search].

Lek, S., Delacoste, M., Baran, P., Dimopoulos, I., Lauga, J., & Aulagnier, S. (1996). Application of neural networks to modelling nonlinear relationships in ecology. Ecological modelling, 90(1), 39–52. [GS Search].

Lemay, D. J., & Doleck, T. (2020). Grade prediction of weekly assignments in MOOCS: Mining video-viewing behavior. Education and Information Technologies, 25(2), 1333–1342. https://doi.org/10.1007/s10639-019-10022-4 [GS Search].

Lemay, D. J., & Doleck, T. (2022). Predicting completion of massive open online course (MOOC) assignments from video viewing behavior. Interactive Learning Environments, 30(10), 1782–1793. https://doi.org/10.1080/10494820.2020.1746673 [GS Search].

Linardatos, P., Papastefanopoulos, V., & Kotsiantis, S. (2021). Explainable AI: A review of machine learning interpretability methods. Entropy, 23(1). https://doi.org/10.3390/e23010018 [GS Search].

Liu, F., Bu, C., Zhang, H., Wu, L., Yu, K., & Hu, X. (2024). FDKT: Towards an interpretable deep knowledge tracing via fuzzy reasoning. ACM Transactions on Information Systems, 42(5). https://doi.org/10.1145/3656167 [GS Search].

Liu, F., Zhang, H., & Huang, W. (2023). Interpreting learner success: Enhancing knowledge tracing with attention-based IRT models in modern education. Proceedings of the 13th International Conference on Information Technology in Medicine and Education, ITME 2023, 649–653. https://doi.org/10.1109/ITME60234.2023.00135 [GS Search].

Liu, Q., Wu, R., Chen, E., Xu, G., Su, Y., Chen, Z., & Hu, G. (2018). Fuzzy cognitive diagnosis for modelling examinee performance. ACM Transactions on Intelligent Systems and Technology, 9(4). https://doi.org/10.1145/3168361 [GS Search].

Liu, Q., & Khalil, M. (2024). Explainable AI in learning analytics: Improving predictive models and advancing transparency trust. 2024 IEEE Global Engineering Education Conference (EDUCON), 1–7. https://doi.org/10.1109/EDUCON60312.2024.10578733 [GS Search].

Liu, X., Zambrano, A. F., Baker, R. S., Barany, A., Ocumpaugh, J., Zhang, J., Pankiewicz, M., Nasiar, N., & Wei, Z. (2025). Qualitative coding with GPT-4: Where it works better. Journal of Learning Analytics, 12(1), 169–185. https://doi.org/10.18608/jla.2025.8575 [GS Search].

Livieris, I. E., Karacapilidis, N., Domalis, G., & Tsakalidis, D. (2023). An advanced explainable and interpretable ML-based framework for educational data mining. Lecture Notes in Networks and Systems, 769 LNNS, 87–96. https://doi.org/10.1007/978-3-031-42134-1_9 [GS Search].

Lu, Y., Tong, L., & Cheng, Y. (2024). Advanced knowledge tracing: Incorporating process data and curricula information via an attention-based framework for accuracy and interpretability. Journal of Educational Data Mining, 16(2), 58–84. https://doi.org/10.5281/zenodo.13712553 [GS Search].

Lu, Y., Wang, D., Chen, P., Meng, Q., & Yu, S. (2022). Interpreting deep learning models for knowledge tracing. International Journal of Artificial Intelligence in Education. https://doi.org/10.1007/s40593-022-00297-z [GS Search].

Luo, Y., & Wang, Z. (2024). Feature mining algorithm for student academic prediction based on interpretable deep neural network. 12th International Conference on Information and Education Technology, ICIET 2024, 1–5. https://doi.org/10.1109/ICIET60671.2024.10542709 [GS Search].

Mandalapu, V., Elluri, L., Vyas, P., & Roy, N. (2023). Crime prediction using machine learning and deep learning: A systematic review and future directions. IEEE Access, 11, 60153–60170. https://doi.org/10.1109/ACCESS.2023.3286344 [GS Search].

Matetic, M. (2019). Mining learning management system data using interpretable neural networks. 2019 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), 1282–1287. https://doi.org/10.23919/MIPRO.2019.8757113 [GS Search].

Miller, G. A. (1956). The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psychological review, 63(2), 81. https://doi.org/10.1037/h0043158 [GS Search].

Molnar, C. (2022). Interpretable machine learning: A guide for making black box models explainable (2nd ed.). [Link] [GS Search].

Montavon, G., Lapuschkin, S., Binder, A., Samek, W., & Müller, K.-R. (2017). Explaining nonlinear classification decisions with deep Taylor decomposition. Pattern Recognition, 65, 211–222. https://doi.org/10.1016/j.patcog.2016.11.008 [GS Search].

Nakagawa, H., Iwasawa, Y., & Matsuo, Y. (2021). Graph-based knowledge tracing: Modeling student proficiency using graph neural networks. Web Intelligence, 19(1-2), 87–102. https://doi.org/10.3233/WEB-210458 [GS Search].

Nauck, D. D. (2003). Measuring interpretability in rule-based classification systems. The 12th IEEE International Conference on Fuzzy Systems, 2003. FUZZ'03., 1, 196–201. https://doi.org/10.1109/FUZZ.2003.1209361 [GS Search].

Nnadi, L. C., Watanobe, Y., Rahman, M. M., & John-Otumu, A. M. (2024). Prediction of students' adaptability using explainable AI in educational machine learning models. Applied Sciences, 14(12), 5141. https://doi.org/10.3390/app14125141 [GS Search].

Novillo Rangone, G., Pizarro, C., & Montejano, G. (2022). Automation of an educational data mining model applying interpretable machine learning and auto machine learning. In Á. Rocha, D. Barredo, P. C. López-López, & I. Puentes-Rivera (Eds.), Communication and smart technologies (pp. 22–30). Springer Singapore. https://doi.org/10.1007/978-981-16-5792-4_3 [GS Search].

Ozbayoglu, A. M., Gudelek, M. U., & Sezer, O. B. (2020). Deep learning for financial applications: A survey. Applied Soft Computing, 93, 106384. https://doi.org/10.1016/j.asoc.2020.106384 [GS Search].

Pandian, B. R., Aziz, A. A., Subramaniam, H., & Nawi, H. S. A. (2024). Exploring the role of machine learning in forecasting student performance in education: An in-depth review of literature. Multidisciplinary Reviews, 6, 2023ss043. https://doi.org/10.31893/multirev.2023ss043 [GS Search].

Pantazatos, D., Trilivas, A., Meli, K., Kotsifakos, D., & Douligeris, C. (2024). Machine learning and explainable artificial intelligence in education and training – status and trends. In L. A. Maglaras & C. Douligeris (Eds.), Wireless internet (pp. 110–122). Springer Nature Switzerland. https://doi.org/10.1007/978-3-031-58053-6_8 [GS Search].

Parkavi, R., Karthikeyan, P., & Sheik Abdullah, A. (2024). Enhancing personalized learning with explainable AI: A chaotic particle swarm optimization based decision support system. Applied Soft Computing, 156. https://doi.org/10.1016/j.asoc.2024.111451 [GS Search].

Pei, B., & Xing, W. (2022). An interpretable pipeline for identifying at-risk students. Journal of Educational Computing Research, 60(2), 380–405. https://doi.org/10.1177/07356331211038168 [GS Search].

Peña-Ayala, A. (2014). Educational data mining: Applications and trends (Vol. 524). Springer International Publishing. https://doi.org/10.1007/978-3-319-02738-8 [GS Search].

Pu, Y., Wu, W., Peng, T., Liu, F., Liang, Y., Yu, X., Chen, R., & Feng, P. (2022). Embedding cognitive framework with self-attention for interpretable knowledge tracing. Scientific Reports, 12(1). https://doi.org/10.1038/s41598-022-22539-9 [GS Search].

Qu, Y., Li, F., Li, L., Dou, X., & Wang, H. (2022). Can we predict student performance based on tabular and textual data? IEEE Access, 10, 86008–86019. https://doi.org/10.1109/ACCESS.2022.3198682 [GS Search].

Queiroga, E. M., Santana, D., Silva, M., Aguiar, M., Santos, V., Mello, R. F., Bittencourt, I. I., & Cechinel, C. (2024). Anticipating student abandonment and failure: Predictive models in high school settings. In A. M. Olney, I.-A. Chounta, Z. Liu, O. C. Santos, & I. I. Bittencourt (Eds.), Artificial intelligence in education (pp. 351–364). Springer Nature Switzerland. https://doi.org/10.1007/978-3-031-64302-6_25 [GS Search].

Rachha, A., & Seyam, M. (2023). Explainable AI in education: Current trends, challenges, and opportunities. SoutheastCon 2023, 232–239. https://doi.org/10.1109/SoutheastCon51012.2023.10115140 [GS Search].

Raji, N., Kumar, R. M. S., & Biji, C. (2023). Closing the gap: Exploring the untapped potential of machine learning in deaf students and hearing students' academic performance. International Journal of Advanced Technology and Engineering Exploration, 10(108), 1449–1475. https://doi.org/10.19101/IJATEE.2023.10101685 [GS Search].

Rangone, G. N., Montejano, G. A., Garis, A. G., Pizarro, C. A., & Molina, W. R. (2022). An educational data mining model based on auto machine learning and interpretable machine learning. 2022 IEEE Global Conference on Computing, Power and Communication Technologies (GlobConPT), 1–6. https://doi.org/10.1109/GlobConPT57482.2022.9938243 [GS Search].

Romero, C., & Ventura, S. (2013). Data mining in education. WIREs Data Mining and Knowledge Discovery, 3(1), 12–27. https://doi.org/10.1002/widm.1075 [GS Search].

Romero, C., & Ventura, S. (2020). Educational data mining and learning analytics: An updated survey. WIREs Data Mining and Knowledge Discovery, 10(3), e1355. https://doi.org/10.1002/widm.1355 [GS Search].

Romero, C., & Ventura, S. (2010). Educational data mining: A review of the state of the art. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 40(6), 601–618. https://doi.org/10.1109/TSMCC.2010.2053532 [GS Search].

ŞAHiN, E., Arslan, N. N., & Özdemir, D. (2025). Unlocking the black box: An in-depth review on interpretability, explainability, and reliability in deep learning. Neural Computing and Applications, 37(2), 859–965. https://doi.org/10.1007/s00521-024-10437-2 [GS Search].

Samek, W., Montavon, G., Vedaldi, A., Hansen, L. K., & Müller, K.-R. (2019). Explainable AI: Interpreting, explaining and visualizing deep learning (Vol. 11700). Springer Nature. https://doi.org/10.1007/978-3-030-28954-6 [GS Search].

Schuch, H. S., Furtado, M., Silva, G. F. S., Kawachi, I., Chiavegatto Filho, A. D. P., & Elani, H. W. (2023). Fairness of machine learning algorithms for predicting foregone preventive dental care for adults. JAMA Network Open, 6(11), e2341625–e2341625. https://doi.org/10.1001/jamanetworkopen.2023.41625 [GS Search].

Shen, S., Chen, E., Liu, Q., Huang, Z., Huang, W., Yin, Y., Su, Y., & Wang, S. (2023). Monitoring student progress for learning process-consistent knowledge tracing. IEEE Transactions on Knowledge and Data Engineering, 35(8), 8213–8227. https://doi.org/10.1109/TKDE.2022.3221985 [GS Search].

Silva Filho, R. L. L., Motejunas, P. R., Hipólito, O., & Lobo, M. B. C. M. (2007). A evasão no ensino superior brasileiro. Cadernos de Pesquisa, 37(132), 641–659. https://doi.org/10.1590/S0100-15742007000300007 [GS Search].

Silva Filho, R. L. C., Brito, K., & Adeodato, P. J. L. (2023). A data mining framework for reporting trends in the predictive contribution of factors related to educational achievement. Expert Systems with Applications, 221. https://doi.org/10.1016/j.eswa.2023.119729 [GS Search].

Singelmann, L., Alvarez, E., Swartz, E., Striker, R., Pearson, M., & Ewert, D. (2020). Predicting and understanding success in an innovation-based learning course. Proceedings of the 13th International Conference on Educational Data Mining, EDM 2020, 662–666. [GS Search].

Suaza-Medina, M., Peñabaena-Niebles, R., & Jubiz-Diaz, M. (2024). A model for predicting academic performance on standardised tests for lagging regions based on machine learning and shapley additive explanations. Scientific Reports, 14(1), 25306. https://doi.org/10.1038/s41598-024-76596-3 [GS Search].

Thuy, A., & Benoit, D. F. (2024). Explainability through uncertainty: Trustworthy decision-making with neural networks. European Journal of Operational Research, 317(2), 330–340. https://doi.org/10.1016/j.ejor.2023.09.009 [GS Search].

Tousside, B., Dama, Y., & Frochte, J. (2022). Towards explainability in modern educational data mining: A survey. Proceedings of the 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2022) - KDIR, 212–220. https://doi.org/10.5220/0011529400003335 [GS Search].

Tsiakmaki, M., & Ragos, O. (2021). A case study of interpretable counterfactual explanations for the task of predicting student academic performance. 2021 25th International Conference on Circuits, Systems, Communications and Computers (CSCC), 120–125. https://doi.org/10.1109/CSCC53858.2021.00029 [GS Search].

Van Petegem, C., Deconinck, L., Mourisse, D., Maertens, R., Strijbol, N., Dhoedt, B., De Wever, B., Dawyndt, P., & Mesuere, B. (2023). Pass/fail prediction in programming courses. Journal of Educational Computing Research, 61(1), 68–95. https://doi.org/10.1177/07356331221085595 [GS Search].

Vieira, C. P., & Digiampietri, L. A. (2022). Machine learning post-hoc interpretability: A systematic mapping study. Proceedings of the XVIII Brazilian Symposium on Information Systems. https://doi.org/10.1145/3535511.3535512 [GS Search].

Vultureanu-Albişi, A., & Badică, C. (2021). Improving students' performance by interpretable explanations using ensemble tree-based approaches. 2021 IEEE 15th International Symposium on Applied Computational Intelligence and Informatics (SACI), 215–220. https://doi.org/10.1109/SACI51354.2021.9465558 [GS Search].

Wang, J. Z., Lan, A. S., Grimaldi, P. J., & Baraniuk, R. G. (2017). A latent factor model for instructor content preference analysis. Proceedings of the 10th International Conference on Educational Data Mining, EDM 2017, 290–295. [GS Search].

Wang, Y., Zhang, Y., Liang, M., Yuan, R., Feng, J., & Wu, J. (2023). National student loans default risk prediction: A heterogeneous ensemble learning approach and the shap method. Computers and Education: Artificial Intelligence, 5. https://doi.org/10.1016/j.caeai.2023.100166 [GS Search].

Xiao, W., Ji, P., & Hu, J. (2022). A survey on educational data mining methods used for predicting students' performance. Engineering Reports, 4(5), e12482. https://doi.org/10.1002/eng2.12482 [GS Search].

Zanellati, A., Zingaro, S. P., & Gabbrielli, M. (2024). Balancing performance and explainability in academic dropout prediction. IEEE Transactions on Learning Technologies, 17, 2140–2153. https://doi.org/10.1109/TLT.2024.3425959 [GS Search].

Zhang, S., Wang, J., Yu, S., Wang, R., Han, J., Zhao, S., Liu, T., & Lv, J. (2023). An explainable deep learning framework for characterizing and interpreting human brain states. Medical Image Analysis, 83, 102665. https://doi.org/10.1016/j.media.2022.102665 [GS Search].

Arquivos adicionais

Published

2026-03-26

Como Citar

CARVALHO, C. S.; MATTOS, J. C. B. de; AGUIAR, M. S. de. Model Interpretability in the Educational Data Mining Context: A Systematic Literature Mapping. Revista Brasileira de Informática na Educação, [S. l.], v. 34, p. 314–355, 2026. DOI: 10.5753/rbie.2026.6984. Disponível em: https://journals-sol.sbc.org.br/index.php/rbie/article/view/6984. Acesso em: 1 abr. 2026.

Issue

Section

Artigos