Evaluating Data Drift Detection and Its Effects on Machine Learning System Performance

Lucas Helfstein Rocha; Kelly Braghetto

doi:10.5753/jidm.2026.5738

Authors

Lucas Helfstein Rocha Universidade de São Paulo https://orcid.org/0009-0002-0559-4453
Kelly Braghetto IME-USP https://orcid.org/0000-0001-6218-6849

DOI:

https://doi.org/10.5753/jidm.2026.5738

Keywords:

Machine Learning Systems, Data Drift Detection

Abstract

Software systems incorporating machine learning (ML) components are being increasingly deployed across various domains. Unlike traditional systems, ML systems are highly dependent on the quality of their input data, making their performance susceptible to changes in that data. This work explores the potential for improving ML systems by actively monitoring data flow and retraining models in response to drift detection. We begin by evaluating several widely used statistical and distance-based methods for detecting data drift, highlighting their advantages and limitations. Subsequently, we present experimental results using these methods on datasets exhibiting concept drift from the literature, as well as synthetic datasets with data drift. Our findings demonstrate how these techniques can enhance the robustness of ML systems, offering automatic adaptation regardless of the type of drift encountered.

Downloads

Download data is not yet available.

References

Bland, J. M. and Altman, D. G. (1995). Multiple significance tests: the Bonferroni method. Bmj, 310(6973):170.

Bock, R. (2007). MAGIC Gamma Telescope. DOI: 10.24432/C52C8B.

Dasu, T., Krishnan, S., Venkatasubramanian, S., and Yi, K. (2006). An information-theoretic approach to detecting changes in multi-dimensional data streams. In Symposium on the Interface of Statistics, Computing Science, and Applications (Interface).

Ditzler, G. and Polikar, R. (2011). Hellinger distance based drift detection for nonstationary environments. In 2011 IEEE symposium on computational intelligence in dynamic and uncertain environments (CIDUE), pages 41-48.

Gama, J. and Castillo, G. (2006). Learning with local drift detection. In Advanced Data Mining and Applications: Second International Conference, ADMA 2006, Xi'an, China, August 14-16, 2006 Proceedings 2, pages 42-55.

Guyon, I. (2003). Design of experiments of the nips 2003 variable selection benchmark. In NIPS 2003 workshop on feature extraction and feature selection, volume 253, page 40.

Harries, M. (1999). Splice-2 comparative evaluation: electricity pricing. Technical report, The University of New South Wales, Sydney.

Helfstein, L. and Braghetto, K. R. (2024). An empirical analysis of data drift detection techniques in machine learning systems. In Simposio Brasileiro de Banco de Dados (SBBD), pages 40-52. SBC.

Hodges Jr, J. (1958). The significance probability of the Smirnov two-sample test. Arkiv for matematik, 3(5):469-486.

Lu, J., Liu, A., Dong, F., Gu, F., Gama, J., and Zhang, G. (2018). Learning under concept drift: A review. IEEE Transactions on Knowledge and Data Engineering, 31(12):2346-2363.

Perez-Cruz, F. (2008). Kullback-Leibler divergence estimation of continuous distributions. In 2008 IEEE international symposium on information theory, pages 1666-1670.

Rabanser, S., Gunnemann, S., and Lipton, Z. (2019). Failing loudly: An empirical study of methods for detecting dataset shift. Advances in Neural Information Processing Systems, 32.

Schlimmer, J. C. and Granger, R. H. (1986). Incremental learning from noisy data. Machine learning, 1:317-354.

Sculley, D., Holt, G., Golovin, D., Davydov, E., Phillips, T., Ebner, D., Chaudhary, V., Young, M., Crespo, J. F., and Dennison, D. (2015). Hidden technical debt in machine learning systems. Advances in Neural Information Processing Systems, 2015-Janua:2503-2511.

Souza, V. M. A., Reis, D. M., Maletzke, A. G., and Batista, G. E. A. P. A. (2020). Challenges in benchmarking stream learning algorithms with real-world data. Data Mining and Knowledge Discovery, 34:1805-1858. DOI: 10.1007/s10618-020-00698-5.

Street, W. N. and Kim, Y. (2001). A streaming ensemble algorithm (sea) for large-scale classification. In Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, pages 377-382.

Webb, G. I., Hyde, R., Cao, H., Nguyen, H. L., and Petitjean, F. (2016). Characterizing concept drift. Data Mining and Knowledge Discovery, 30(4):964-994.

Evaluating Data Drift Detection and Its Effects on Machine Learning System Performance

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

How to Cite

Issue

Section

Make a Submission

Metrics: