Incremental Schema Evolution in Document-oriented Databases

Authors

DOI:

https://doi.org/10.5753/jidm.2026.5700

Keywords:

Document-oriented databases, NoSQL, JSON schema, Incremental schema evolution

Abstract

Document-oriented databases offer high flexibility for storing structured, semi-structured, and unstructured data. However, the absence or obsolescence of predefined schemas can hinder the interpretation and analysis of the stored information. This study proposes an approach for the incremental evolution of schemas in document-oriented databases, aiming to continuously update the schema as new documents are inserted. The proposed solution is fully automated, requiring no manual intervention, and eliminates the need to reprocess the entire dataset. Experimental results, obtained from metadata collections derived from books and Twitter posts, show that the proposed approach achieves performance gains ranging from 1.93 to 2.85 times compared to a traditional batch-based approach.

Downloads

Download data is not yet available.

References

Abdelhédi, F., Rajhi, H., and Zurfluh, G. (2022). Extraction process of the logical schema of a document-oriented nosql database. In Proceedings of the 10th International Conference on Model-Driven Engineering and Software Development (MODELSWARD 2022), pages 61–71. DOI: 10.5220/0010899000003119.

Atmatzides, N., Bedo, M., and de Oliveira, D. (2022). Adoção de sgbds nosql em empresas brasileiras: um levantamento preliminar. In Anais do XXXVII Simpósio Brasileiro de Bancos de Dados, pages 385–390, Porto Alegre, RS, Brasil. SBC. DOI: 10.5753/sbbd.2022.226015.

Baazizi, M.-A., Colazzo, D., Ghelli, G., and Sartiani, C. (2019). Parametric schema inference for massive json datasets. The VLDB Journal, 28:497–521. DOI: 10.1007/s00778-018-0532-7.

Cánovas Izquierdo, J. L. and Cabot, J. (2013). Discovering implicit schemas in json data. In Web Engineering: 13th International Conference, ICWE 2013, Aalborg, Denmark, July 8-12, 2013. Proceedings 13, pages 68–83. Springer. DOI: 10.1007/978-3-642-39200-9_8.

Frozza, A. A., dos Santos Mello, R., and da Costa, F. d. S. (2018). An approach for schema extraction of json and extended json document collections. In IEEE International Conference on Information Reuse and Integration (IRI), pages 356–363. DOI: 10.1109/IRI.2018.00060.

Izquierdo, J. L. C. and Cabot, J. (2016). Jsondiscoverer: Visualizing the schema lurking behind json documents. Knowledge-Based Systems, 103:52–55. DOI: 10.1016/j.knosys.2016.03.020.

Klettke, M., Awolin, H., Störl, U., Müller, D., and Scherzinger, S. (2017). Uncovering the evolution history of data lakes. In 2017 IEEE international conference on big data (Big Data), pages 2462–2471. IEEE. DOI: 10.1109/BigData.2017.8258204.

Latták, I. V. and Koupil, P. (2022). A comparative analysis of json schema inference algorithms. In Proceedings of the 17th International Conference on Evaluation of Novel Approaches to Software Engineering (ENASE 2022), pages 379–386. DOI: 10.5220/0011046000003176.

Li, X., Sun, L., Ling, M., and Peng, Y. (2023). A survey of graph neural network based recommendation in social networks. Neurocomputing, 549:126441. DOI: 10.1016/j.neucom.2023.126441.

Purnomo, Y. J. (2023). Digital marketing strategy to increase sales conversion on e-commerce platforms. Journal of Contemporary Administration and Management (ADMAN), 1(2):54–62. DOI: 10.61100/adman.v1i2.23.

Rodrigues, E., Pires, C. E., and Filho, D. N. (2024). Evolução incremental de esquemas de banco de dados orientado a documentos. In Anais do XXXIX Simpósio Brasileiro de Bancos de Dados, pages 260–273, Porto Alegre, RS, Brasil. SBC. DOI: 10.5753/sbbd.2024.240265.

Sevilla Ruiz, D., Morales, S. F., and García Molina, J. (2015). Inferring versioned schemas from nosql databases and its applications. In Conceptual Modeling: 34th International Conference, ER 2015, Stockholm, Sweden, October 19- 22, 2015, Proceedings 34, pages 467–480. Springer. DOI: 10.1007/978-3-319-25264-3_35.

Downloads

Published

2026-03-13

How to Cite

Monteiro Rodrigues, E., S. Pires, C. E. ., & Cassimiro do N. Filho, D. . (2026). Incremental Schema Evolution in Document-oriented Databases. Journal of Information and Data Management, 17(1), 146–155. https://doi.org/10.5753/jidm.2026.5700

Issue

Section

SBBD 2024 Full papers - Extended papers