Locally Differentially Private Applications with Longitudinal Data
DOI:
https://doi.org/10.5753/jidm.2026.5735Keywords:
Local Differential Privacy, Longitudinal Data, Frequency Oracle ProtocolsAbstract
Local differential privacy (LDP) was developed as a version of differential privacy (DP) that does not require a trusted curator or server. Frequency oracles, a class of LDP protocols for frequency estimation, function as the building blocks for diverse applications with LDP guarantees developed for tackling specific tasks such as answering range queries, and frequent item and itemset mining. However, these applications often build on frequency oracles with no adjustments for longitudinal data, and therefore can not provide LDP. In this paper, we investigate the practical effectiveness of state-of-the-art frequency oracle (FO) protocols designed for longitudinal data in various data analysis tasks. Specifically, we implement these protocols to perform three key tasks: answering range queries, identifying frequent items, and detecting frequent itemsets. Additionally, we incorporate post-processing techniques to enhance utility and improve overall performance. Our experimental evaluation includes four real-world datasets from diverse domains, allowing us to systematically measure and compare the utility of longitudinal LDP protocols.
Downloads
References
Arcolezi, H. H., Couchot, J.-F., Al Bouna, B., and Xiao, X. (2022a). Improving the utility of locally differentially private protocols for longitudinal and multidimensional frequency estimates. Digital Communications and Networks.
Arcolezi, H. H., Pinzón, C., Palamidessi, C., and Gambs, S. (2022b). Frequency estimation of evolving data under local differential privacy. arXiv preprint arXiv:2210.00262.
Cao, X., Jia, J., and Gong, N. Z. (2021). Data poisoning attacks to local differential privacy protocols. In 30th USENIX Security Symposium (USENIX Security 21), pages 947–964.
Ding, B., Kulkarni, J., and Yekhanin, S. (2017). Collecting telemetry data privately. In Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS’17, page 3574–3583, Red Hook, NY, USA. Curran Associates Inc.
Duarte Neto, E. R., Costa Filho, J. S., Neto, A. A. M., and Machado, J. C. (2026). Alog: Adaptive longitudinal grids for geospatial data using local differential privacy. In EDBT, pages 1–14.
Dwork, C., McSherry, F., Nissim, K., and Smith, A. (2006). Calibrating noise to sensitivity in private data analysis. In Theory of Cryptography: Third Theory of Cryptography Conference, TCC 2006, New York, NY, USA, March 4-7, 2006. Proceedings 3, pages 265–284. Springer.
Dwork, C., Roth, A., et al. (2014). The algorithmic foundations of differential privacy. Foundations and Trends® in Theoretical Computer Science, 9(3–4):211–407.
Erlingsson, Ú., Pihur, V., and Korolova, A. (2014). Rappor: Randomized aggregatable privacy-preserving ordinal response. In Proceedings of the 2014 ACM SIGSAC conference on computer and communications security, pages 1054–1067.
Filho, J. S. C. and Machado, J. C. (2023). Felip: A local differentially private approach to frequency estimation on multidimensional datasets. In International Conference on Extending Database Technology.
Hong, D., Jung, W., and Shim, K. (2021). Collecting geospatial data with local differential privacy for personalized services. In 2021 IEEE 37th International Conference on Data Engineering (ICDE), pages 2237–2242.
Johnson, N., Near, J. P., and Song, D. (2018). Towards practical differential privacy for sql queries. 11(5).
Kairouz, P., Bonawitz, K., and Ramage, D. (2016). Discrete distribution estimation under local privacy. In International Conference on Machine Learning, pages 2436–2444. PMLR.
Li, J., Gan, W., Gui, Y., Wu, Y., and Yu, P. S. (2022). Frequent itemset mining with local differential privacy. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management, CIKM ’22, page 1146–1155, New York, NY, USA. Association for Computing Machinery.
Li, N., Qardaji, W., Su, D., and Cao, J. (2012). Privbasis: frequent itemset mining with differential privacy. Proc. VLDB Endow., 5(11):1340–1351. DOI: 10.14778/2350229.2350251.
Liu, G., Tang, P., Hu, C., Jin, C., and Guo, S. (2023). Multidimensional data publishing with local differential privacy. In Proceedings 26th International Conference on Extending Database Technology, EDBT 2023, Ioannina, Greece, March 28-31, 2023, pages 183–194. OpenProceedings.org.
Marreiras, A. A., Neto, E. R. D., Costa Filho, J. S., and Machado, J. C. (2024). Locally differentially private and consistent frequency estimation of longitudinal data. In Simpósio Brasileiro de Banco de Dados (SBBD), pages 367–380. SBC.
Marreiras, A. A., et al. (2025). Locally Differentially Private Applications with Longitudinal Data. [Nota: Referência incompleta no original, mantida conforme o padrão dos autores].
Ren, X., Shi, L., Yu, W., Yang, S., Zhao, C., and Xu, Z. (2022). Ldp-ids: Local differential privacy for infinite data streams. In Proceedings of the 2022 International Conference on Management of Data, SIGMOD ’22, page 1064–1077, New York, NY, USA. Association for Computing Machinery.
Team, A. D. P. (2017). Learning with privacy at scale.
Wang, N., Xiao, X., Yang, Y., Zhang, Z., Gu, Y., and Yu, G. (2017a). Privsuper: A superset-first approach to frequent itemset mining under differential privacy. In 2017 IEEE 33rd International Conference on Data Engineering (ICDE), pages 809–820. DOI: 10.1109/ICDE.2017.131.
Wang, T., Blocki, J., Li, N., and Jha, S. (2017b). Locally differentially private protocols for frequency estimation. In 26th USENIX Security Symposium (USENIX Security 17), pages 729–745.
Wang, T., Chen, J. Q., Zhang, Z., Su, D., Cheng, Y., Li, Z., Li, N., and Jha, S. (2021a). Continuous release of data streams under both centralized and local differential privacy. In Proceedings of the 2021 ACM SIGSAC Conference on Computer and Communications Security, CCS ’21, New York, NY, USA. Association for Computing Machinery.
Wang, T., Ding, B., Zhou, J., Hong, C., Huang, Z., Li, N., and Jha, S. (2019a). Answering multi-dimensional analytical queries under local differential privacy. SIGMOD ’19, page 159–176, New York. ACM.
Wang, T., Li, N., and Jha, S. (2018). Locally differentially private frequent itemset mining. In 2018 IEEE Symposium on Security and Privacy (SP), pages 127–143. DOI: 10.1109/SP.2018.00035.
Wang, T., Lopuhaä-Zwakenberg, M., Li, Z., Skoric, B., and Li, N. (2019b). Locally differentially private frequency estimation with consistency. arXiv preprint arXiv:1905.08320.
Wang, T., Zhao, J., Hu, Z., Yang, X., Ren, X., and Lam, K.-Y. (2021b). Local differential privacy for data collection and analysis. Neurocomputing, 426:114–133.
Warner, S. L. (1965). Randomized response: A survey technique for eliminating evasive answer bias. Journal of the American Statistical Association, 60(309):63–69.
Wu, H., Ran, R., Peng, S., Yang, M., and Guo, T. (2023). Mining frequent items from high-dimensional set-valued data under local differential privacy protection. Expert Systems with Applications, 234:121105. DOI: https://doi.org/10.1016/j.eswa.2023.121105.
Xiong, X., Chen, F., Huang, P., Tian, M., Hu, X., Chen, B., and Qin, J. (2018). Frequent itemsets mining with differential privacy over large-scale data. IEEE Access, 6:28877–28889. DOI: 10.1109/ACCESS.2018.2839752.
Yang, J., Wang, T., Li, N., Cheng, X., and Su, S. (2020). Answering multi-dimensional range queries under local differential privacy. Proc. VLDB Endow., 14(3).
Zhang, Z., Wang, T., Li, N., He, S., and Chen, J. (2018). Calm: Consistent adaptive local marginal for marginal release under local differential privacy. CCS ’18, page 212–229, New York, NY, USA. Association for Computing Machinery.
Zhu, Y., Cao, Y., Xue, Q., Wu, Q., and Zhang, Y. (2024). Heavy hitter identification over large-domain set-valued data with local differential privacy. IEEE Transactions on Information Forensics and Security, 19:414–426. DOI: 10.1109/TIFS.2023.3324726.

