Characterizing the Socioenvironmental and Behavioral Profile of Individuals with OCD Using the PNS 2019 Database

Authors

DOI:

https://doi.org/10.5753/jidm.2026.5780

Keywords:

Machine learning, Health informatics, Obsessive-Compulsive Disorder, Health database

Abstract

The objective of this study is to characterize the profile of individuals diagnosed with Obsessive-Compulsive Disorder (OCD) in the Brazilian population, considering socioenvironmental and behavioral aspects. For this purpose, the 2019 National Health Survey (PNS) database is considered. Based on a knowledge discovery process, including conceptual modeling of the domain for conceptual selection of attributes, the Explainable Boosting Machine (EBM) and Decision Tree algorithms are applied, aiming to identify relevant attributes for the classification of OCD. The results indicate that both aspects improve the model's performance, reaching an average F1-score of 63% (59% for OCD = yes, and 66% for OCD=No). Results consistent with the literature were also found, such as the relationship between OCD and poor sleep quality, diet quality, and mental disorders such as anxiety and depression, among other factors. This study has limitations, such as the use of data that may not accurately reflect socioeconomic and behavioral conditions during the development of OCD. Thus, this study serves as an exploratory guide, capable of identifying profiles more vulnerable to triggers of the disorder, but without the intention of replacing medical or psychological evaluation.

Downloads

Download data is not yet available.

References

American Psychiatric Association (2013). Manual Diagnostico e Estatistico de Transtornos Mentais. Artmed, 5 edition.

Azeredo, C. M., Cotta, R. M. M., Schott, M., Maia, T. d. M., and Marques, E. S. (2007). Avaliacao das condicoes de habitacao e saneamento: a importancia da visita domiciliar no contexto do programa de saude da familia. Ciencia e Saude Coletiva, 12(3):743-753. DOI: 10.1590/S1413-81232007000300025.

Baland, J., Chamberlain, S. R., and Sahakian, B. J. (2023). Obsessive-compulsive disorder: Etiology, neuropathology, and cognitive dysfunction. Brain and Behavior, 13. DOI: 10.1002/brb3.3000.

Bang, L., Kristensen, U. B., Wisting, L., Stedal, K., Garte, M., Ase Minde, and Oyvind Ro (2020). Presence of eating disorder symptoms in patients with obsessive-compulsive disorder. BMC Psychiatry, 20(1):36. DOI: 10.1186/s12888-020-2457-0.

Boger, S., Thomas Ehring, G. B., and Werner, G. G. (2020). Impact of childhood maltreatment on obsessive-compulsive disorder symptom severity and treatment outcome. European Journal of Psychotraumatology, 11(1):1753942. DOI: 10.1080/20008198.2020.1753942.

Brady, S., Brubaker, L., Fok, C., Gahagan, S., Lewis, C. E., Lewis, J., Lowder, J. L., Nodora, J., Stapleton, A., and Palmer, M. H. (2020). Development of conceptual models to guide public health research, practice, and policy: Synthesizing traditional and contemporary paradigms. Health Promotion Practice, 21(4):510-524. DOI: 10.1177/1524839919890869.

Brander, G., Perez-Vigil, A., Larsson, H., and Mataix-Cols, D. (2016). Systematic review of environmental risk factors for obsessive-compulsive disorder: A proposed roadmap from association to causation. Neuroscience and Biobehavioral Reviews, 65:36-62. DOI: 10.1016/j.neubiorev.2016.03.011.

Buss, P. M. and Pellegrini, A. (2007). A saude e seus determinantes sociais. Physis: Revista de Saude Coletiva, 17(1):77-93. DOI: 10.1590/S0103-73312007000100006.

Cao, L. (2010). Domain-Driven Data Mining: Challenges and Prospects. IEEE Transactions on Knowledge & Data Engineering, 22(06):755-769.

Clemmensen, L. K. H., Lonfeldt, N. N., Das, S., Lund, N. L., Uhre, V. F., Mora-Jensen, A.-R. C., Pretzmann, L., Uhre, C. F., Ritter, M., Korsbjerg, N. L. J., Hagstrom, J., Thoustrup, C. L., Clemmesen, I. T., Plessen, K. J., and Pagsberg, A. K. (2022). Associations between the severity of obsessive-compulsive disorder and vocal features in children and adolescents: Protocol for a statistical and machine learning analysis. JMIR Res Protoc, 11(10):e39613. DOI: 10.2196/39613.

Endres, D., Schiele, M., von Zedtwitz, K., Dressle, R., Maier, A., Hohagen, F., Baldermann, J., Coenen, V., Jelinek, L., Domschke, K., and Voderholzer, U. (2025). Obsessive-compulsive disorder - a state-of-the-art review. Neurosci Biobehav Rev., 177:106320. DOI: 10.1016/j.neubiorev.2025.106320.

Fayyad, U., Piatetsky-Shapiro, G., and Smyth, P. (1996). The KDD process for extracting useful knowledge from volumes of data. Commun. ACM, 39(11):27-34. DOI: 10.1145/240455.240464.

Goncalves, L., Franca, D., and Zarate, L. (2024). Relevancia do entendimento do dominio de problema na construcao de modelos computacionais de aprendizado. In Anais do XVIII Brazilian e-Science Workshop, pages 135-142, Porto Alegre, RS, Brasil. SBC. DOI: 10.5753/bresci.2024.240233.

Guyon, I., Sun-Hosoya, L., Boulle, M., Escalante, H. J., Escalera, S., Liu, Z., Jajetic, D., Ray, B., Saeed, M., Sebag, M., Statnikov, A., Tu, W.-W., and Viegas, E. (2019). Analysis of the AutoML Challenge Series 2015-2018, pages 177-219. Springer International Publishing, Cham.

Hiranandani, S., Ipek, S. I., Wilhelm, S., and Greenberg, J. L. (2023). Digital mental health interventions for obsessive compulsive and related disorders: A brief review of evidence-based interventions and future directions. Journal of Obsessive-Compulsive and Related Disorders, 36:100765. DOI: 10.1016/j.jocrd.2022.100765.

Hu, X., Liu, Q., Li, B., Tang, W., Sun, H., Li, F., Yang, Y., Gong, Q., and Huang, X. (2016). Multivariate pattern analysis of obsessive-compulsive disorder using structural neuroanatomy. European Neuropsychopharmacology, 26(2):246-254. DOI: 10.1016/j.euroneuro.2015.12.014.

Hojgaard, D., Arildskov, T. W., Skarphedinsson, G., et al. (2023). Do autistic traits predict outcome of cognitive behavioral therapy in pediatric obsessive-compulsive disorder? Research on Child and Adolescent Psychopathology, 51:1083-1095. DOI: 10.1007/s10802-023-01078-5.

IBGE (2020). Pesquisa nacional de saude 2019 - instituto brasileiro de geografia e estatistica. Disponivel em: https://www.ibge.gov.br/estatisticas/sociais/saude/9160-pesquisa-nacional-de-saude.html. Acesso em: 2024-07-15.

Kalmady, S. V., Paul, A. K., Narayanaswamy, J. C., Agrawal, R., Shivakumar, V., Greenshaw, A. J., Dursun, S. M., Greiner, R., Venkatasubramanian, G., and Reddy, Y. J. (2022). Prediction of obsessive-compulsive disorder: Importance of neurobiology-aided feature design and cross-diagnosis transfer learning. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging, 7(7):735-746. DOI: 10.1016/j.bpsc.2021.12.003.

Kirsten, K., Pfitzner, B., Loper, L., and Arnrich, B. (2021). Sensor-based obsessive-compulsive disorder detection with personalised federated learning. In 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA), pages 333-339. DOI: 10.1109/ICMLA52953.2021.00058.

KNIME (2023). Decision tree learner (3.6.0). Disponivel em: https://hub.knime.com/. Acesso em: 2024-07-12.

Nagata, J. M., Chu, J., Zamora, G., Ganson, K. T., Testa, A., Jackson, D. B., Costello, C. R., Murray, S. B., and Baker, F. C. (2023). Screen time and obsessive-compulsive disorder among children 9-10 years old: A prospective cohort study. Journal of Adolescent Health, 72(3):390-396. DOI: 10.1016/j.jadohealth.2022.10.023.

Pasternak, S. (2016). Habitacao e saude. Estudos Avancados, 30(86):51-66.

Patel, K., Tripathy, A. K., Padhy, L. N., Kar, S. K., Padhy, S. K., and Mohanty, S. P. (2023). Accu-help: A machine-learning-based smart healthcare framework for accurate detection of obsessive compulsive disorder. SN Comput. Sci., 5(1). DOI: 10.1007/s42979-023-02380-1.

Rodrigues, A. P. C. and Zarate, L. E. (2024). Analise dos fatores socioambientais e comportamentais na identificacao do transtorno obsessivo compulsivo: Uma abordagem com dados da pesquisa nacional de saude 2019. In Proceedings of the 39th Brazilian Symposium on Data Bases, pages 78-90, Porto Alegre, RS, Brasil. SBC.

Segalas, C., Labad, J., Salvat-Pujol, N., Real, E., Pino, A., Bertolin, S., Jimenez-Murcia, S., Soriano-Mas, C., Monasterio, C., Menchon, J. M., and Soria, V. (2021). Sleep disturbances in obsessive-compulsive disorder: influence of depression symptoms and trait anxiety. BMC Psychiatry, 21(1):42. DOI: 10.1186/s12888-021-03038-z.

Singh, A., Anjankar, V., and Sapkale, B. (2023). Obsessive-compulsive disorder (OCD): A comprehensive review of diagnosis, comorbidities, and treatment approaches. Cureus, 15:e48960. DOI: 10.7759/cureus.48960.

Souza, V., Nobre, J., and Becker, K. (2020). Characterization of anxiety, depression, and their comorbidity from texts of social networks. In Anais do XXXV Simposio Brasileiro de Bancos de Dados, pages 121-132, Porto Alegre, RS, Brasil. SBC. DOI: 10.5753/sbbd.2020.13630.

Teece, D. (2000). Strategies for managing knowledge assets: The role of firm structure and industrial context. International Journal of Strategic Management, 33:35-54. DOI: 10.1016/S0024-6301(99)00117-X.

The InterpretML Contributors (2023). Explainable boosting machine. Acesso em: 2024-05-23.

Xuanyi, L., Kang, Q., and Gu, H. (2023). A comprehensive review for machine learning on neuroimaging in obsessive-compulsive disorder. Frontiers in Human Neuroscience, 17:1280512. DOI: 10.3389/fnhum.2023.1280512.

Yin, L., Rich, C., Johannes, G., and Giles, H. (2013). Accurate intelligible models with pairwise interactions. In Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 623-631.

Yuan, J., Jin, Z., Guo, H., Jin, H., Zhang, X., Smith, T., and Luo, J. (2020). Constructing biomedical domain-specific knowledge graph with minimum supervision. Knowl Inf Syst, 62:317-336. DOI: 10.1007/s10115-019-01351-4.

Zaboski, B. A., Wilens, A., McNamara, J. P., and Muller, G. N. (2024). Predicting OCD severity from religiosity and personality: A machine learning and neural network approach. Journal of Mood and Anxiety Disorders, 8:100089. DOI: 10.1016/j.xjmad.2024.100089.

Downloads

Published

2026-03-13

How to Cite

Puga Campos Rodrigues, A. ., & Zarate, L. E. (2026). Characterizing the Socioenvironmental and Behavioral Profile of Individuals with OCD Using the PNS 2019 Database. Journal of Information and Data Management, 17(1), 102–111. https://doi.org/10.5753/jidm.2026.5780

Issue

Section

SBBD 2024 Full papers - Extended papers