Evaluating Heterogeneous Node Embedding Compositions Using Diversity Metrics

Authors

DOI:

https://doi.org/10.5753/jidm.2026.5727

Keywords:

Heterogeneous Embedding, Graph Embedding, Heterogeneous Graph, Representation Knowledge, Graph Neural Networks

Abstract

This paper evaluates the impact of different embedding composition strategies on classification performance, analyzing local node features, neighboring node features, and metapaths. We conduct a comprehensive experimental evaluation using an authorial Person Relationships heterogeneous graph, incorporating diversity metrics to assess dataset balance and structural complexity. This approach provides deeper insights into their influence on model effectiveness and extends prior research by comparing new results against an established baseline. The experimental findings reaffirm the effectiveness of embedding compositions, with Aggregated Features + Metapaths achieving a Micro-F1 score of 94.04\%, demonstrating highly accurate results, validated by diversity metrics. This outcome highlights the importance of embedding compositions in heterogeneous graph representations, reinforcing its potential to improve predictive performance in real-world graph structures.

Downloads

Download data is not yet available.

Author Biography

Silvio Fernando Angonese, Universidade Federal do Rio Grande do Sul - UFRGS

Institute of Informatics

References

Angonese, S. and Galante, R. (2024a). AGHE - Approach for Generating Enhanced Heterogeneous Embeddings from Heterogeneous Graphs. In Anais do LI Seminario Integrado de Software e Hardware, pages 252-263, Porto Alegre, RS, Brasil. SBC. DOI: 10.5753/semish.2024.3051.

Angonese, S. and Galante, R. (2024b). Composition of heterogeneous node embeddings - unlocking the power of heterogeneous graph representation. In Anais-XXXIX Simposio Brasileiro de Bancos de Dados, pages 626-638, Porto Alegre, RS, Brasil. SBC. DOI: 10.5753/sbbd.2024.243436.

Angonese, S. F. and Galante, R. (2025). Enhancing Graph Data Quality by Leveraging Heterogeneous Node Features and Embeddings. In Paes, A. and Verri, F. A. N., editors, Intelligent Systems, pages 383-398, Cham. Springer Nature Switzerland.

Dong, Y., Chawla, N. V., and Swami, A. (2017). Meta-Path2Vec: Scalable Representation Learning for Heterogeneous Networks. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '17, page 135-144, New York, NY, USA. Association for Computing Machinery. DOI: 10.1145/3097983.3098036.

Fu, X., Zhang, J., Meng, Z., and King, I. (2020). MAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding. In Proceedings of The Web Conference 2020, WWW '20, page 2331-2341, New York, NY, USA. DOI: 10.1145/3366423.3380297.

Hamilton, W. L., Ying, R., and Leskovec, J. (2017). Inductive Representation Learning on Large Graphs. In Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS'17, page 1025-1035, Red Hook, NY, USA.

Harbecke, D., Chen, Y., Hennig, L., and Alt, C. (2022). Why only micro-f1? class weighting of measures for relation classification.

Kaminskas, M. and Bridge, D. (2016). Diversity, serendipity, novelty, and coverage: A survey and empirical analysis of beyond-accuracy objectives in recommender systems. ACM Transactions on Interactive Intelligent Systems, 7(1):2:1-2:42. DOI: 10.1145/2926720.

Santana, D. and Ribeiro, L. (2023). Approximate similarity joins over dense vector embeddings. In Anais do XXXVIII Simposio Brasileiro de Bancos de Dados, pages 51-62, Porto Alegre, RS, Brasil. SBC. DOI: 10.5753/sbbd.2023.232512.

Sun, Y. and Han, J. (2012). Mining Heterogeneous Information Networks: Principles and Methodologies. Morgan and Claypool Publishers.

Wang, H., Hu, W., Qiu, Z., and Du, B. (2017). Nodes' evolution diversity and link prediction in social networks. IEEE Trans. on Knowl. and Data Eng., 29(10):2263-2274. DOI: 10.1109/TKDE.2017.2728527.

Wang, X., Bo, D., Shi, C., Fan, S., Ye, Y., and Yu, P. S. (2023). A Survey on Heterogeneous Graph Embedding: Methods, Techniques, Applications and Sources. IEEE Transactions on Big Data, 9(2):415-436. DOI: 10.1109/TBDATA.2022.3177455.

Wu, S., Sun, F., Zhang, W., Xie, X., and Cui, B. (2022). Graph Neural Networks in Recommender Systems: A Survey. ACM Comput. Surv., 55(5). DOI: 10.1145/3535101.

Ying, R., He, R., Chen, K., Eksombatchai, P., Hamilton, W. L., and Leskovec, J. (2018). Graph Convolutional Neural Networks for Web-Scale Recommender Systems. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '18, page 974-983, New York, NY, USA. Association for Computing Machinery. DOI: 10.1145/3219819.3219890.

Zhang, C., Song, D., Huang, C., Swami, A., and Chawla, N. V. (2019). Heterogeneous Graph Neural Network. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '19, page 793-803, New York, NY, USA. Association for Computing Machinery. DOI: 10.1145/3292500.3330961.

Zhao, Y., Wang, Y., Liu, Y., Cheng, X., Aggarwal, C. C., and Derr, T. (2025). Fairness and diversity in recommender systems: A survey. ACM Trans. Intell. Syst. Technol., 16(1). DOI: 10.1145/3664928.

Downloads

Published

2026-03-13

How to Cite

Angonese, S. F., & Galante, R. (2026). Evaluating Heterogeneous Node Embedding Compositions Using Diversity Metrics. Journal of Information and Data Management, 17(1), 17–25. https://doi.org/10.5753/jidm.2026.5727

Issue

Section

SBBD 2024 Full papers - Extended papers