Language Models for Portuguese

Guest Editors:

Foreword:

We are delighted to present this special issue of the Journal of the Brazilian Computer Society dedicated to Language Models in Portuguese. The recent advancement of large language models (LLMs) has profoundly transformed the way we interact with information, create content, and develop intelligent applications. However, much of this progress has been concentrated in languages with ample data and resource availability, highlighting the need for initiatives focused on languages with less technological coverage, such as Portuguese.

Portuguese is currently the fifth most spoken language in the world, present on multiple continents and cultural contexts, and represents a fertile field for artificial intelligence research. The challenge of building, evaluating, and applying robust language models for Portuguese involves not only technical aspects, such as corpora availability, efficient architectures, and evaluation metrics, but also social, ethical, and cultural dimensions.

This special issue brings together articles that explore different perspectives on the topic, including:

  • Comparative and critical analyses of language models
  • Social, ethical, financial, and ecological issues related to language models
  • Discussion of alternative solutions for language models
  • Domain-specific language models
  • Suitability of narrow language models for specific tasks
  • Multilingual vs. Portuguese-specific models
  • Semantic issues in language models
  • Cultural issues in language models
  • Resources for training language models
  • Language model evaluation

By bringing together contributions from researchers from Brazil and abroad, this edition seeks to strengthen the Natural Language Processing (NLP) community in Portuguese, foster collaborations, and highlight scientific advances that position the Portuguese language as a key player in the era of language models.

We thank the authors who submitted their papers, the reviewers for their thoughtful and generous efforts, and the scientific community that has been mobilizing to consolidate this emerging field. We are convinced that the articles published here will contribute not only to academic advancement but also to the development of more inclusive, sustainable, and culturally sensitive technologies.

Articles:

  • Araújo, F., F. Araújo, D. Alencar, S. Pinheiro, H. Oliveira, and D. Rosário. “Dynamic Video Service Migration in Flying Edge Computing Networks”. Journal of the Brazilian Computer Society, vol. 29, no. 1, Nov. 2023, pp. 63-72. DOI: 10.5753/jbcs.2023.2228.
  • Dalmoro, B. M., and S. R. Musse. "Using Visual Features and Early Views to Classify the Popularity of Facebook Videos". Journal of the Brazilian Computer Society, vol. 28, no. 1, Dec. 2022, pp. 52-58. DOI: 10.5753/jbcs.2022.2216.
  • Haertel, F., L. Camargo, J. Lopes, A. Pernas, F. Mota, J. Barbosa, and A. Yamin. "Helix Project: Exploring the Social Internet of Things (SIoT) in Care of Blind People". Journal of the Brazilian Computer Society, vol. 28, no. 1, Dec. 2022, pp. 26-37. DOI: 10.5753/jbcs.2022.2210.
  • Jiménez, J. G., L. A. P. Paes Leme, and M. A. Casanova. "CoEPinKB: Evaluating Path Search Strategies in Knowledge Bases". Journal of the Brazilian Computer Society, vol. 28, no. 1, Dec. 2022, pp. 13-25. DOI: 10.5753/jbcs.2022.2211.
  • Mendonca-Neto, R., J. Reis, L. Okimoto, D. Fenyö, C. Silva, F. Nakamura, and E. Nakamura. "Classification of Breast Cancer Subtypes: A Study Based on Representative Genes". Journal of the Brazilian Computer Society, vol. 28, no. 1, Dec. 2022, pp. 59-68. DOI: 10.5753/jbcs.2022.2209.
  • Navaux, P. O. A., A. F. Lorenzon, and M. da S. Serpa. “Challenges in High-Performance Computing”. Journal of the Brazilian Computer Society, vol. 29, no. 1, Aug. 2023, pp. 51-62. DOI: 10.5753/jbcs.2023.2219.
  • Pereira, S. S. L., and J. E. B. Maia. "Weakly Supervised Video Anomaly Detection Combining Deep Features With Shallow Neural Networks". Journal of the Brazilian Computer Society, vol. 28, no. 1, Dec. 2022, pp. 69-79. DOI: 10.5753/jbcs.2022.2194.
  • Zahn, N. N., G. P. Dal Molin, and S. R. Musse. "Investigating Sentiments in Brazilian and German Blogs". Journal of the Brazilian Computer Society, vol. 28, no. 1, Dec. 2022, pp. 96-103. DOI: 10.5753/jbcs.2022.2214.