(1) Almeida, T. S.; Nogueira, R.; Pedrini, H. Building High-Quality Datasets for Portuguese LLMs: From Common Crawl Snapshots to Industrial-Grade Corpora. JBCS 2025, 31, 1247-1263.