Similar Items:
- Building High-Quality Datasets for Portuguese LLMs: From Common Crawl Snapshots to Industrial-Grade Corpora
- Abstractive Summarization with LLMs for Texts in Brazilian Portuguese
- Pt-HotpotQA: Evaluating Multi-Hop Question Answering on Original and Portuguese-translated Datasets Using LLMs
- The Cocoruta Hub: Open and Curated Corpora, Datasets and Language Models on Brazilian Ocean Law
- BRoverbs - Measuring how much LLMs understand Portuguese proverbs
- Rewriting Stories with LLMs: Gender Bias in Generated Portuguese-language Narratives
- Evaluating LLMs on Argument Mining Tasks in Brazilian Portuguese Debate Data