Similar Items: Cross-layer Attention Sharing for Pre-trained Large Language Models
- Optimized graph convolutional shunted self-attention neural network for multilingual speech-to-text training using cross-language voice conversion of speech representations
- Can Large Language Models Generalize Analogy Solving Like Children Can?
- Fine-tuning Large Language Models with Limited Data: A Survey and Practical Guide
- Simulating Hard Attention Using Soft Attention
- ActiveLLM: Large Language Model-Based Active Learning for Textual Few-Shot Scenarios
- Plurilingual and Pluricultural Competence: A Theoretical-Methodological Review for Language Education and Teacher Training