Channels - When Audio-Language Models Fail to Leverage Multimodal Context for Dysarthric Speech Recognition :: FRELIP Discovery

Similar Items: When Audio-Language Models Fail to Leverage Multimodal Context for Dysarthric Speech Recognition

A Comprehensive Analysis of Tokenization and Self-Supervised Learning in End-to-End Automatic Speech Recognition applied on French Language
PairAlign: A Framework for Sequence Tokenization via Self-Alignment with Applications to Audio Tokenization
When LLMs Stop Following Steps: A Diagnostic Study of Procedural Execution in Language Models
Towards Emotion Consistency Analysis of Large Language Models in Emotional Conversational Contexts
Fuzzy Fingerprinting Encoder Pre-trained Language Models for Emotion Recognition in Conversations: Human Assessment and Validity Study
Tibetan-TTS: Low-Resource Tibetan Speech Synthesis with Large Model Adaptation
GazeVLM: Active Vision via Internal Attention Control for Multimodal Reasoning
STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?
The Impossibility Triangle of Long-Context Modeling
SAM-NER: Semantic Archetype Mediation for Zero-Shot Named Entity Recognition
CC-OCR V2: Benchmarking Large Multimodal Models for Literacy in Real-world Document Processing
Long Context Pre-Training with Lighthouse Attention
Ask Early, Ask Late, Ask Right: When Does Clarification Timing Matter for Long-Horizon Agents?
Implicit Representations of Grammaticality in Language Models
Geometry-Calibrated Conformal Abstention for Language Models
Adapting Large Language Models to a Low-Resource Agglutinative Language: A Comparative Study of LoRA and QLoRA for Bashkir
Unintended Negative Impacts of Promotional Language in Patent Evaluation
The Frequency Confound in Language-Model Surprisal and Metaphor Novelty
Beyond Decodability: Reconstructing Language Model Representations with an Encoding Probe
The Counterexample Game: Iterated Conceptual Analysis and Repair in Language Models
A11y-Compressor: A Framework for Enhancing the Efficiency of GUI Agent Observations through Visual Context Reconstruction and Redundancy Reduction
Automatically Finding and Validating Unexpected Side-Effects of Interventions on Language Models
DPN-LE: Dual Personality Neuron Localization and Editing for Large Language Models
LASE: Language-Adversarial Speaker Encoding for Indic Cross-Script Identity Preservation