Similar Items: Jailbreaking Vision-Language Models Through the Visual Modality
- Towards Improving Speaker Distance Estimation through Generative Impulse Response Augmentation
- Empowering Heterogeneous Graph Foundation Models via Decoupled Relation Alignment
- SCPRM: A Schema-aware Cumulative Process Reward Model for Knowledge Graph Question Answering
- Learning Multimodal Energy-Based Model with Multimodal Variational Auto-Encoder via MCMC Revision
- Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents
- RHyVE: Competence-Aware Verification and Phase-Aware Deployment for LLM-Generated Reward Hypotheses