Similar Items: Multimodal Learning on Low-Quality Data with Conformal Predictive Self-Calibration
- PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning
- Echo-α: Large Agentic Multimodal Reasoning Model for Ultrasound Interpretation
- OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents
- Rebalancing gradient to improve self-supervised co-training of depth, odometry and optical flow predictions
- Are We Making Progress in Multimodal Domain Generalization? A Comprehensive Benchmark Study
- UnAC: Adaptive Visual Prompting with Abstraction and Stepwise Checking for Complex Multimodal Reasoning