Similar Items: Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs
- Representation Fréchet Loss for Visual Generation
- Large Language Models are Universal Reasoners for Visual Generation
- Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling
- Enhancing Visual Question Answering with Multimodal LLMs via Chain-of-Question Guided Retrieval-Augmented Generation
- Perceptual Flow Network for Visually Grounded Reasoning
- Audio-Visual Intelligence in Large Foundation Models