Similar Items: Perceptual Flow Network for Visually Grounded Reasoning
- Unsupervised Denoising of Real Clinical Low Dose Liver CT with Perceptual Attention Networks
- Large Language Models are Universal Reasoners for Visual Generation
- Beyond Pixel Fidelity: Minimizing Perceptual Distortion and Color Bias in Night Photography Rendering
- UnAC: Adaptive Visual Prompting with Abstraction and Stepwise Checking for Complex Multimodal Reasoning
- Static and Dynamic Graph Alignment Network for Temporal Video Grounding
- BAMI: Training-Free Bias Mitigation in GUI Grounding