Similar Items:
- Audio-Visual Intelligence in Large Foundation Models
- Large Language Models are Universal Reasoners for Visual Generation
- Agentic AIs Are the Missing Paradigm for Out-of-Distribution Generalization in Foundation Models
- Foundation AI Models for Aerosol Optical Depth Estimation from PACE Satellite Data
- OphMAE: Bridging Volumetric and Planar Imaging with a Foundation Model for Adaptive Ophthalmological Diagnosis
- Quantifying the human visual exposome with vision language models
- Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling