Similar Items: Static and Dynamic Graph Alignment Network for Temporal Video Grounding
- CMTA: Leveraging Cross-Modal Temporal Artifacts for Generalizable AI-Generated Video Detection
- AnchorD: Metric Grounding of Monocular Depth Using Factor Graphs
- Perceptual Flow Network for Visually Grounded Reasoning
- DynoSLAM: Dynamic SLAM with Generative Graph Neural Networks for Real-World Social Navigation
- Relit-LiVE: Relight Video by Jointly Learning Environment Video
- Wasserstein-Aligned Localisation for VLM-Based Distributional OOD Detection in Medical Imaging