Similar Items: CMTA: Leveraging Cross-Modal Temporal Artifacts for Generalizable AI-Generated Video Detection
- Static and Dynamic Graph Alignment Network for Temporal Video Grounding
- Generalizable Sparse-View 3D Reconstruction from Unconstrained Images
- Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation
- FreeSpec: Training-Free Long Video Generation via Singular-Spectrum Reconstruction
- ActCam: Zero-Shot Joint Camera and 3D Motion Control for Video Generation
- UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors