Similar Items:
- Unlocking Patch-Level Features for CLIP-Based Class-Incremental Learning
- GMGaze: MoE-Based Context-Aware Gaze Estimation with CLIP and Multiscale Transformer
- Few-Shot Learning Pipeline for Monkeypox Skin Disease Classification Using CNN Feature Extractors
- Does it Really Count? Assessing Semantic Grounding in Text-Guided Class-Agnostic Counting
- VoxCor: Training-Free Volumetric Features for Multimodal Voxel Correspondence
- AesRM: Improving Video Aesthetics with Expert-Level Feedback
- Exploring the Limits of End-to-End Feature-Affinity Propagation for Single-Point Supervised Infrared Small Target Detection