Channels - Unlocking Patch-Level Features for CLIP-Based Class-Incremental Learning :: FRELIP Discovery

Similar Items: Unlocking Patch-Level Features for CLIP-Based Class-Incremental Learning

Quick Look
GMGaze: MoE-Based Context-Aware Gaze Estimation with CLIP and Multiscale Transformer
Quick Look
Few-Shot Learning Pipeline for Monkeypox Skin Disease Classification Using CNN Feature Extractors
Quick Look
Does it Really Count? Assessing Semantic Grounding in Text-Guided Class-Agnostic Counting
Quick Look
VoxCor: Training-Free Volumetric Features for Multimodal Voxel Correspondence
Quick Look
AesRM: Improving Video Aesthetics with Expert-Level Feedback
Quick Look
Exploring the Limits of End-to-End Feature-Affinity Propagation for Single-Point Supervised Infrared Small Target Detection
Quick Look
Learning to Optimize Radiotherapy Plans via Fluence Maps Diffusion Model Generation and LSTM-based Optimization
Quick Look
Personal Visual Context Learning in Large Multimodal Models
Quick Look
Learning Coarse-to-Fine Osteoarthritis Representations under Noisy Hierarchical Labels
Quick Look
Multimodal Learning on Low-Quality Data with Conformal Predictive Self-Calibration
Quick Look
Relit-LiVE: Relight Video by Jointly Learning Environment Video
Quick Look
DPM++: Dynamic Masked Metric Learning for Occluded Person Re-identification
Quick Look
Contrastive Learning under Noisy Temporal Self-Supervision for Colonoscopy Videos
Quick Look
SEMIR: Semantic Minor-Induced Representation Learning on Graphs for Visual Segmentation
Quick Look
FoR-Net: Learning to Focus on Hard Regions for Efficient Semantic Segmentation
Quick Look
Learn where to Click from Yourself: On-Policy Self-Distillation for GUI Grounding
Quick Look
TAVIS: A Benchmark for Egocentric Active Vision and Anticipatory Gaze in Imitation Learning
Quick Look
Weakly Supervised Segmentation as Semantic-Based Regularization
Quick Look
PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning
Quick Look
SAIL: Structure-Aware Interpretable Learning for Anatomy-Aligned Post-hoc Explanations in OCT
Quick Look
Geometry-aware Prototype Learning for Cross-domain Few-shot Medical Image Segmentation
Quick Look
KAN-CL: Per-Knot Importance Regularization for Continual Learning with Kolmogorov-Arnold Networks
Quick Look
ScriptHOI: Learning Scripted State Transitions for Open-Vocabulary Human-Object Interaction Detection
Quick Look
CapVector: Learning Transferable Capability Vectors in Parametric Space for Vision-Language-Action Models