Similar Items: Reduced-order Neural Modeling with Differentiable Simulation for High-Detail Tactile Perception
- DINORANKCLIP: DINOv3 Distillation and Injection for Vision-Language Pretraining with High-Order Ranking Consistency
- Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs
- Seeing Realism from Simulation: Efficient Video Transfer for Vision-Language-Action Data Augmentation
- RD-ViT: Recurrent-Depth Vision Transformer for Semantic Segmentation with Reduced Data Dependence Extending the Recurrent-Depth Transformer Architecture to Dense Prediction
- DynoSLAM: Dynamic SLAM with Generative Graph Neural Networks for Real-World Social Navigation
- 6D Pose Estimation via Keypoint Heatmap Regression with RGB-D Residual Neural Networks