Similar Items: Cross-Modal Feature Distillation via Sample-Aware Adaptive Masking