Similar Items: A CLIP-Based Cross-Modal Matching Model for Image-Text Retrieval
- Cross-View Image Geo-localization Based on Attention Weight Masks
- Unsupervised CycleGAN-Based Model for Optimizing Depth-of-Field Effects in Photographic Image
- Modalities of Data Diplomacy: How Negotiations Shape Data Governance in Practice
- One Single Hub Text Breaks CLIP: Identifying Vulnerabilities in Cross-Modal Encoders via Hubness
- An Effective Object Recognition Algorithm Based on Multi-source Remotely-sensed Image Fusion
- Research on Underground Coal Mine Object Detection Based on Image Enhancement and YOLOv11