Similar Items: Transformer-Based Encoder-Decoder Model for Medical Image Captioning with Concept Embedding
- Deep learning–driven image captioning: Progress through transformers and large language models
- DTSF-CDNet: A Dual-Branch Encoder-Decoder Network With Differential Transformer Skip Fusion for Image Change Detection
- Brazilian Portuguese Image Captioning with Transformers: A Study on Cross-Native-Translated Dataset
- CLIP-Flow: Decoding images encoded in CLIP space
- Efficient Explainable Metric for Semantic Communication of Images Using Image Captioning
- BED-RL: Bagging-Based Encoder–Decoder Reinforcement Learning for Dynamic Portfolio Management