Text this: Taming diffusion model for exemplar-based image translation