Text this: ViDR: Grounding Multimodal Deep Research Reports in Source Visual Evidence