Text this: UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors