Text this: Senses Wide Shut: A Representation-Action Gap in Omnimodal LLMs