3D Primitives are a Spatial Language for VLMs