Multi-Axis Speech Similarity via Factor-Partitioned Embeddings