Text this: Enhancing point cloud processing using audio cues