Similar Items: CDFNet: Cross‐Modal Deep Fusion for Monocular 3D Semantic Scene Completion