Text this: Predicting 3D structure by latent posterior sampling