Text this: Information theoretic underpinning of self-supervised learning by clustering