Text this: Cross-layer Attention Sharing for Pre-trained Large Language Models