Similar Items: Efficient Pre-Training with Token Superposition