Similar Items: Fusion of temporal-spectral features and transformer architectures for automated music mood classification