Similar Items: KL-triggered Continual Adaptation for Nonstationary Resource Allocation: An Off-policy Actor–critic Approach with Nash Social Welfare