Similar Items: Federated Reinforcement Learning for Peak Demand Mitigation in Residential Energy Systems With Dynamic Comfort Preferences