Federated Reinforcement Learning for Peak Demand Mitigation in Residential Energy Systems With Dynamic Comfort Preferences