Text this: Exploration Hacking: Can LLMs Learn to Resist RL Training?