Extending Environments to Measure Self-reflection in Reinforcement Learning