Similar Items: Extending Environments to Measure Self-reflection in Reinforcement Learning