Similar Items: GazeVLM: Active Vision via Internal Attention Control for Multimodal Reasoning